BlockType 'TABLE' is never returned from REST API [117948935]

Assigned

Feature Request

Status Update

No update yet.

Description

pu...@gmail.com

created issue #1

Oct 19, 2018 10:58AM

Please provide as much information as possible. At least, this should include a description of your issue and steps to reproduce the problem. If possible please provide a summary of what steps or workarounds you have already tried, and any docs or articles you found (un)helpful.

Problem you have encountered:
BlockType 'TABLE' is never returned from REST API I receive only TEXT
Attached one of the images i tried to get table from.

What you expected to happen:
To receive a BlockType TABLE

Steps to reproduce:
I run the default example on pyton:
```
from google.cloud import vision
client = vision.ImageAnnotatorClient()

with io.open(path, 'rb') as image_file:
content = image_file.read()

image = vision.types.Image(content=content)

response = client.document_text_detection(image=image)
```

Other information (workarounds you have tried, documentation consulted, etc):

billsTbl.jpg

130 KB

View

Download

Comments

gs...@google.com <gs...@google.com> #2Oct 19, 2018 08:47PM

Assigned to gs...@google.com.

Hello Alexander,

You mention other images of tables; what other images have you tried? Have you tried cases where the table did not fill the entire page, but less than a third? BlockType TABLE is defined as "Table block." on the "Method: images.annotate" documentation page. [1]

What extra information would this BlockType value as TABLE would bring for you? In fact, you are already aware that what you submit to the API call is the image of a table.

A few images that you tried would be of help in reproducing this issue on our side. The image you attached is particular in that the table fills the entire page.

[1]

https://cloud.google.com/vision/docs/reference/rest/v1/images/annotate#blocktype

pu...@gmail.com <pu...@gmail.com> #3Oct 22, 2018 05:56PM

Hi,

I cropped the table from full invoice in order to have small result data set for testing porposes, usually the table comes integrated in bigger document.
I need a block_type value because:
1.I want to get table and other data from full invoice and parse them separately with different logic.See bill.jpg
2.I have invoices with multiple tables on them and additional data so i need to distinguish the tables(or table like structures) and parse them accordingly. See super.jpg

I am attaching another image, in all those 3 images i get only block_type = 1 for all blocks.

I am testing with code that taken from

https://cloud.google.com/vision/docs/fulltext-annotations
just adding the print under the block 'for'.
```
for block in page.blocks:
print('BLOCK TYPE: ' + str(block.block_type))
```

bill.jpg

362 KB

View

Download

super.jpg

98 KB

View

Download

fly.jpg

357 KB

View

Download

gs...@google.com <gs...@google.com> #4Oct 23, 2018 05:57PM

Reassigned to gc...@google.com.

Your specific issue has been made known to Engineering, who will address it in due course. No estimated time to resolution has been set. Meanwhile, you may follow developments in this thread.

gs...@google.com <gs...@google.com> #5Feb 6, 2019 06:08PM

Status: Won't Fix (Intended Behavior)

OCR does not provide "table" block type. A table parsing enhancement to DOCUMENT_TEXT_DETECTION might be implemented in future. As yet, there is no established release date. Progress with get recorded in this thread.

jm...@google.com <jm...@google.com> Feb 13, 2019 07:09PM

Status: Assigned (reopened)

ja...@diverseprogrammers.com <ja...@diverseprogrammers.com> #6Nov 30, 2021 06:10PM

Hi, any update on this specifically for Google Cloud Vision API?

Referring to:

https://cloud.google.com/vision/docs/reference/rpc/google.cloud.vision.v1#blocktype

[Deleted User] <[Deleted User]> #7Dec 23, 2021 08:30AM

Suprisingly, the REST API doesn't provide structure information or blocktype while rpc does, I wonder why is that the case.

js...@gmail.com <js...@gmail.com> #8Nov 3, 2022 02:48PM

Any update on this?

se...@doncelsoft.com <se...@doncelsoft.com> #9Nov 3, 2022 11:23PM

I'm also interested on this

sr...@gmail.com <sr...@gmail.com> #10Dec 20, 2022 02:53PM

I am also interested in this, AWS Textract is giving option to find the table in the images. Can we expect the same here also

Message last modified on Dec 20, 2022 02:54PM

hr...@gmail.com <hr...@gmail.com> #11Jul 25, 2023 01:22PM

I am also interested in this, I am trying to compare this with Azure and AWS.

sw...@gmail.com <sw...@gmail.com> #12Sep 18, 2023 12:55PM

Hello Google, Why is there no update on this issue? Please update!

ni...@aramix.ai <ni...@aramix.ai> #13Sep 20, 2023 03:35PM

I am interested too

fr...@aramix.ai <fr...@aramix.ai> #14Sep 22, 2023 07:11AM

I am interested too

Issue 117948935

Description

Issue summary

Comments

gs...@google.com <gs...@google.com> #2Oct 19, 2018 08:47PM

pu...@gmail.com <pu...@gmail.com> #3Oct 22, 2018 05:56PM

gs...@google.com <gs...@google.com> #4Oct 23, 2018 05:57PM

gs...@google.com <gs...@google.com> #5Feb 6, 2019 06:08PM

jm...@google.com <jm...@google.com> Feb 13, 2019 07:09PM

ja...@diverseprogrammers.com <ja...@diverseprogrammers.com> #6Nov 30, 2021 06:10PM

[Deleted User] <[Deleted User]> #7Dec 23, 2021 08:30AM

js...@gmail.com <js...@gmail.com> #8Nov 3, 2022 02:48PM

se...@doncelsoft.com <se...@doncelsoft.com> #9Nov 3, 2022 11:23PM

sr...@gmail.com <sr...@gmail.com> #10Dec 20, 2022 02:53PM

hr...@gmail.com <hr...@gmail.com> #11Jul 25, 2023 01:22PM

sw...@gmail.com <sw...@gmail.com> #12Sep 18, 2023 12:55PM

ni...@aramix.ai <ni...@aramix.ai> #13Sep 20, 2023 03:35PM

fr...@aramix.ai <fr...@aramix.ai> #14Sep 22, 2023 07:11AM

Add comment

Issue metadata