mi...@google.com <mi...@google.com> #2
I have informed our engineering team of this feature request. There is currently no ETA for its implementation.
A current workaround would be to check the "vertices" of the "boundingPoly" [1] returned for the "textAnnotations". If the calculated rectangle's height is greater than its width, then your image is sideways.
[1]https://cloud.google.com/vision/reference/rest/v1/images/annotate#boundingpoly
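As a rough illustration of that workaround (a hypothetical sketch, not part of the client library — `looks_sideways` and the sample vertices are invented), the aspect-ratio check could look like:

```python
# Sketch of the suggested workaround: compare the width and height of a
# textAnnotation's boundingPoly to guess whether the image is rotated.
# Pure Python, no API call; vertices are given as (x, y) tuples.
def looks_sideways(vertices):
    xs = [x for x, _ in vertices]
    ys = [y for _, y in vertices]
    width = max(xs) - min(xs)
    height = max(ys) - min(ys)
    return height > width

# A bounding box that is taller than it is wide -> probably sideways.
print(looks_sideways([(10, 5), (60, 5), (60, 205), (10, 205)]))  # True
```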
sa...@gmail.com <sa...@gmail.com> #3
I also need this problem solved :)
mi...@google.com <mi...@google.com> #4
same :D
da...@gmail.com <da...@gmail.com> #5
+1
ls...@gmail.com <ls...@gmail.com> #6
+1
kr...@softagent.se <kr...@softagent.se> #7
This needs more attention. It's not just a display issue as described in the report: the coordinates returned in 'boundingPoly' are incorrect if the image was taken on a phone. All of the x values should be y values and vice versa.
The workaround does not make sense, as the "boundingPoly" [1] "vertices" for "textAnnotations" do not indicate the image dimensions; they indicate the dimensions of the relevant text block inside the image.
ve...@gmail.com <ve...@gmail.com> #8
+1
Description
--
- Problem you have encountered: When using OCR to detect text in images [1] with the Python client library, the confidence field is not set properly.
- What you expected to happen: The confidence field should have the same value in the "Try the API" option and when performing the recognition with the Python client library.
- Steps to reproduce:
You can check this easily with any image. Run the following function with any image URL:
```
from google.cloud import vision


def detect_text_uri(uri):
    client = vision.ImageAnnotatorClient()
    image = vision.types.Image()
    image.source.image_uri = uri

    response = client.text_detection(image=image)
    texts = response.text_annotations
    print('Texts:')
    for text in texts:
        print('\n"{}"'.format(text.description))
        vertices = (['({},{})'.format(vertex.x, vertex.y)
                     for vertex in text.bounding_poly.vertices])
        print('bounds: {}'.format(','.join(vertices)))
        print('confidence: {}'.format(text.confidence))

    if response.error.message:
        raise Exception(
            '{}\nFor more info on error messages, check: '
            'https://cloud.google.com/apis/design/errors'.format(
                response.error.message))
```
You will see that the confidences are 0.0.
Then use "Try the API" in [2] with the same image. You will see non-zero confidences.
- Other information (workarounds you have tried, documentation consulted, etc):
I have also checked that the same happens with a local image.
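For what it's worth, a commonly suggested workaround for this is to read per-block confidence from `response.full_text_annotation` rather than from `text_annotations`. Below is a minimal sketch of that traversal; to keep it runnable without an API call it walks plain dicts mirroring the `fullTextAnnotation` shape, and the sample values are invented:

```python
# Hypothetical workaround sketch: fullTextAnnotation carries confidence on
# its pages/blocks even when textAnnotations[i].confidence reads 0.0.
# Shown here on dicts shaped like the fullTextAnnotation response.
def block_confidences(full_text_annotation):
    return [
        block["confidence"]
        for page in full_text_annotation["pages"]
        for block in page["blocks"]
    ]

sample = {"pages": [{"blocks": [{"confidence": 0.97}, {"confidence": 0.88}]}]}
print(block_confidences(sample))  # [0.97, 0.88]
```

With the real client, the same traversal would iterate `response.full_text_annotation.pages` and each page's `blocks` attribute instead of dict keys.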
[1]
[2]