Assigned
Status Update
Comments
ja...@gmail.com <ja...@gmail.com> #2
I have informed our engineering team of this feature request. There is currently no ETA for its implementation.
A current workaround would be to check the returned "boundingPoly" [1] "vertices" for the returned "textAnnotations". If the calculated rectangle's heights > widths, than your image is sideways.
[1]https://cloud.google.com/vision/reference/rest/v1/images/annotate#boundingpoly
A current workaround would be to check the returned "boundingPoly" [1] "vertices" for the returned "textAnnotations". If the calculated rectangle's heights > widths, than your image is sideways.
[1]
ji...@google.com <ji...@google.com> #3
I also need this problem solved :)
ja...@gmail.com <ja...@gmail.com> #4
same :D
ji...@google.com <ji...@google.com> #5
+1
ja...@gmail.com <ja...@gmail.com> #6
+1
ji...@google.com <ji...@google.com> #7
This needs more attention. It's not just a display issue as described in the report. The co-ordinates returned in 'boundingPoly' are incorrect if the image was taken on a phone. All the x points should be y and vice versa.
The workaround does not make sense as "boundingPoly" [1] "vertices" for "textAnnotations" does not indicate the image dimensions - it indicates the dimensions of the relevant text block inside the image.
The workaround does not make sense as "boundingPoly" [1] "vertices" for "textAnnotations" does not indicate the image dimensions - it indicates the dimensions of the relevant text block inside the image.
mb...@gmail.com <mb...@gmail.com> #8
+1
ki...@gmail.com <ki...@gmail.com> #9
Would be great if this could be implemented.
lu...@gmail.com <lu...@gmail.com> #10
+1
vi...@gmail.com <vi...@gmail.com> #11
+1
vi...@gmail.com <vi...@gmail.com> #12
+1
oc...@gmail.com <oc...@gmail.com> #13
+1.
aa...@scious.io <aa...@scious.io> #14
The rotation information should already be available, basically the order of the bounding box vertices encode that rotation information:
https://cloud.google.com/vision/docs/reference/rest/v1/images/annotate#block
Could you please test and see if that works for your case?
Could you please test and see if that works for your case?
lu...@gmail.com <lu...@gmail.com> #15
Each bounding box on the page can have a different orientation, so it can be frustrating to figure out.
ne...@bergmann-infotech.de <ne...@bergmann-infotech.de> #16
+1
mi...@mcfile.com.br <mi...@mcfile.com.br> #17
"A current workaround would be to check the returned "boundingPoly" [1] "vertices" for the returned "textAnnotations". If the calculated rectangle's heights > widths, than your image is sideways. " - The proposed workaround does not work if the image needs to be rotated at 180 degrees. Any ETA after 2y and 1 day?
an...@gmail.com <an...@gmail.com> #18
+1
vd...@gmail.com <vd...@gmail.com> #19
+1
zy...@gmail.com <zy...@gmail.com> #20
+1
mo...@gmail.com <mo...@gmail.com> #21
+1
ca...@gmail.com <ca...@gmail.com> #22
Any ETA after such long time.
mo...@gmail.com <mo...@gmail.com> #23
We just prioritized this work and tentatively aiming for August for releasing various OCR improvements including orientation detection.
ac...@gmail.com <ac...@gmail.com> #24
Great stuff, really good to hear, this will have some really useful applications!
m....@gmail.com <m....@gmail.com> #25
Just to gather some details, this feature request is for *page* level orientation information? Like a enum UP DOWN LEFT RIGHT. Is orientation also desired for individual blocks, paragraphs, words, symbols, etc?
Also, this new orientation info will only apply to the DOCUMENT_TEXT_DETECTION feature.
Also, this new orientation info will only apply to the DOCUMENT_TEXT_DETECTION feature.
al...@gmail.com <al...@gmail.com> #26
Comment has been deleted.
na...@gmail.com <na...@gmail.com> #27
I think there should be an option field where you can decide if the orientation should also be given for individual blocks, words etc.
ga...@gmail.com <ga...@gmail.com> #28
Ideally we would have word-level information around rotation as well
sg...@gmail.com <sg...@gmail.com> #29
Great to hear that this feature will be added in the near future!
ja...@gmail.com <ja...@gmail.com> #30
+1 please!
si...@google.com <si...@google.com>
si...@google.com <si...@google.com>
lu...@gmail.com <lu...@gmail.com> #31
+1
ja...@gmail.com <ja...@gmail.com> #32
+1
je...@gmail.com <je...@gmail.com> #33
+1
Description
What you would like to accomplish:
Google's OCR API returns a json back when you upload a pdf. The JSON is the only export format. A PDF export option will be great and this is what commercial OCRs like abbyy offer. A textual extraction loses original pdf's formatting and text positioning information. Exporting to PDF will help me use the pdf for further processing such as converting it to another format/ editing the pdf.
How this might work:
A pdf export will help position Google OCR as a fully capable commercial offering. Currently, the accuracy of text extraction is on par or sometimes even better than other commercial offerings, but the export to text/json only can greatly add to the API.
If applicable, reasons why alternative solutions are not sufficient:
The information in json is not enough for me to build a pdf from scratch because vital information such as font formatting is already lost in the process of Google OCR.
Other information (workarounds you have tried, documentation consulted, etc):
I've tried different parameters in Google OCR, such as using TEXT_EXTRACTION, but it jumbles the order of the document text.