Status Update
Comments
at...@google.com <at...@google.com> #2
When i am trying to access Google Translate API using javascript when the query string size is around 13000 characters i am getting server response error code 400. Can you please let me know what is the request size limit if any exist.
da...@gear4music.com <da...@gear4music.com> #3
according to Google API FAQ
The maximum size of each text to be translated is 5000 characters, not including any HTML tags.
al...@gmail.com <al...@gmail.com> #4
em...@tomas-novotny.eu <em...@tomas-novotny.eu> #5
zo...@gmail.com <zo...@gmail.com> #6
ro...@gmail.com <ro...@gmail.com> #7
In some context:
<a id="text">bla bla...</a> -> <a id="text">alb</a>alb<a id="text">...</a>
after change anchor tag to 【】,it worked:
【bla bla ...】 -> 【alb alb ...】
mo...@gmail.com <mo...@gmail.com> #8
Input: <span id='1'>Per </span><span id='2'>Web-ERV</span><span id='3'>Handelsgericht Wien</span><span id='4'>Marxergasse 1a</span><span id='5'>1030 Wien</span><span id='6'>Firmenbuchsache</span>
Output: <span id='1'>Via</span> <span id='2'>web ERV</span> <span id='3'>Vienna Commercial Court</span> <span id='4'>Marxergasse 1a</span> <span id='5'>1030 <span id='6'>Wien</span></span> <span id='6'>Commercial thing</span>
Notice above in the output from the API, there is now an extra <span id="6"> now nested in <span id="5">, around partial content of <span id="5">.. This is completely unacceptable for a paid service, html translation is completely unusable with this bug.
ma...@gmail.com <ma...@gmail.com> #9
any update on this one?
sa...@gmail.com <sa...@gmail.com> #10
This issue is also affecting a user in case number 17353456.
Is there any update?
zo...@gmail.com <zo...@gmail.com> #11
to...@gmail.com <to...@gmail.com> #12
I kind of "solved" this by replacing my placeholders with some utf8 encoded string.
I was having trouble with strings like: "{s} Lorem Ipsum is simply dummy text of the printing and typesetting industry." but if I encode it like "{123_115_125} Lorem Ipsum is simply dummy text of the printing and typesetting industry." it seems to work fine.
Of course it's kind of cheating but it worked so far.
jo...@kognity.com <jo...@kognity.com> #13
<p class="tile-land-image">
<a href="../tutorials/1_overview.htm">
<img class="tile-image" src="../Resources/images/thumbnails/landing_environs.png"></a> </img>
</p>
sh...@duve.com <sh...@duve.com> #14
ja...@massivemedia.eu <ja...@massivemedia.eu> #16
Part of html Sent for translation:
<a class="d-flex align-items-center text-decoration-none" href="/customers">
<div class="ml-4">
<h6 class="font-weight-bold text-uppercase text-primary mb-0">Customer</h6>
</div>
</a>
Part of html received after translation:
<div class="ml-4">
<h6 class="font-weight-bold text-uppercase text-primary mb-0"> Client </h6>
</div>
This can be reproduced following the code from the Python guide[1]. The library version used is google-cloud-translate==2.0.0
[1]
nw...@gmail.com <nw...@gmail.com> #17
cz...@google.com <cz...@google.com>
se...@google.com <se...@google.com> #18
th...@gmail.com <th...@gmail.com> #19
ym...@gmail.com <ym...@gmail.com> #20
pe...@psmits.com <pe...@psmits.com> #21
It appears that if the verb includes the % it will be broken into "% 1 $ s"
How can I prevent this from happening ?
yi...@google.com <yi...@google.com> #22
From the thread, there are two types of space issue, one is introduced between html tags, another is between some special characters. I made some improvements on the html one, the special characters one will be in prod later.
Now the html space will not have more than one space between two plain words,
Before: <a>Text</a> <p> Hallo <a>Text</a> </p>
Now: <a>Text</a> <p>Hallo <a>Text</a></p>
It may not respect to the source sentence, but it can guarantee the browser can show correct sentences.
te...@synology.com <te...@synology.com> #23
API version:
"google/apiclient": "^2.0",
"google/cloud-automl": "^0.10.0",
"google/cloud-translate": "^1.7"
Example:
Source language:
<li>If you must access file services over the Internet, it is strongly recommended that you use a VPN to connect to your device (see <a href="
target language:
<li>Si vous devez accéder aux services de fichiers sur Internet, il est fortement recommandé d'utiliser un VPN pour vous connecter à votre périphérique (voir <a href="
What we expected to happen:
<li>Si vous devez accéder aux services de fichiers sur Internet, il est fortement recommandé d'utiliser un VPN pour vous connecter à votre périphérique (voir <a href="
ap...@gmail.com <ap...@gmail.com> #24
br...@gmail.com <br...@gmail.com> #25
Workaround I came up with This is for text translation. I don't know how google behaves for html input. Could be similar...
Before passing the string to translator, run through this regex /(?=[^a-zA-Z _,.])/gm which matches all special characters and replace the matches with a special character that's not present in the string. For me, I used * character. Then pass it on to the translator and run the translated string through another simple regex /( * )|( *)|(* )/gm and replace the matches with empty string. Then you get the desired output. Other languages where input string contain non alphabet chars can filter out those chars too in the first regex if needed but it's not necessary. The first regex can further be modified(if speed is a concern) to match only those special characters which doesn't have a preceeding space character. Google won't add space to any special character which already has a preceeding space..
g3...@gmail.com <g3...@gmail.com> #26
se...@google.com <se...@google.com> #28
Hello,
Thank you for your patience.
I have forwarded your concerns to the product and they are working on it. Once I have an update it will be shared in this thread.
Kind Regards
se...@google.com <se...@google.com>
al...@gmail.com <al...@gmail.com> #29
su...@google.com <su...@google.com>
ar...@gmail.com <ar...@gmail.com> #30
si...@google.com <si...@google.com>
si...@google.com <si...@google.com>
ty...@gmail.com <ty...@gmail.com> #31
ty...@gmail.com <ty...@gmail.com> #32
Can we get an update on this issue since it has been a year since the last communication on this ticket or get its priority upped to a P1 since this issue has been around for four years and is the second most user-impacted cloud translation bug?
Thanks and best regards!
ba...@mellink.net <ba...@mellink.net> #33
This is annoying (also because the API is a paid service). Any update would be appreciated. Problem also happens with special HTML characters such as & for '&' and it seems to be depending on the fact if the words around the characters are in upper or lower case. Examples (translating from EN to FR):
The word "Crypto" is in uppercase: Extra space added in the middle of the & char:
The Crypto Fear & Greed index --> La crypto peur & amp; Indice de cupidité
The word "crypto" is in lowercase: No extra space added in the middle:
The crypto Fear & Greed index -> La crypto Peur & Indice de cupidité
The location of the space is inconsistent. Sometimes & will translate to & amp; and other times to & ;
da...@ex-situ.com <da...@ex-situ.com> #34
pe...@foundry.com <pe...@foundry.com> #35
I am wondering if there is any update on this, I'm still getting this issue when using the google.cloud.translate_v2 API?
Cheers
ku...@google.com <ku...@google.com>
pu...@google.com <pu...@google.com>
27...@student.pwr.edu.pl <27...@student.pwr.edu.pl> #36
ri...@gmail.com <ri...@gmail.com> #37
va...@gmail.com <va...@gmail.com> #38
ni...@babbels.nl <ni...@babbels.nl> #39
Any progress in the last 3 years?
Description
Google break add extra space to some tags
Example: <a>text</a><p>Hello <a>text</a></p><li>Hello</li><li>World</li>
Get: <a>Text</a> <p> Hallo <a>Text</a> </p><li> Hallo </li><li> Welt </li>
What you expected to happen:
Should be translated: <a>Text</a><p>Hallo <a>text</a></p><li>Hallo</li><li>Welt</li>
But Google break word by added space between letters
Steps to reproduce:
The input text to translate - <a>text</a><p>Hello <a>text</a></p><li>Hello</li><li>World</li>
Target: de or ru
Source: en