Comments
jk...@google.com <jk...@google.com>
ko...@gmail.com <ko...@gmail.com>
ch...@gmail.com <ch...@gmail.com> #2
Edit:
- Spurious line "65 / ~350" after the first paragraph
ko...@gmail.com <ko...@gmail.com> #3
+1
ch...@gmail.com <ch...@gmail.com> #4
Could you perhaps use a regex to parse the JSON string? Something like this should work (with some modifications for your use case):
WITH
yourTable AS (
SELECT
'{"bar": ["vimota", ""]}' AS json
UNION ALL
SELECT
'{"bar": [, "Brazil"]}' )
SELECT
ARRAY(
SELECT
REGEXP_EXTRACT(num, r'"(.*)"')
FROM
UNNEST(SPLIT(REGEXP_EXTRACT(JSON_EXTRACT(json,
'$.bar'), r'\[(.*)\]'))) AS num
WHERE
REGEXP_EXTRACT(num, r'"(.*)"') IS NOT NULL)
FROM
yourTable;
dr...@gmail.com <dr...@gmail.com> #5
Nope, many of our JSON arrays contain JSON string values with user-input characters like ", which would break a regex-based approach to parsing the JSON, since we'd have to distinguish " from \" from \\" from \\\", etc.
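To illustrate the point, here is a small sketch (in JavaScript, since that is what a UDF would run; `naiveSplit` is a hypothetical stand-in for the SQL REGEXP_EXTRACT/SPLIT approach above) showing how a regex-based splitter mishandles an escaped quote that a real JSON parser resolves correctly:

```javascript
// The JSON array below has a string element containing an escaped quote,
// exactly the kind of user input described above.
const input = '["a", "say \\"hi\\""]';

// Naive splitter mirroring the SQL REGEXP_EXTRACT/SPLIT approach:
// strip the brackets, split on commas, then unquote each piece with a regex.
function naiveSplit(jsonArray) {
  const inner = jsonArray.match(/\[(.*)\]/)[1];
  return inner.split(',').map(piece => {
    const m = piece.match(/"(.*)"/);
    return m ? m[1] : null;
  });
}

// A real JSON parser resolves the escape; the naive splitter leaves the
// backslashes in place (and would also break on commas inside strings).
const parsed = JSON.parse(input);  // second element is: say "hi"
const naive = naiveSplit(input);   // second element still contains backslashes
```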
da...@gmail.com <da...@gmail.com> #6
Thanks for the feedback! We'll take this suggestion into account as we plan JSON-related functionality, and I'll update here if and when there is more to share.
ch...@gmail.com <ch...@gmail.com> #7
Thanks! In the meantime, what's the best way to turn a JSON array into a BigQuery array? Looking through the docs on JSON functions, I don't see a way to achieve this other than writing a custom JavaScript UDF, which imposes the strict limitations of queries that use UDFs.
jo...@lumapps.com <jo...@lumapps.com> #8
The best option right now--if you need to take escaping into account--is using a JavaScript UDF. If you generally have a small number of JSON array elements and you want to handle escaped strings, you could use a hack like this one:
CREATE TEMP FUNCTION JsonExtractArray(json STRING) AS (
(SELECT ARRAY_AGG(v IGNORE NULLS)
FROM UNNEST([
JSON_EXTRACT_SCALAR(json, '$.foo[0]'),
JSON_EXTRACT_SCALAR(json, '$.foo[1]'),
JSON_EXTRACT_SCALAR(json, '$.foo[2]'),
JSON_EXTRACT_SCALAR(json, '$.foo[3]'),
JSON_EXTRACT_SCALAR(json, '$.foo[4]'),
JSON_EXTRACT_SCALAR(json, '$.foo[5]'),
JSON_EXTRACT_SCALAR(json, '$.foo[6]'),
JSON_EXTRACT_SCALAR(json, '$.foo[7]'),
JSON_EXTRACT_SCALAR(json, '$.foo[8]'),
JSON_EXTRACT_SCALAR(json, '$.foo[9]')]) AS v)
);
Even though there is an escaped quote inside the "bar" string in this example, you'll get the expected four elements:
SELECT JsonExtractArray('{"foo":[1,2,3,"ba\\"r"]}');
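For reference, a sketch of what the JavaScript UDF route might look like. The SQL wrapper shown in the comment is an assumption about how you would hook it up; the function body below is plain JavaScript:

```javascript
// Hypothetical JS body for a BigQuery JavaScript UDF; the assumed SQL wrapper
// (not executed here) would be something like:
//   CREATE TEMP FUNCTION JsonExtractArray(json STRING)
//   RETURNS ARRAY<STRING> LANGUAGE js AS """...this body...""";
function jsonExtractArray(json) {
  try {
    const arr = JSON.parse(json).foo;  // '.foo' matches the SQL hack above
    // Stringify each element, like JSON_EXTRACT_SCALAR does.
    return Array.isArray(arr) ? arr.map(String) : null;
  } catch (e) {
    return null;  // not valid JSON
  }
}
```

Unlike the hardcoded-index hack, this handles arrays of any length and escaped quotes, at the cost of the UDF limitations discussed above.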
ch...@gmail.com <ch...@gmail.com> #9
Yeah, hardcoding a max length on the input arrays is a non-starter for us.
to...@gmail.com <to...@gmail.com> #11
> Process the data differently (e.g. using Cloud Dataflow or another tool) so that you can load it from newline-delimited JSON into BigQuery.
We've been taking advantage of BigQuery to follow an ELT (extract-load-transform) pattern, where the T happens in BigQuery SQL itself, so adding another T step (i.e. ETLT) would be a heavy and undesirable change for us.
> Use a JavaScript UDF that takes the input JSON and returns the desired type; this is fairly straightforward but generally uses more CPU (and hence may require a higher billing tier).
(Discussed above.)
> Use SQL functions with the understanding that the solution breaks down if there are too many elements.
(Discussed above.)
oa...@gmail.com <oa...@gmail.com> #12
We have a similar issue with a map stored as JSON; parsing via regex is rather error-prone. Right now it seems a JavaScript UDF is the only option, and as mentioned before I'm fearing performance issues. In our case it's up to ~1M rows, with each row containing a map encoded as JSON (up to ~100 key-value pairs, possibly more later).
Should I open a separate ticket for json_extract_map: string -> map<string, string>?
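In the meantime, the map case can be approximated with the same UDF trick. A sketch, under the assumption that you represent the map as an array of key/value structs (BigQuery has no MAP type, and the wrapper in the comment is hypothetical):

```javascript
// Hypothetical JS body for a json_extract_map-style UDF; the assumed wrapper
// (not executed here) would be something like:
//   CREATE TEMP FUNCTION JsonExtractMap(json STRING)
//   RETURNS ARRAY<STRUCT<key STRING, value STRING>> LANGUAGE js AS """...""";
function jsonExtractMap(json) {
  try {
    const obj = JSON.parse(json);
    // Emit one {key, value} pair per property, values stringified.
    return Object.keys(obj).map(k => ({key: k, value: String(obj[k])}));
  } catch (e) {
    return null;  // not valid JSON
  }
}
```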
jo...@gmail.com <jo...@gmail.com> #13
Yes, please do (this is more along the lines of supporting a new type). Thanks!
se...@gmail.com <se...@gmail.com> #15
>> Process the data differently (e.g. using Cloud Dataflow or another tool) so that you can load it from newline-delimited JSON into BigQuery.
> We've been taking advantage of BigQuery to follow an ELT (extract-load-transform) pattern, where the T happens in BigQuery SQL itself, so adding another T step (i.e. ETLT) would be a heavy and undesirable change for us.
I think what the StackOverflow user, Elliott Brossard, was proposing is that instead of using an ELT pattern, you use an ETL pattern, with Dataproc/Dataflow as your transformation layer.
Basically:
1. Extract from source into Google Cloud Storage. (E)
2. Run a DataProc/DataFlow job to parse your data, and transform it as necessary. (T)
3. Write the result(s) to BigQuery. (L)
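The core of step 2 can be sketched as below (the "bar" field name is illustrative; the real job would run in Dataproc/Dataflow, not standalone): parse each source record once and emit newline-delimited JSON that BigQuery loads directly into a REPEATED column, so no string parsing is needed at query time.

```javascript
// Transform (T) step sketch: records in, NDJSON out.
function toNdjson(records) {
  return records
    .map(r => JSON.parse(r))                  // full JSON parse, escapes and all
    .map(o => JSON.stringify({bar: o.bar}))   // keep the array as a real array
    .join('\n');                              // one JSON object per line
}
```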
[Deleted User] <[Deleted User]> #16
Another option is to add a STRING_TO_ARRAY() function, since we already have the reverse one: ARRAY_TO_STRING().
It should basically do this:
regexp_extract_all(json_extract(FIELD, '$.keyWithArrayAsVal'), r'{[^}]+}')
al...@gmail.com <al...@gmail.com> #17
Any updates on this?
as...@gmail.com <as...@gmail.com> #18
Not yet. The best workarounds are those listed above, e.g. a JavaScript UDF or splitting with a regex, assuming the strings don't have escaped quotes in them.
da...@gmail.com <da...@gmail.com> #19
+1 for this -- it would be extraordinarily helpful for occasions where you use a third-party tool that integrates with BigQuery but where you can't control how the data arrives (e.g. as a string).
A prime example of such a tool is Segment.com:
- can be configured to use Redshift or BigQuery as a data warehouse
- stringifies arrays before sending to warehouse
ab...@gmail.com <ab...@gmail.com> #20
+1 for this.
br...@gmail.com <br...@gmail.com> #21
+100 for this.
Please implement a solution for this.
dy...@gmail.com <dy...@gmail.com> #22
+1 for this.
Description
Stack Overflow thread: