Provide ability to create a Drive API Comment anchor resource as method on DocumentApp selection class [36763384]

Assigned

Feature Request

Status Update

No update yet.

Description

cl...@newvisions.org

created issue #1

Jan 9, 2016 03:30PM

Currently there is no way to fully leverage the Drive API Comments endpoint to connect comments programmatically to user selections in the Docs editor. This has massive implications for Add-ons for Docs where markup and commenting for feedback is concerned. In the EDU space, this is a major blocker for more advanced document feedback, markup, and grading is concerned.

From what I can tell, there are two pieces of the puzzle that could be combined to make this possible:

1) The Drive API (running as Advanced Service) currently allows for comment insertions, but these comments cannot be tied to user selections because the comment anchor resource for Docs is "proprietary." See

http://stackoverflow.com/questions/23498275/creating-anchored-comments-programmatically-in-google-docs

Inspecting selection-bound comments returned by the list endpoint suggests that behind the scenes, Google has an anchors resource that is a core part of the Docs product, and that these have simple string identifiers like "kix.74d1qqbpcb6s"

2) The DocumentApp service has the ability to identify the current user's cursor selection from the active Document (e.g. via DocumentApp.getActiveDocument().getSelection();) however the selection resource does not have a method to extract anything that can be used as a Drive Comment anchor.

My proposal is to add a method to the selection class that creates a "kix" anchor that can be used with the Drive API comments endpoint.

This same approach should also be taken for Sheets and any other editor where the active user's selections are made programmatically available.

Comments

to...@ziggipapers.com <to...@ziggipapers.com> #2Dec 16, 2016 09:23AM

Edit:
- Spurious line "65 / ~350" after the first paragraph

te...@gmail.com <te...@gmail.com> #3Dec 16, 2016 11:08AM

[Deleted User] <[Deleted User]> #4Aug 15, 2017 04:26PM

Could you perhaps use REGEX to parse the json string? Something like this should work (with some modifications for your use case):

WITH
yourTable AS (
SELECT
'{"bar": ["vimota", ""]}' AS json
UNION ALL
SELECT
'{"bar": [, "Brazil"]}' )
SELECT
ARRAY(
SELECT
REGEXP_EXTRACT(num, r'"(.*)"')
FROM
UNNEST(SPLIT(REGEXP_EXTRACT(JSON_EXTRACT(json,
'$.bar'), r'\[(.*)\]'))) AS num
WHERE
REGEXP_EXTRACT(num, r'"(.*)"') IS NOT NULL)
FROM
yourTable;

tp...@gmail.com <tp...@gmail.com> #5Aug 15, 2017 06:18PM

Nope, many of our json arrays contain json string values with user-input chars like " which would break a regex-based approach to parsing the json, since we'd have to distinguish " from \" from \\" from \\\", etc.

[Deleted User] <[Deleted User]> #6Aug 16, 2017 08:54AM

Thanks for the feedback! We'll take this suggestion into account as we plan JSON-related functionality, and I'll update here if and when there is more to share.

su...@gmail.com <su...@gmail.com> #7Nov 16, 2017 06:05AM

Thanks! In the meantime, what's the best way to turn a json array into a bq array? Looking through the docs on json functions I don't see a way to achieve this, other than writing a custom javascript udf, which imposes the strict limitations of queries that use udfs.

es...@gmail.com <es...@gmail.com> #8Feb 27, 2018 02:22PM

The best option right now--if you need to take escaping into account--is using a JavaScript UDF. If you generally have a small number of JSON array elements and you want to handle escaped strings, you could use a hack like this one:

CREATE TEMP FUNCTION JsonExtractArray(json STRING) AS (
(SELECT ARRAY_AGG(v IGNORE NULLS)
FROM UNNEST([
JSON_EXTRACT_SCALAR(json, '$.foo[0]'),
JSON_EXTRACT_SCALAR(json, '$.foo[1]'),
JSON_EXTRACT_SCALAR(json, '$.foo[2]'),
JSON_EXTRACT_SCALAR(json, '$.foo[3]'),
JSON_EXTRACT_SCALAR(json, '$.foo[4]'),
JSON_EXTRACT_SCALAR(json, '$.foo[5]'),
JSON_EXTRACT_SCALAR(json, '$.foo[6]'),
JSON_EXTRACT_SCALAR(json, '$.foo[7]'),
JSON_EXTRACT_SCALAR(json, '$.foo[8]'),
JSON_EXTRACT_SCALAR(json, '$.foo[9]')]) AS v)
);

Even though there is an escaped quote inside the "bar" string in this example, you'll get the expected three elements:

SELECT JsonExtractArray('{"foo":[1,2,3,"ba\\"r"]}');

gi...@gmail.com <gi...@gmail.com> #9Apr 28, 2018 06:47PM

Yeah, hardcoding a max length on the input arrays is a non starter for us.

da...@newfangled.com <da...@newfangled.com> #10May 23, 2018 06:44PM

check -

https://stackoverflow.com/a/45315826/5221944 for another option

[Deleted User] <[Deleted User]> #11Jul 19, 2018 05:01PM

> Process the data differently (e.g. using Cloud Dataflow or another tool) so that you can load it from newline-delimited JSON into BigQuery.

We've been taking advantage of bigquery to follow an ELT (extract-load-transfer) pattern, where the T happens in bigquery sql itself, so adding another T step like ETLT would be a heavy and undesirable change for us.

> Use a JavaScript UDF that takes the input JSON and returns the desired type; this is fairly straightforward but generally uses more CPU (and hence may require a higher billing tier).

(Discussed above.)

> Use SQL functions with the understanding that the solution breaks down if there are too many elements.

(Discussed above.)

[Deleted User] <[Deleted User]> #12Feb 15, 2019 03:41PM

We have a similar issue with map stored in JSON, parsing via REGEX is rather error prone. Right now seems JavaScript UDF is the only option and as mentioned before I'm fearing performance issues. In our case it's up to ~1M rows with each row containing map encoded as JSON (up to ~100 key-value pairs, might become more later).

Should I open a separate ticket for json_extract_map: string -> map<string, string> ?

ro...@gmail.com <ro...@gmail.com> #13Feb 16, 2019 04:20AM

Yes, please do (this is more along the lines of supporting a new type). Thanks!

vb...@gmail.com <vb...@gmail.com> #14Feb 22, 2019 03:25PM

ok, opened

https://issuetracker.google.com/issues/65488665

Message last modified on Feb 22, 2019 03:27PM

[Deleted User] <[Deleted User]> #15Aug 7, 2019 11:54AM

>> Process the data differently (e.g. using Cloud Dataflow or another tool) so that you can load it from newline-delimited JSON into BigQuery.
> We've been taking advantage of bigquery to follow an ELT (extract-load-transfer) pattern, where the T happens in bigquery sql itself, so adding another T step like ETLT would be a heavy and undesirable change for us.

I think what the StackOverflow user, Elliott Brossard, was proposing is that instead of using an ELT pattern, use an ETL pattern, with DataProc/DataFlow as your transformation technology/layer.

Basically:

1. Extract from source into Google Cloud Storage. (E)
2. Run a DataProc/DataFlow job to parse your data, and transform it as necessary. (T)
3. Write the result(s) to BigQuery. (L)

du...@gmail.com <du...@gmail.com> #16Aug 31, 2019 05:50AM

Another option is to add an STRING_TO_ARRAY() function, as we already have the reverse one: ARRAY_TO_STRING()

It should basically do this:
regexp_extract_all(json_extract(FIELD, '$.keyWithArrayAsVal'), '{{[^}}]+}}')

ap...@gmail.com <ap...@gmail.com> #17Sep 12, 2019 07:00PM

Any updates on this?

[Deleted User] <[Deleted User]> #18Sep 26, 2019 06:40PM

Not yet. The best workarounds are those listed above, e.g. a JavaScript UDF or splitting with a regex, assuming the strings don't have escaped quotes in them.

se...@gmail.com <se...@gmail.com> #19Oct 15, 2019 02:19PM

+1 for this -- would be extraordinary helpful for occasions where you may use a 3rd party tool that integrates with BigQuery but where you can't control how the data arrives (e.g. as a string).

A prime example of such a tool is Segment.com:
- can be configured to use Redshift or BigQuery as a data warehouse
- stringifies arrays before sending to warehouse

al...@curvestone.io <al...@curvestone.io> #20May 19, 2020 10:30PM

+1 for this.

sw...@terpmail.umd.edu <sw...@terpmail.umd.edu> #21May 22, 2020 06:57PM

+100 for this.
Please implement a aolution for thia.

sw...@terpmail.umd.edu <sw...@terpmail.umd.edu> #22May 22, 2020 06:57PM

+1 for this.

[Deleted User] <[Deleted User]> #23May 28, 2020 09:36AM

+1 for this

[Deleted User] <[Deleted User]> #24Jun 1, 2020 05:54PM

+1 for this

gr...@gmail.com <gr...@gmail.com> #25Jul 19, 2020 12:13PM

+1 for this

al...@gmail.com <al...@gmail.com> #26Jul 29, 2020 05:23PM

[Deleted User] <[Deleted User]> #27Aug 10, 2020 04:07AM

as...@gmail.com <as...@gmail.com> #28Sep 2, 2020 04:47PM

+1

It seems insane that in late 2019, BigQuery can't unnest a stringified JSON array without resorting to performance-breaking hacks. What the heck Google?

da...@gmail.com <da...@gmail.com> #29Oct 9, 2020 04:42PM

Unnest JSON Arrays is blocking us to migrate our application completely into the cloud.

[Deleted User] <[Deleted User]> #30Oct 25, 2020 10:59AM

ca...@gmail.com <ca...@gmail.com> #31Oct 25, 2020 11:17AM

bj...@measureone.com <bj...@measureone.com> #32Oct 25, 2020 08:21PM

ru...@gmail.com <ru...@gmail.com> #33Nov 10, 2020 08:52PM

+1

I ran into this issue again today. I was able to use a workaround using REGEXP_EXTRACT, but now I have to teach this hack (and associated pitfalls and limitations) to the whole team.

ne...@acerb.biz <ne...@acerb.biz> #34Jan 9, 2021 12:26PM

ct...@gmail.com <ct...@gmail.com> #35Feb 8, 2021 01:37PM

+1

What I wouldn't give for a JSON_EXTRACT_ARRAY function. JSON_EXTRACT already allows array access by index, and JSON_EXTRACT_SCALAR will actually return NULL if the result is an array (or an object), so it seems safe to assume there are already means within those functions to parse JSON arrays - can we not expose arrays natively?

In addition, pretty much every ETL platform converts JSON columns into strings within BigQuery, but if the data in those columns can not be readily converted into an array, BigQuery becomes a real handicap in the ETL process. I appreciate any consideration here.

en...@slidespro.io <en...@slidespro.io> #36Apr 9, 2021 01:49PM

cd...@escp.eu <cd...@escp.eu> #37Apr 27, 2021 07:47AM

pr...@gmail.com <pr...@gmail.com> #38Jun 3, 2021 08:29AM

lu...@britanniaeducationtrust.com <lu...@britanniaeducationtrust.com> #39Jun 28, 2021 09:12AM

ig...@carely.group <ig...@carely.group> #40Jul 27, 2021 09:51AM

by...@gmail.com <by...@gmail.com> #41Jul 27, 2021 09:59AM

+1 this would be really helpful.

dy...@gmail.com <dy...@gmail.com> #42Jul 30, 2021 09:56AM

th...@gmail.com <th...@gmail.com> #43Aug 6, 2021 12:55PM

or...@gmail.com <or...@gmail.com> #44Aug 11, 2021 07:44AM

en...@gmail.com <en...@gmail.com> #45Aug 11, 2021 02:28PM

Message last modified on Aug 20, 2021 06:47AM

jd...@gmail.com <jd...@gmail.com> #46Aug 27, 2021 09:14AM

+1 !

km...@gmail.com <km...@gmail.com> #47Sep 4, 2021 05:10PM

just tried to create UDF function called json_extract_array but it reported an error that "User-defined function name 'json_extract_array' conflicts with a reserved built-in function name". Unfortunately, it's not usable yet :) so it looks like there is some progress.

rm...@gmail.com <rm...@gmail.com> #48Oct 28, 2021 02:52PM

al...@gmail.com <al...@gmail.com> #49Nov 29, 2021 10:06PM

Comment has been deleted.

Message last modified on Nov 29, 2021 10:07PM

pr...@gmail.com <pr...@gmail.com> #50Dec 28, 2021 10:05PM

mg...@gmail.com <mg...@gmail.com> #51Feb 15, 2022 12:19AM

Comment has been deleted.

Message last modified on Feb 15, 2022 12:19AM

mp...@gmail.com <mp...@gmail.com> #52May 6, 2022 09:51PM

ju...@gmail.com <ju...@gmail.com> #53Sep 1, 2022 11:21PM

pa...@gmail.com <pa...@gmail.com> #54Jan 3, 2023 09:26PM

an...@gmail.com <an...@gmail.com> #55Mar 2, 2023 04:18AM

ar...@gmail.com <ar...@gmail.com> #56Mar 17, 2023 10:18PM

ro...@gmail.com <ro...@gmail.com> #57Apr 2, 2023 04:29PM

Did this literally just get released?

I'm seeing documentation for a native JSON_EXTRACT_ARRAY function now here:

https://cloud.google.com/bigquery/docs/reference/standard-sql/json_functions#json_extract_array

7h...@gmail.com <7h...@gmail.com> #58Apr 12, 2023 11:05AM

https://cloud.google.com/bigquery/docs/release-notes#May_01_2020

se...@queries.co.jp <se...@queries.co.jp> #59Apr 21, 2023 12:19PM

Thanks for all the upvotes.

ja...@confluent.io <ja...@confluent.io> #60Jun 9, 2023 07:25PM

jp...@google.com <jp...@google.com> Jul 13, 2023 01:42AM

Assigned to wo...@google.com.

vi...@gmail.com <vi...@gmail.com> #61Jul 14, 2023 01:51AM

dh...@gmail.com <dh...@gmail.com> #62Jul 21, 2023 01:28AM

sm...@gmail.com <sm...@gmail.com> #63Aug 12, 2023 08:55AM

ks...@stanford.edu <ks...@stanford.edu> #64Aug 14, 2023 02:47AM

+1 Please add this feature

Message last modified on Aug 14, 2023 02:47AM

di...@gmail.com <di...@gmail.com> #65Aug 27, 2023 12:49AM

hu...@gmail.com <hu...@gmail.com> #66Aug 29, 2023 04:59AM

+1 Please add

va...@gmail.com <va...@gmail.com> #67Sep 13, 2023 06:18PM

be...@gmail.com <be...@gmail.com> #68Sep 18, 2023 05:14PM

ou...@gmail.com <ou...@gmail.com> #69Sep 21, 2023 09:36AM

jo...@gmail.com <jo...@gmail.com> #70Sep 25, 2023 08:26PM

va...@breaktrue.ai <va...@breaktrue.ai> #71Nov 7, 2023 10:16PM

pr...@gmail.com <pr...@gmail.com> #72Nov 24, 2023 01:20AM

+10000000000000

be...@gmail.com <be...@gmail.com> #73Nov 24, 2023 01:19PM

+1000000000000000000000 please fix this after 8 years!!

su...@gmail.com <su...@gmail.com> #74Nov 29, 2023 06:31AM

Google, come on, do this!

rh...@gmail.com <rh...@gmail.com> #75Jan 2, 2024 06:49PM

Yes - about time!

ni...@elunic.com <ni...@elunic.com> #76Jan 19, 2024 12:09PM

PUSH

ba...@gmail.com <ba...@gmail.com> #77Feb 24, 2024 04:37PM

ma...@gmail.com <ma...@gmail.com> #78Apr 4, 2024 05:17AM

+googol

sa...@gmail.com <sa...@gmail.com> #79Apr 13, 2024 01:49AM

[Deleted User] <[Deleted User]> #80Apr 15, 2024 11:19PM

aa...@aaf.lu <aa...@aaf.lu> #81Apr 18, 2024 01:49PM

se...@vezign.com <se...@vezign.com> #82Jun 16, 2024 09:15PM

Oh no... this has been on here since 2016... not much hope for it being resolved eh?

vi...@gmail.com <vi...@gmail.com> #83Jul 29, 2024 09:24PM

ca...@gmail.com <ca...@gmail.com> #84Sep 6, 2024 05:17PM

da...@dbuxton.com <da...@dbuxton.com> #85Jan 22, 2025 02:56PM

mj...@mozilla.com <mj...@mozilla.com> #86Jan 28, 2025 07:48PM

de...@burai.online <de...@burai.online> #87Feb 18, 2025 03:04PM

Comment has been deleted.

Message last modified on Feb 18, 2025 03:06PM

ss...@umbrellab.com <ss...@umbrellab.com> #88Apr 14, 2025 05:06AM

+1
2025 and it's still not possible?

ni...@elunic.com <ni...@elunic.com> #89Apr 14, 2025 06:35AM

How can such a vital bug not be fixed after almost 10 years....

ti...@telus.com <ti...@telus.com> #90Apr 17, 2025 02:54PM

+1 I was hyped after google next this year to start building some new integrations with all the new LLM capabilities that have come out in the last couple of years, but have been stopped dead in my tracks by lack of a basic feature that still hasn't been implemented in 10 years. My disappointment is immeasurable and my day is ruined.

Message last modified on Apr 17, 2025 02:55PM

Issue 36763384

Description

Issue summary

Comments

to...@ziggipapers.com <to...@ziggipapers.com> #2Dec 16, 2016 09:23AM

te...@gmail.com <te...@gmail.com> #3Dec 16, 2016 11:08AM

[Deleted User] <[Deleted User]> #4Aug 15, 2017 04:26PM

tp...@gmail.com <tp...@gmail.com> #5Aug 15, 2017 06:18PM

[Deleted User] <[Deleted User]> #6Aug 16, 2017 08:54AM

su...@gmail.com <su...@gmail.com> #7Nov 16, 2017 06:05AM

es...@gmail.com <es...@gmail.com> #8Feb 27, 2018 02:22PM

gi...@gmail.com <gi...@gmail.com> #9Apr 28, 2018 06:47PM

da...@newfangled.com <da...@newfangled.com> #10May 23, 2018 06:44PM

[Deleted User] <[Deleted User]> #11Jul 19, 2018 05:01PM

[Deleted User] <[Deleted User]> #12Feb 15, 2019 03:41PM

ro...@gmail.com <ro...@gmail.com> #13Feb 16, 2019 04:20AM

vb...@gmail.com <vb...@gmail.com> #14Feb 22, 2019 03:25PM

[Deleted User] <[Deleted User]> #15Aug 7, 2019 11:54AM

du...@gmail.com <du...@gmail.com> #16Aug 31, 2019 05:50AM

ap...@gmail.com <ap...@gmail.com> #17Sep 12, 2019 07:00PM

[Deleted User] <[Deleted User]> #18Sep 26, 2019 06:40PM

se...@gmail.com <se...@gmail.com> #19Oct 15, 2019 02:19PM

al...@curvestone.io <al...@curvestone.io> #20May 19, 2020 10:30PM

sw...@terpmail.umd.edu <sw...@terpmail.umd.edu> #21May 22, 2020 06:57PM

sw...@terpmail.umd.edu <sw...@terpmail.umd.edu> #22May 22, 2020 06:57PM

[Deleted User] <[Deleted User]> #23May 28, 2020 09:36AM

[Deleted User] <[Deleted User]> #24Jun 1, 2020 05:54PM

gr...@gmail.com <gr...@gmail.com> #25Jul 19, 2020 12:13PM

al...@gmail.com <al...@gmail.com> #26Jul 29, 2020 05:23PM

[Deleted User] <[Deleted User]> #27Aug 10, 2020 04:07AM

as...@gmail.com <as...@gmail.com> #28Sep 2, 2020 04:47PM

da...@gmail.com <da...@gmail.com> #29Oct 9, 2020 04:42PM

[Deleted User] <[Deleted User]> #30Oct 25, 2020 10:59AM

ca...@gmail.com <ca...@gmail.com> #31Oct 25, 2020 11:17AM

bj...@measureone.com <bj...@measureone.com> #32Oct 25, 2020 08:21PM

ru...@gmail.com <ru...@gmail.com> #33Nov 10, 2020 08:52PM

ne...@acerb.biz <ne...@acerb.biz> #34Jan 9, 2021 12:26PM

ct...@gmail.com <ct...@gmail.com> #35Feb 8, 2021 01:37PM

en...@slidespro.io <en...@slidespro.io> #36Apr 9, 2021 01:49PM

cd...@escp.eu <cd...@escp.eu> #37Apr 27, 2021 07:47AM

pr...@gmail.com <pr...@gmail.com> #38Jun 3, 2021 08:29AM

lu...@britanniaeducationtrust.com <lu...@britanniaeducationtrust.com> #39Jun 28, 2021 09:12AM

ig...@carely.group <ig...@carely.group> #40Jul 27, 2021 09:51AM

by...@gmail.com <by...@gmail.com> #41Jul 27, 2021 09:59AM

dy...@gmail.com <dy...@gmail.com> #42Jul 30, 2021 09:56AM

th...@gmail.com <th...@gmail.com> #43Aug 6, 2021 12:55PM

or...@gmail.com <or...@gmail.com> #44Aug 11, 2021 07:44AM

en...@gmail.com <en...@gmail.com> #45Aug 11, 2021 02:28PM

jd...@gmail.com <jd...@gmail.com> #46Aug 27, 2021 09:14AM

km...@gmail.com <km...@gmail.com> #47Sep 4, 2021 05:10PM

rm...@gmail.com <rm...@gmail.com> #48Oct 28, 2021 02:52PM

al...@gmail.com <al...@gmail.com> #49Nov 29, 2021 10:06PM

pr...@gmail.com <pr...@gmail.com> #50Dec 28, 2021 10:05PM

mg...@gmail.com <mg...@gmail.com> #51Feb 15, 2022 12:19AM

mp...@gmail.com <mp...@gmail.com> #52May 6, 2022 09:51PM

ju...@gmail.com <ju...@gmail.com> #53Sep 1, 2022 11:21PM

pa...@gmail.com <pa...@gmail.com> #54Jan 3, 2023 09:26PM

an...@gmail.com <an...@gmail.com> #55Mar 2, 2023 04:18AM

ar...@gmail.com <ar...@gmail.com> #56Mar 17, 2023 10:18PM

ro...@gmail.com <ro...@gmail.com> #57Apr 2, 2023 04:29PM

7h...@gmail.com <7h...@gmail.com> #58Apr 12, 2023 11:05AM

se...@queries.co.jp <se...@queries.co.jp> #59Apr 21, 2023 12:19PM

ja...@confluent.io <ja...@confluent.io> #60Jun 9, 2023 07:25PM

jp...@google.com <jp...@google.com> Jul 13, 2023 01:42AM

vi...@gmail.com <vi...@gmail.com> #61Jul 14, 2023 01:51AM

dh...@gmail.com <dh...@gmail.com> #62Jul 21, 2023 01:28AM

sm...@gmail.com <sm...@gmail.com> #63Aug 12, 2023 08:55AM

ks...@stanford.edu <ks...@stanford.edu> #64Aug 14, 2023 02:47AM

di...@gmail.com <di...@gmail.com> #65Aug 27, 2023 12:49AM

hu...@gmail.com <hu...@gmail.com> #66Aug 29, 2023 04:59AM

va...@gmail.com <va...@gmail.com> #67Sep 13, 2023 06:18PM

be...@gmail.com <be...@gmail.com> #68Sep 18, 2023 05:14PM

ou...@gmail.com <ou...@gmail.com> #69Sep 21, 2023 09:36AM

jo...@gmail.com <jo...@gmail.com> #70Sep 25, 2023 08:26PM

va...@breaktrue.ai <va...@breaktrue.ai> #71Nov 7, 2023 10:16PM

pr...@gmail.com <pr...@gmail.com> #72Nov 24, 2023 01:20AM

be...@gmail.com <be...@gmail.com> #73Nov 24, 2023 01:19PM

su...@gmail.com <su...@gmail.com> #74Nov 29, 2023 06:31AM

rh...@gmail.com <rh...@gmail.com> #75Jan 2, 2024 06:49PM

ni...@elunic.com <ni...@elunic.com> #76Jan 19, 2024 12:09PM