pu...@google.com <pu...@google.com>
pu...@google.com <pu...@google.com> #2
Apologies, I intended to create a feature request not a bug. I will create a feature request instead.
Description
Problem you have encountered:
The Zeppelin notebook component configured on a Dataproc cluster reports the error "SQL Execution returned an error!" in the Zeppelin UI when it is used to run BigQuery queries against regional datasets.
However, when the BigQuery datasets are multi-region, the query does not raise the error and the Zeppelin UI displays the result correctly.
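To check whether the same query succeeds outside Zeppelin, the query can be run directly with the `google-cloud-bigquery` Python client and an explicit job location. This is only a sketch under stated assumptions: the report does not identify the cause of the error, the helper names are hypothetical, and the project ID, SQL, and location must be supplied by the caller.

```python
# Hypothetical helpers for comparing regional and multi-region behaviour.
# Nothing here comes from the report itself; it is a diagnostic sketch.

# BigQuery multi-region locations; anything else is treated as a single region.
MULTI_REGIONS = {"US", "EU"}


def is_multi_region(location: str) -> bool:
    """Return True if the dataset location is a BigQuery multi-region."""
    return location.strip().upper() in MULTI_REGIONS


def run_query_at_location(project_id: str, sql: str, location: str):
    """Run `sql` as a BigQuery job in `location` and return rows as dicts.

    Requires the google-cloud-bigquery package and application credentials.
    """
    # Imported lazily so is_multi_region() works without the library installed.
    from google.cloud import bigquery

    client = bigquery.Client(project=project_id)
    # Passing `location` makes the job run in the dataset's region explicitly.
    job = client.query(sql, location=location)
    return [dict(row.items()) for row in job.result()]
```

If the query returns rows here for a dataset in `us-central1` but still fails in the Zeppelin UI, that would suggest the problem lies in the Zeppelin interpreter rather than in the dataset or the service account permissions.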
What you expected to happen:
The Zeppelin UI should display the results for regional datasets as well.
Steps to reproduce:
Create a Dataproc cluster with image version '2.2.27-debian12' in 'us-central1' with Zeppelin and Component Gateway enabled. Open the Zeppelin UI, go to the "Interpreter" settings for BigQuery, and add your project ID to the 'zeppelin.bigquery.project_id' property.
Create a service account (SA), grant it the BigQuery Job User and BigQuery Data Viewer roles, and generate a key in JSON format.
Following this doc [1], SSH into the Dataproc cluster master node, upload the JSON key, copy its path, and add that path to the Zeppelin interpreter configuration file, as can be seen below:
[1] https://zeppelin.apache.org/docs/0.6.2/interpreter/bigquery.html#setup-service-account-credentials
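The cluster and service account setup above can be sketched as a short script that prints the equivalent `gcloud` commands. This is an illustration, not the exact procedure used in the report: the project ID, cluster name, service account name, and key file name are placeholders, and it assumes the steps were performed with the gcloud CLI rather than the Cloud console.

```python
import shlex

# Placeholder values (assumptions); replace with your own.
PROJECT_ID = "my-project"
CLUSTER = "zeppelin-repro"
SA_NAME = "zeppelin-bq"
KEY_FILE = "key.json"


def repro_commands(project_id=PROJECT_ID, cluster=CLUSTER,
                   sa_name=SA_NAME, key_file=KEY_FILE):
    """Build the gcloud commands for the setup steps as argument lists."""
    sa_email = f"{sa_name}@{project_id}.iam.gserviceaccount.com"
    cmds = [
        # Dataproc cluster with Zeppelin and Component Gateway.
        ["gcloud", "dataproc", "clusters", "create", cluster,
         f"--project={project_id}", "--region=us-central1",
         "--image-version=2.2.27-debian12",
         "--optional-components=ZEPPELIN", "--enable-component-gateway"],
        # Service account used by the Zeppelin BigQuery interpreter.
        ["gcloud", "iam", "service-accounts", "create", sa_name,
         f"--project={project_id}"],
    ]
    # BigQuery Job User and BigQuery Data Viewer roles.
    for role in ("roles/bigquery.jobUser", "roles/bigquery.dataViewer"):
        cmds.append(["gcloud", "projects", "add-iam-policy-binding", project_id,
                     f"--member=serviceAccount:{sa_email}", f"--role={role}"])
    # JSON key to upload to the cluster master node.
    cmds.append(["gcloud", "iam", "service-accounts", "keys", "create",
                 key_file, f"--iam-account={sa_email}"])
    return cmds


if __name__ == "__main__":
    # Print the commands for review instead of executing them.
    for cmd in repro_commands():
        print(shlex.join(cmd))
```

The script only prints the commands so they can be reviewed before being run by hand.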
Other information (workarounds you have tried, documentation consulted, etc):
As a workaround, the customer was asked to query multi-regional datasets from the Zeppelin UI, since querying regional datasets reports this error.