Support the creation and refreshing of Materialized Views [62244996]

Verified

Feature Request

Status Update

No update yet.

Description

r....@gmail.com

created issue #1

Jun 1, 2017 07:17AM

We are building a data pipeline pipeline using layered BigQuery views. To control for performance and especially costs (build aggregates) it makes sense materialize the intermediate results.

Of course we can do this by writing the result of a view to a new table and scheduling this refresh with Airflow (or something else), but having properly Materialized views (MV) implemented in BigQuery 'the oracle way' would really reduce this amount of work:

https://docs.oracle.com/database/121/DWHSG/refresh.htm#DWHSG8360

Suggested modes for refreshing:
1) Automatic / scheduled: add metadata to a view (cron like) where you can configure the refresh time / frequency
2) Automatic / triggered: based on a source table, used in the view where data that is added / changed
2) Manual, similar to using DBMS_MVIEW.REFRESH in Oracle

Furthermore, having the option to refersh a complete dependency tree of materialized views would also be great (something similar to DBMS_MVIEW.REFRESH_DEPENDENT in Oracle)

Having this kind of functionality will really make Bigquery an even more NoOps system. After all, people building the following kind of extra functionality is not NoOps, right?:

https://open.nytimes.com/faster-simpler-workflow-analytical-insights-ae6c7055e187

I realize this issue is related to

https://issuetracker.google.com/issues/35905577 and

https://issuetracker.google.com/issues/35905241, difference here being that these views should be created using ANSI style DDL statements.

Comments

bl...@google.com <bl...@google.com> Jun 1, 2017 07:33AM

Assigned to eu...@google.com.

eu...@google.com <eu...@google.com> #2Jun 1, 2017 10:43PM

I think it makes a perfect sense as a feature request and is indeed distinct from other issues you referenced. Thanks for filing!

eu...@google.com <eu...@google.com> #3Jun 1, 2017 10:51PM

I have CCed stakeholders and leaving the feature request assigned to myself for now.

r....@gmail.com <r....@gmail.com> #4Jun 2, 2017 01:30PM

Thanks for accepting this request, having this would be fantastic in my opinion (and my team's)!

eu...@google.com <eu...@google.com> #5Jun 2, 2017 09:00PM

RE:"Thanks for accepting this request": this is just being brought for consideration. Thanks again for filing the feature request.

[Deleted User] <[Deleted User]> #6Mar 6, 2018 08:05AM

Any progress on this?
This request is similar to

https://issuetracker.google.com/issues/72224267

Why are we so interested in this?

Our goal is to build an easy to maintain, basically 'NoETL' datawarehouse pipeline. Everywhere where we have to 'fall back' to Python programming makes the whole pipeline more complex to maintain. Having (parametrized) materialized views that could do a full or partial refresh would really be THE enabler for this.

Basically it would be something like Looker's Persistent Direived Tables, but then with the added abilities to:
-Refresh incrementally
-Fully use time partitioning and table suffixing.

eu...@google.com <eu...@google.com> May 31, 2018 01:40AM

Reassigned to ha...@google.com.

ha...@google.com <ha...@google.com> Aug 2, 2018 04:55PM

Reassigned to zh...@google.com.

zh...@google.com <zh...@google.com> #7Aug 2, 2018 05:12PM

What types of queries do you want to use in the MV?

Using aggregation views to optimize query performance and cost is a different use case than using MV as a workaround to ELT.

[Deleted User] <[Deleted User]> #8Aug 13, 2018 07:41AM

#7, for us the use cases would be both ETL and costs optimization queries (fyi, I am the original poster of these request)

IMO the main difference between the two patterns are:
1) ETL type MV's have more joins
2) Cost optimization MV's are simpler / aggregate style queries

In the end, what is important is that it supports the following refresh patterns:
1) easy: a full refresh (truncate / insert pattern). Oracle calls this a full refresh
2) more complex: incremental update, for this to work the insert/update/delete (or merge) has to be driven by a key. Oracle calls this the fast refresh option.

IMO this functionality could / should replace BQ scheduled queries, which I guess won't be a real focus anymore now that cloud composer is here...

zh...@google.com <zh...@google.com> Dec 26, 2018 09:35PM

Reassigned to vt...@google.com.

tj...@wakr.com <tj...@wakr.com> #9Feb 7, 2019 09:15AM

Is there any progress with regards to this topic? Having materialised views in BigQuery would be the trigger for us to move away from our current solution, onto the BQ platform....

vt...@google.com <vt...@google.com> #10Feb 7, 2019 04:35PM

Tjeerd - please sync up with your Google Cloud account team to share roadmap under an NDA.

[Deleted User] <[Deleted User]> #11Feb 18, 2019 10:48AM

Is materialized view in early-access stage?
We just saw it in gcloud CLI and would like to try this feature.

ma...@tix.com.au <ma...@tix.com.au> #12Mar 26, 2019 11:40PM

How do we get access to the experimental materialized view feature?

vt...@google.com <vt...@google.com> #13Mar 27, 2019 01:56PM

please sync up with your Google Cloud account team to share roadmap under an NDA.

cc...@sperdegroot.nl <cc...@sperdegroot.nl> #14Apr 18, 2019 10:14AM

Had hoped something would be announced at Google Next '19, guess we'll have to wait :-(

[Deleted User] <[Deleted User]> #15Jun 19, 2019 10:27AM

Yep, would love this.

p....@gmail.com <p....@gmail.com> #16Oct 25, 2019 12:57AM

We are looking for this feature too, and it often comes up when we evaluate bq vs snowflake. Would be great if bq can support MVs.

vt...@google.com <vt...@google.com> Oct 30, 2019 08:54PM

Reassigned to bv...@google.com.

[Deleted User] <[Deleted User]> #17Oct 30, 2019 08:58PM

with bq mk this is sort of possible (transfer jobs), something like:

bq mk \
--transfer_config \
--target_dataset='target_dataset' \
--display_name='schedule_name' \
--params='{"query":"SELECT xyz FROM table","destination_table_name_template":"target_table","write_disposition":"WRITE_APPEND"}' \
--data_source='scheduled_query' \
--schedule='every day 01:00'

mi...@gmail.com <mi...@gmail.com> #18Nov 22, 2019 05:59AM

nothing announced in Google Next 19 UK :(

mi...@gmail.com <mi...@gmail.com> #19Dec 6, 2019 01:00AM

was reading a blog about ML from google , and they dropped this info that materialized views are already in Alpha !!!!!!

ku...@xiatech.co.uk <ku...@xiatech.co.uk> #20Dec 24, 2019 10:43AM

Any update on this BQ team? If it is in alpha, let us know ASAP!

am...@gmail.com <am...@gmail.com> #21Jan 14, 2020 09:56AM

are they still working on this? the issue is created on "Jun 1, 2017 12:47PM" , But still can't see any update on Materialized view.

fr...@google.com <fr...@google.com> #22Feb 8, 2020 02:10AM

Another customer expressed interest in this capability.

Message last modified on Feb 8, 2020 02:21AM

lo...@google.com <lo...@google.com> #23Feb 12, 2020 12:54PM

Hi,

I have a customer that is very interested in joining early access for this functionality.
Can I help them by signing up somewhere?

Thanks,
Lorin

mi...@gmail.com <mi...@gmail.com> #24Feb 27, 2020 10:51PM

Ok, Good news, I asked the product manager of BigQuery in twitter, and he said, Google next 2020, which will be in a couple of weeks.

re...@pup-eeze.com <re...@pup-eeze.com> #25Mar 22, 2020 12:30AM

Ch...@walmart.com <Ch...@walmart.com> #26Apr 2, 2020 02:28PM

[Deleted User] <[Deleted User]> #27Apr 2, 2020 02:38PM

bv...@google.com <bv...@google.com> #28Apr 3, 2020 04:36PM

We will announce support for materialized views in BQ soon. Stay tuned!

ho...@google.com <ho...@google.com> #29May 7, 2020 07:20AM

Beta available:

https://cloud.google.com/bigquery/docs/materialized-views-best-practices

(emphasis on beta!)

ma...@gmail.com <ma...@gmail.com> #30Oct 30, 2020 09:52AM

Is there any scope to extend support to unnesting within the same table for MV's on the near horizon?

[Deleted User] <[Deleted User]> #31Mar 30, 2021 08:50PM

Materialized views are now GA, including support for UNNEST: https://cloud.google.com/blog/products/data-analytics/bigquery-materialized-views-now-ga

bw...@google.com <bw...@google.com> #32May 6, 2021 03:49AM

Verified by bw...@google.com.

Thanks everyone for the suggestions. Materialized Views V2 will be coming soon.

pe...@b6tp.com <pe...@b6tp.com> #33May 6, 2021 01:35PM

Thank you! Do you have reference/list of features to expect on V2?

bw...@google.com <bw...@google.com> #34May 6, 2021 04:27PM

The upcoming version of materialized views will add JOIN support to allow multi-table views as well as table projections.