Status Update
Comments
vi...@google.com <vi...@google.com>
pr...@azurgames.com <pr...@azurgames.com> #2
ma...@gmail.com <ma...@gmail.com> #3
+1
em...@gmail.com <em...@gmail.com> #4
The ability to set a default value to a column has been recently added to BigQuery. I believe this solves the problem. See the doc here :
pb...@deloitte.com <pb...@deloitte.com> #5
ma...@amprobotics.com <ma...@amprobotics.com> #6
When loading parquet files with an explicit schema (rather than the schema autodetection feature), column default values do not get used.
vi...@keepler.io <vi...@keepler.io> #7
he...@google.com <he...@google.com> #8
Hi,
If you want to auto populate CURRENT_TIMESTAMP() in bq load, please reference
Default values are applied when the loaded data has fewer columns than the destination table. NULL values in the loaded data are not converted to default values.
Let me know if you have any questions!
Description
What you would like to accomplish:
When inserting rows through streaming (with dataflow and the 'STREAMING_INSERTS' option), it is possible to have the insertion timestamp in a column but putting "AUTO" as a value of a column of type TIMESTAMP (see https://issuetracker.google.com/issues/72080883 ). However, this does not work with
bq load
, which means that one cannot switch to 'FILE_LOADS' insertion option in dataflow as it throws the error :How this might work:
Make this feature available to data loaded though
bq load
.If applicable, reasons why alternative solutions are not sufficient:
I do not know of an alternative.
Other information (workarounds you have tried, documentation consulted, etc):
N/A