Status Update
Comments
kh...@google.com <kh...@google.com> #2
Thank you for opening this feature request. I've forwarded this request to our engineering team for evaluation. Any progress updates or a resolution will be posted on this thread.
Sincerely,
mo...@gmail.com <mo...@gmail.com> #3
Are there any updates on this?
Thanks!
ho...@google.com <ho...@google.com> #4
Not an update, but a related link: BEAM support for sketch-based statistics
https://issues.apache.org/jira/browse/BEAM-2728
ro...@google.com <ro...@google.com> #5
Update: Andrea and I are actively working on this.
ma...@gmail.com <ma...@gmail.com> #6
It would be great if you open-sourced the solution. We want to use this with Apache Spark as well.
[Deleted User] <[Deleted User]> #7
Ditto.
[Deleted User] <[Deleted User]> #8
Ditto.
Is there any update on this item? It was asked over two years ago, and the only update has been "we're actively working on this." Is there an ETA when this could be ready?
ro...@google.com <ro...@google.com> #9
Thanks for asking, David! We don't have an ETA that we can share yet, but I can come back in a month and share our progress made on this project.
ro...@google.com <ro...@google.com> #10
Update: A BigQuery-compatible implementation of HyperLogLog++ has been open-sourced at https://github.com/google/zetasketch, and a design doc (https://docs.google.com/document/d/1U5aXdC9lDSOqT6FPHRulp-EutYiQ9KeHpgu-19CIfEI) about integrating it into Apache Beam has been sent to dev@beam.apache.org.
ro...@google.com <ro...@google.com> #11
Re #6: Thank you for your interest in ZetaSketch (https://github.com/google/zetasketch)! It would be great if the algorithm were supported in Apache Spark. Also note that Apache Beam will support it, and you can run Beam on Spark as well.
[Deleted User] <[Deleted User]> #12
Oh wow, this is fantastic -- nice job Google team!!
ro...@google.com <ro...@google.com> #13
Update: ZetaSketch's HLL++ algorithm is supported in Beam now!
(HllCount: https://github.com/apache/beam/blob/master/sdks/java/extensions/zetasketch/src/main/java/org/apache/beam/sdk/extensions/zetasketch/HllCount.java)
I will follow up with the release manager to make sure this feature becomes available in Beam 2.16.
or...@google.com <or...@google.com> #14
Hello,
My customer (Moonactive) could not use HLL between Dataflow and BigQuery before. Does it mean it should work now?
Thanks
ro...@google.com <ro...@google.com> #15
Hi Oren! Yes! Once Beam 2.16.0 is out, they will be able to use HLL between Dataflow and BigQuery through the Beam HllCount transforms.
Description