Fixed
Status Update
Comments
ac...@gmail.com <ac...@gmail.com> #2
Can't COUNT_DISTINCT() over a specific date range!!! This appears to be a common problem! Please fix ASAP!
[Deleted User] <[Deleted User]> #3
Instead of counting unique values in a specified column COUNT_DISCTINCT counts rows in a database (on a view, not a table, to be more specific)... It works as expected with spreadsheets.
ph...@verizon.com <ph...@verizon.com> #4
Having the same issue with count distinct when selecting a specific filter
[Deleted User] <[Deleted User]> #5
From my side, I'm using Postgres Database (All good in Postgres doing the same grouping-unique, so no issues in DB or in Data).
If no values are selected in Filter Control widget the COUNT_DISTINCT is working as expected as it's getting Unique values for the entire dataset, but when I put a value in Filter Control, COUNT_DISTINCT is generating a wrong value.
If no values are selected in Filter Control widget the COUNT_DISTINCT is working as expected as it's getting Unique values for the entire dataset, but when I put a value in Filter Control, COUNT_DISTINCT is generating a wrong value.
th...@eyeful.us <th...@eyeful.us> #6
I've run into the same issue. I'm connected to Postgres. When I apply a date filter to any metrics that use COUNT DISTINCT, the results wind up being COUNTs.
[Deleted User] <[Deleted User]> #7
I can't believe that Google has left this bug unresolved for months, this is a joke. Any time we apply date filtering, and try to utilise count distinct aggregation, when we apply further filtering through other dimensions, the metrics become extremely overstated.
Please sort this out Google, we will have to move to a different tool as this is fundamental functionality...
Please sort this out Google, we will have to move to a different tool as this is fundamental functionality...
ju...@gmail.com <ju...@gmail.com> #8
My read on this is that Google doesn't care about Postgres/Redshift support for GDS, and that anyone not on BigQuery should choose a different BI tool. It's really unfortunate because GDS seemed to have a lot of potential.
ph...@verizon.com <ph...@verizon.com> #9
We're using Big Query and have similar issue, don't know how long we have
to wait to get this resolved.
On Tue, Feb 4, 2020 at 1:02 PM <buganizer-system@google.com> wrote:
to wait to get this resolved.
On Tue, Feb 4, 2020 at 1:02 PM <buganizer-system@google.com> wrote:
[Deleted User] <[Deleted User]> #10
Yeah, I don't think this is connector specific. It more so seems like a bug in how Data Studio handles count distinct aggregation when applying a date range and in conjunction with filtering with other dimensions. I created another thread for this here: https://issuetracker.google.com/issues/148813902
Feel free to jump in on it and comment as well, this bug is affecting us massively here.
Feel free to jump in on it and comment as well, this bug is affecting us massively here.
[Deleted User] <[Deleted User]> #11
On our side it's probably working with a view vs actual table. Funny enough same data with the same filters displays in expected number of rows (so it can distinguish between unique entries properly (same string)).
[Deleted User] <[Deleted User]> #12
Interesting. I am using Postgres and I gave your idea a shot, but a view and a table result in the same bug. However, I also tested to load the exact same data source onto Google Sheets and then load it onto Data Studio and the problem was resolved. So I was wrong^^ it IS connector specific. I guess the guy above^ is probably right. I'm pretty sure Google is aware of this but doubt they will ever fix it.
Best case if any one stumbles upon this 1) Either use a proxy like GSheets to load your data on there first if you are experiencing this bug with MySQL or Postgres or whatever other engine, or 2) Switch to a different BI tool!
Best case if any one stumbles upon this 1) Either use a proxy like GSheets to load your data on there first if you are experiencing this bug with MySQL or Postgres or whatever other engine, or 2) Switch to a different BI tool!
mi...@netfoundry.io <mi...@netfoundry.io> #13
Using Redshift with the Postgresql connector. When I render the individual values as a table, everything is correct, but when I render a COUNT DISTINCT in a score card I get incorrect numbers similar to what has been described above. We've really liked Data Studio so far as a BI Tool but it's a serious bullet if it can't do such a basic counting function correctly. The worst thing to present in a business report is data that is obviously wrong.
ju...@gmail.com <ju...@gmail.com> #14
Same problem here. Count_distinct(column) fails when multiple values of a filter is applied.
Very Critical for us.
Very Critical for us.
da...@gmail.com <da...@gmail.com> #15
It's incredible given how debilitating this is, how long this bug has existed 😢
[Deleted User] <[Deleted User]> #16
Same problem on Postgres, please get this fixed, it GROUP BY on the date range dimension field so it's unable to COUNT DISTINCT properly
an...@gmail.com <an...@gmail.com> #17
I've ran into this issue before using a Postgres connector but I've ran into it again now using the file upload (.csv) connector too. Interestingly, the Data Studio Explorer shows the correct values (using same config)!
[Deleted User] <[Deleted User]> #18
I'm having the same issue, reading from a CSV and unable to count_distinct() a text field. Somehow this is fixed if I blend the data
ja...@roeye.com <ja...@roeye.com> #19
Same here. Postgres connector. count_distinct() returns the value as if it were grouped by day and then summed, rather than a true distinct count for the duration of the date range selected
gu...@gmail.com <gu...@gmail.com> #20
Same issue here...csv file, when a data range is selected, them returns incorrect value
eg...@gmail.com <eg...@gmail.com> #21
same here. Mysql connector. Data range filter + count distinct >> wrong value
[Deleted User] <[Deleted User]> #22
Same issue. We solved by combining the data with itself, but it's a very dirty solution. Wait for a real fix !!
I also try using directly a calculated field with COUNT_DISTINCT() but it gave the same wrong result.
I also try using directly a calculated field with COUNT_DISTINCT() but it gave the same wrong result.
mo...@swisslos.ch <mo...@swisslos.ch> #23
This has nothing to do with connetors or anything.
The problem is probably that COUNT_DISTINCT get's executed before the filter and afterwards the filtered values get aggregated together (mostly summed). It's a real Deal breaker ...
The problem is probably that COUNT_DISTINCT get's executed before the filter and afterwards the filtered values get aggregated together (mostly summed). It's a real Deal breaker ...
et...@dsconnect.us <et...@dsconnect.us> #24
Same issue here, from postgres connector. I would really appreciate if some time was allocated to fix this issue.
co...@contatointernet.com.br <co...@contatointernet.com.br> #25
Same here. Postgres connector. count_distinct() returns the value as if it were grouped by day and then summed, rather than a true distinct count for the duration of the date range selected
fo...@gmail.com <fo...@gmail.com> #26
It fails when dimension is a date and metric is CTD. I use mysql connector and experiencing same issue.
da...@e-me.co <da...@e-me.co> #27
+1 It fails when dimension is a date and metric is CTD. I use mysql connector and experiencing same issue.
ad...@nr1motor.nl <ad...@nr1motor.nl> #28
+1 same problem here wtih PostgreSQL connector.
ja...@wavo.me <ja...@wavo.me> #29
+ 1 same problem here with CTD and PostgreSQL
pa...@myhotelshop.de <pa...@myhotelshop.de> #30
We are also facing this issue, especially when doing a roll-up to year-week. Funny thing is, in some cases it works and in other cases not.
I even get different results on two different data sources while both of them use the same custom query.
I can not put my finger on where the difference is.
EDIT 1:
I was able to get arround the issue by adding
CAST(DATE_FORMAT(date, '%Y%u') AS CHAR) AS year_week
to the query. Using this as a dimension works fine.
EDIT 2:
After having added filter controls to the page, the described workarround ceased to work on that page. On another page that uses the same data source, it still works however.
The next strange thing is, that with my described workarround I still get correct numbers in a pivot table visualisation but not with stacked column chart visualisation. It becomes even more weird as I already successfully applied my workarround to another stacked column visualisation.
I am baffled and this issue is indeed a huge annoyance.
I even get different results on two different data sources while both of them use the same custom query.
I can not put my finger on where the difference is.
EDIT 1:
I was able to get arround the issue by adding
CAST(DATE_FORMAT(date, '%Y%u') AS CHAR) AS year_week
to the query. Using this as a dimension works fine.
EDIT 2:
After having added filter controls to the page, the described workarround ceased to work on that page. On another page that uses the same data source, it still works however.
The next strange thing is, that with my described workarround I still get correct numbers in a pivot table visualisation but not with stacked column chart visualisation. It becomes even more weird as I already successfully applied my workarround to another stacked column visualisation.
I am baffled and this issue is indeed a huge annoyance.
x-...@qstream.com <x-...@qstream.com> #31
It seems to be the number of filters applied which causes a problem.
[Deleted User] <[Deleted User]> #32
I seem to be having this issue with a csv upload. Data studio sums up the field as opposed to counting unique accounts. This is only an issue when multiple drop downs are selected from the filter. It works as it should at total level or with one filter selected.
ji...@gmail.com <ji...@gmail.com> #33
Same issue, distinct count is not working on applying the date range filter. However, it populates correct result without any filter.
[Deleted User] <[Deleted User]> #34
Same here.
[Deleted User] <[Deleted User]> #35
+1
[Deleted User] <[Deleted User]> #36
+1
ma...@gmail.com <ma...@gmail.com> #37
+.1 I have noticed count distinct displaying incorrect numbers after a filter has been selected, and decided it's best to stop using it until a fix has been implemented properly.
li...@splacer.co <li...@splacer.co> #38
Hi.
I was able to easily reproduce the issue - Google - It's time to fix this - Its becoming a real show-stopper for your platform.
How to reproduce:
1. Create a new Postgres data source.
2. Use the following custom query:
select '1-jan-2020'::date as sold_date, 'text1' as name
union all
select '2-jan-2020'::date as sold_date, 'text1' as name
3. Build the data source, and create a report. put the sold_date as the date dimension, and count and count distinct the names.
You can see the issue happening in the screenshot. Count = 2 while Count Distinct should be 1 - but shows 2.
Why this is happening:
-----------------------------------
The reason for this bug is the underline query, sent by the Google Data Connector to Postgres:
SELECT name, COUNT(DISTINCT name) AS qt_53gfjojccc, COUNT(1.000000000000000) AS qt_mct0dojccc, sold_date
FROM (select '1-jan-2020'::date as sold_date, 'text1' as name
union all
select '2-jan-2020'::date as sold_date, 'text1' as name) AS t0 GROUP BY name, sold_date ORDER BY qt_mct0dojccc DESC
As you can see - the grouping includes the sold_date field - which will return separate distinct values for each day. Summarising them is a wrong distinct calculation.
BTW Google - why aren't you passing the date dimension as a WHERE filter - Your queries query all the data on the table - without the need - causing a bad performance and long running queries.
This is related, because the right way to do this distinct calculation is to use the date in the WHERE clause and not group by date.
Google - please fix this.
Lior Ash
I was able to easily reproduce the issue - Google - It's time to fix this - Its becoming a real show-stopper for your platform.
How to reproduce:
1. Create a new Postgres data source.
2. Use the following custom query:
select '1-jan-2020'::date as sold_date, 'text1' as name
union all
select '2-jan-2020'::date as sold_date, 'text1' as name
3. Build the data source, and create a report. put the sold_date as the date dimension, and count and count distinct the names.
You can see the issue happening in the screenshot. Count = 2 while Count Distinct should be 1 - but shows 2.
Why this is happening:
-----------------------------------
The reason for this bug is the underline query, sent by the Google Data Connector to Postgres:
SELECT name, COUNT(DISTINCT name) AS qt_53gfjojccc, COUNT(1.000000000000000) AS qt_mct0dojccc, sold_date
FROM (select '1-jan-2020'::date as sold_date, 'text1' as name
union all
select '2-jan-2020'::date as sold_date, 'text1' as name) AS t0 GROUP BY name, sold_date ORDER BY qt_mct0dojccc DESC
As you can see - the grouping includes the sold_date field - which will return separate distinct values for each day. Summarising them is a wrong distinct calculation.
BTW Google - why aren't you passing the date dimension as a WHERE filter - Your queries query all the data on the table - without the need - causing a bad performance and long running queries.
This is related, because the right way to do this distinct calculation is to use the date in the WHERE clause and not group by date.
Google - please fix this.
Lior Ash
[Deleted User] <[Deleted User]> #39
I spend so much time for nothing because of this bug, unbeliveable!
br...@riskgenius.com <br...@riskgenius.com> #40
What is the status on this?
4v...@gmail.com <4v...@gmail.com> #41
I have the same issue.
se...@antidote.me <se...@antidote.me> #42
+1
We're also impacted by this issue - using the Postgres connector, count distinct (on non-primary key fields) is incorrect on scorecards, tables, ... when applying a date filter.
I don't have anything new to add - post #38 above has perfectly explained the issue: distinct counts are generated per day and then summed, which is incorrect as the distinct counts need to be performed across the entire subset.
We're also impacted by this issue - using the Postgres connector, count distinct (on non-primary key fields) is incorrect on scorecards, tables, ... when applying a date filter.
I don't have anything new to add - post #38 above has perfectly explained the issue: distinct counts are generated per day and then summed, which is incorrect as the distinct counts need to be performed across the entire subset.
x-...@qstream.com <x-...@qstream.com> #43
Belated Happy Birthday little bug. One year old!
da...@fashion.cloud <da...@fashion.cloud> #44
Congratz! Seems like you're going to have a long life Little bug:)
[Deleted User] <[Deleted User]> #45
any news in this fix!?
[Deleted User] <[Deleted User]> #46
Switched to Grafana, totally worth it.
j....@gmail.com <j....@gmail.com> #47
It's been a year since this was reported originally... A fix would be great!
x-...@qstream.com <x-...@qstream.com> #48
Or even an acknowledgement that somebody is listening.
*Chris McCauley*
Chief Technology Officer
e: chris.mccauley@qstream.com
a: WeWork, Level 3, 5 Harcourt Road, St Kevin's, Dublin 2, Ireland
*TRANSFORM LEARNING* | Engage. Reinforce. Analyze.
On Thu, Nov 19, 2020 at 6:01 AM <buganizer-system@google.com> wrote:
*Chris McCauley*
Chief Technology Officer
e: chris.mccauley@qstream.com
a: WeWork, Level 3, 5 Harcourt Road, St Kevin's, Dublin 2, Ireland
*TRANSFORM LEARNING* | Engage. Reinforce. Analyze.
On Thu, Nov 19, 2020 at 6:01 AM <buganizer-system@google.com> wrote:
ju...@komed-health.com <ju...@komed-health.com> #49
Same issue here. Any progress?
fe...@comprandoengrupo.net <fe...@comprandoengrupo.net> #50
Please, would be great to have an answer if this is going to be fixed or not, so then we can finally keep or move to other products... Thank you.
[Deleted User] <[Deleted User]> #51
--
[image: Ayenda]
*Andrés Sarrazola*
CEO | *Ayenda Hoteles*
Celular: +573015945919
Web:ayenda.com
email: andres@ayenda.co
Dirección: Calle 7D #43A 117 - Torre Almagrán - Piso 9
Medellín - Colombia
<https://www.facebook.com/ayendahoteles/ >
<https://www.instagram.com/ayendahoteles/ >
[image: Ayenda]
*Andrés Sarrazola*
CEO | *Ayenda Hoteles*
Celular: +573015945919
Web:
email: andres@ayenda.co
Dirección: Calle 7D #43A 117 - Torre Almagrán - Piso 9
Medellín - Colombia
<
<
pr...@ramp.network <pr...@ramp.network> #52
+1
bg...@gmail.com <bg...@gmail.com> #53
Was enjoying GDS until I spent an evening trying to track down why our dashboard was completely wrong in counting users.
Sounds like best available solution is "Don't use GDS"?
Sounds like best available solution is "Don't use GDS"?
[Deleted User] <[Deleted User]> #54
I've discovered this incredible bug yesterday. Hope there will be a solution to fix this... (there is no problem when the source is a google sheet, but with PostgreSQL or CSV files)
me...@paradigmadigital.com <me...@paradigmadigital.com> #55
+1
More than one year later this bug still persists...
More than one year later this bug still persists...
ch...@avondalepark.com <ch...@avondalepark.com> #56
Someone should change the Status from 'New' to 'Getting kind of old'
[Deleted User] <[Deleted User]> #57
Hi everyone,
I've been also suffering this bug for more than a year and today I realized there's a dirty workaround that works for me — I'm not sure if it would work for all combinations, filters, etc.
- Create your COUNT_DISTINCT scorecard as usual, and apply filters as needed
- Convert it into a table
- Add whatever dimension you're filtering as dimensions in the table
- Use styles to hide everything except the metric scorecard you need
I've attached a screenshot so it's easier to explain.
I hope it helps! And I hope the team can prioritize this fix.
PS: I've gotten now too used to display tables as scorecards due to limitations or bugs, let me know if my feedback is valuable here :-)
mu...@ambar.tech <mu...@ambar.tech> #58
+1
rm...@google.com <rm...@google.com> #59
Hi everyone,
Thanks for upvoting this issue and sharing all the helpful details. We’ve investigated the issue and are actively working on a solution. As a few of you have noted, there's an issue that's connector-specific (it applies to the Cloud SQL for MySQL, MySQL, and PostgreSQL connectors). When charts/visualizations are filtered, or grouped by calculated dimensions, and perform specific aggregations such as Count Distinct, the generated query can yield incorrect results. While we wish this were a quick fix, it's not. We're actively working on an overhaul of the query generator to address this issue. Timing may change as work progresses, but we estimate we may be able to roll out an improvement this summer. We'll post any updates here as we get closer to rollout or the timeframe changes.
Thanks for upvoting this issue and sharing all the helpful details. We’ve investigated the issue and are actively working on a solution. As a few of you have noted, there's an issue that's connector-specific (it applies to the Cloud SQL for MySQL, MySQL, and PostgreSQL connectors). When charts/visualizations are filtered, or grouped by calculated dimensions, and perform specific aggregations such as Count Distinct, the generated query can yield incorrect results. While we wish this were a quick fix, it's not. We're actively working on an overhaul of the query generator to address this issue. Timing may change as work progresses, but we estimate we may be able to roll out an improvement this summer. We'll post any updates here as we get closer to rollout or the timeframe changes.
da...@fashion.cloud <da...@fashion.cloud> #60
That's great news! Good luck with that, we'll be cheering for you.
cr...@ayenda.co <cr...@ayenda.co> #61
:+1
[Deleted User] <[Deleted User]> #62
I have the same issue
iv...@gmail.com <iv...@gmail.com> #63
I have the same issue. In my case count distinct shows the sum of count distinct but over the filtered periods (months in my case).
[Deleted User] <[Deleted User]> #64
Thanks contacted
[Deleted User] <[Deleted User]> #65
+1
Need this issue to be resolved ASAP.
Need this issue to be resolved ASAP.
ol...@121gameon.com <ol...@121gameon.com> #66
+1.
The same for me. PG queries. CTD works as an ordinary "count" and it doesn't work for all queries where I use datetime.
I have to use DATE(created_at)/to_char(created_at,'YYYYMMDD') to make it work
The same for me. PG queries. CTD works as an ordinary "count" and it doesn't work for all queries where I use datetime.
I have to use DATE(created_at)/to_char(created_at,'YYYYMMDD') to make it work
ch...@avondalepark.com <ch...@avondalepark.com> #67
We've given up waiting and moved to Amazon.
kk...@cloudflare.com <kk...@cloudflare.com> #68
+1
rm...@google.com <rm...@google.com> #69
Hi everyone,
We’ve now rolled out an improvement to these connectors (PostgreSQL as well as MySQL and Cloud SQL for MySQL) to address this issue. The improved connector generates SQL queries that more consistently yield accurate Count Distinct results. It does this by performing all filtering and grouping in the SQL query when possible, or by querying unaggregated data when necessary. (Some calculations don’t have a direct translation in a particular SQL dialect, such as PostgreSQL’s.) We’re eager to hear your feedback about whether this improvement resolves your issues.
We’ve now rolled out an improvement to these connectors (PostgreSQL as well as MySQL and Cloud SQL for MySQL) to address this issue. The improved connector generates SQL queries that more consistently yield accurate Count Distinct results. It does this by performing all filtering and grouping in the SQL query when possible, or by querying unaggregated data when necessary. (Some calculations don’t have a direct translation in a particular SQL dialect, such as PostgreSQL’s.) We’re eager to hear your feedback about whether this improvement resolves your issues.
ka...@logicalposition.com <ka...@logicalposition.com> #70
I am having this issue again
[Deleted User] <[Deleted User]> #71
I too am still having this same issue
[Deleted User] <[Deleted User]> #72
I am having this issue using BigQuery as a data source
[Deleted User] <[Deleted User]> #73
this is not fixed
[Deleted User] <[Deleted User]> #74
For me GDS was not showing a correct count_distinct when displaying data in a chart. Since my data source are quite old I added them once more and created new charts and now it works.
jt...@amg.team <jt...@amg.team> #75
I am currently using the Google Cloud Platform using File Upload data, and the Count Distinct function doesn't seem to work with or without date ranges. It was interesting to see that because I had 18 days of data, it seemed to multiply the numbers by the number of days in the month...As I adjust the date to 1 day, 2 days, the count distinct seemed to multiply each time to those date ranges. Is there any further progress on this bug? We are purely attempting to count the unique values for a field for a specific date range.
[Deleted User] <[Deleted User]> #76
Change your methodology to connect it to a Google sheet
Dump your raw data in Google sheet every time you need to update
Fixed my count issue. Had chronic issues with file upload
On Sun, Oct 24, 2021 at 2:31 PM <buganizer-system@google.com> wrote:
Dump your raw data in Google sheet every time you need to update
Fixed my count issue. Had chronic issues with file upload
On Sun, Oct 24, 2021 at 2:31 PM <buganizer-system@google.com> wrote:
--
James Alexander HeinenAssistant Product Manager - Global Portfolio/Master
Data
The Scotts Miracle-Gro CompanyC:937-209-8879scottsmiraclegro.com
James Alexander HeinenAssistant Product Manager - Global Portfolio/Master
Data
The Scotts Miracle-Gro CompanyC:
te...@factor3.co.uk <te...@factor3.co.uk> #77
still not working with CSV import, just bumping this because we are April 2023
Going to have to rebuild using Google sheets or an SQl Connector
Going to have to rebuild using Google sheets or an SQl Connector
vp...@ptc.com <vp...@ptc.com> #78
Still not working with CSV import when using date ranges. At report level without date filter seems to count correct. May 2023
[Deleted User] <[Deleted User]> #79
This issue is still not fixed
00...@gmail.com <00...@gmail.com> #80
It appears the team just never cared to look into a fix for the CSV connector. Still an issue years later.
ni...@mercadolivre.com <ni...@mercadolivre.com> #81
I'm having the same issue, but I'm connected to a bigQuery table and the count distinct seems to be broken only with a specific id field, and it's not like "kinda broken" its counting 4 where its supposed to be 500+
um...@gmail.com <um...@gmail.com> #82
having the same issue i made the manual and the difference between distinct count its now like 1000 its to much to considere this a slight percent of counting difference or something, why last year was working correctly and this year is having this issue that even if F = F on excel its different according to data, i had to make some arrange to make it locket studio do the right and its really bothersome
ma...@clearlyfiltered.com <ma...@clearlyfiltered.com> #83
I am also experiencing this issue-- when the same data is placed in a table the table creates the correct number of unique rows, but, when using the scorecard the "count" and "count_distinct" options produce a total record count that is identical.
br...@bmcclenahan.com <br...@bmcclenahan.com> #84
A problem has started occurring in the past week in which COUNT_DISTINCT returns incorrect values when it is applied to an expression that modifies a field - the unique values of the underlying field are returned and the expression is ignored eg COUNT_DISTINCT(DATETIME_TRUNC(datevalue, WEEK)) returns the number of values of datevalue not the number of weeks (which would usually be less)
ka...@colab.re <ka...@colab.re> #85
I'm trying to use count distinct with date range and the results are wrong. I'm using with a CSV connector. That's sad that they didn't correct yet.
[Deleted User] <[Deleted User]> #86
Found a solution for this for data source via csv connector (or actually GCP cloud storage to csv files)
We have a column named "Delivery Date" in a csv. Values are on the following format "YYYY-MM-DD", e.g "2023-12-14".
By creating a new calculated field (we just gave it the same name, and hid the other one) it it started to work for us.
PARSE_DATE(“%Y-%m-%d”, CAST(Delivery Date as TEXT))
Also, as mentioned by many others here, it seems to be a problem with connecting to csv files. We tried importing the full same csv (62,000 rows) to a Google Sheet, and then using it as a data source. Count distinct works fine with that data source.
See attached screenshot.
We have a column named "Delivery Date" in a csv. Values are on the following format "YYYY-MM-DD", e.g "2023-12-14".
By creating a new calculated field (we just gave it the same name, and hid the other one) it it started to work for us.
PARSE_DATE(“%Y-%m-%d”, CAST(Delivery Date as TEXT))
Also, as mentioned by many others here, it seems to be a problem with connecting to csv files. We tried importing the full same csv (62,000 rows) to a Google Sheet, and then using it as a data source. Count distinct works fine with that data source.
See attached screenshot.
Description