Hi-
I'm running a dataflow on a PPU P1 capacity.
I have ~12 million rows.
I'm attempting a Table.Distinct on six columns of various types. I get incorrect distinct results (it removes at least one row that isn't a duplicate).
Per other threads I tried buffering the table first (Table.Buffer), but this produces an out-of-memory error.
Any suggestions? I see the recommendation that a buffer is used for more consistent results, but I don't think that is an option in my case.
The syntax is correct. Try grouping the rows on those six columns and look at the groups containing the rows that were removed wrongly; this might help you discover why those rows are being removed.
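The grouping check described above could be sketched in M roughly as follows. The table contents and the column names A to F are hypothetical stand-ins for the real six key columns; only Table.Group, Table.RowCount and Table.SelectRows are standard functions.

```powerquery
let
    // Hypothetical sample data standing in for the real table
    Source = #table(
        {"A", "B", "C", "D", "E", "F", "Value"},
        {
            {1, "x", #date(2024, 1, 1), true, 1.5, "k", 10},
            {1, "x", #date(2024, 1, 1), true, 1.5, "k", 20},
            {2, "y", #date(2024, 1, 2), false, 2.5, "m", 30}
        }
    ),
    // The six columns used in Table.Distinct (assumed names)
    KeyColumns = {"A", "B", "C", "D", "E", "F"},
    // One row per key combination, with a row count and the original rows nested
    Grouped = Table.Group(
        Source,
        KeyColumns,
        {
            {"RowCount", each Table.RowCount(_), Int64.Type},
            {"Rows", each _, type table}
        }
    ),
    // Keep only the key combinations that the engine treats as duplicates
    Suspects = Table.SelectRows(Grouped, each [RowCount] > 1)
in
    Suspects
```

Expanding the nested Rows column for a suspect group shows exactly which rows were considered equal, which makes it easier to spot a value that only looks identical (for example, text that differs in case or trailing spaces).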
The reason you received the suggestion about Table.Buffer is that when query folding occurs (the query is run at the source rather than in the Power BI engine), steps can be rearranged to be more efficient. Sometimes this leads to incorrect results.
Another way to stop this is to use Table.StopFolding instead of Table.Buffer, but I think you may end up getting the same message, as they do near enough the same thing.
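For reference, a minimal sketch of where Table.StopFolding would go in the query. The sample table and column names are hypothetical; in the real dataflow, Source would be the step that reads from the data source, and whether this avoids the memory error on 12 million rows is untested here.

```powerquery
let
    // Hypothetical sample data; replace with the real source step
    Source = #table(
        {"A", "B"},
        {{1, "x"}, {1, "x"}, {2, "y"}}
    ),
    // Ask the engine not to fold later steps back to the source
    NoFold = Table.StopFolding(Source),
    // Distinct now runs in the mashup engine on the listed columns
    Deduped = Table.Distinct(NoFold, {"A", "B"})
in
    Deduped
```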
Let me know if it happens to work for you though
Also, as another suggestion, could you try concatenating those columns into one column and then doing a distinct count directly on that one column?
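One way to read this suggestion, as a rough sketch: build a single text key from the key columns and run the distinct on that key. The sample data, the column names, the "|" delimiter, the "<null>" placeholder and the "en-US" culture are all assumptions made for illustration; pick a delimiter that cannot appear in the data.

```powerquery
let
    // Hypothetical sample data with mixed types and a null
    Source = #table(
        {"A", "B", "C"},
        {
            {1, "x", null},
            {1, "x", null},
            {2, "y", #date(2024, 1, 2)}
        }
    ),
    // Columns that define a duplicate (assumed names)
    KeyColumns = {"A", "B", "C"},
    // Convert each key value to text and join into one key per row
    WithKey = Table.AddColumn(
        Source,
        "DistinctKey",
        each Text.Combine(
            List.Transform(
                Record.FieldValues(Record.SelectFields(_, KeyColumns)),
                (v) => if v = null then "<null>" else Text.From(v, "en-US")
            ),
            "|"
        ),
        type text
    ),
    // Distinct on the single text column instead of six typed columns
    Deduped = Table.Distinct(WithKey, {"DistinctKey"}),
    // Drop the helper column once duplicates are removed
    Result = Table.RemoveColumns(Deduped, {"DistinctKey"})
in
    Result
```

Comparing Table.RowCount of this result with the row count from the original six-column Table.Distinct is a quick way to see whether the two approaches disagree.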