Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Find everything you need to get certified on Fabric—skills challenges, live sessions, exam prep, role guidance, and more. Get started

Reply
grggmrtn
Post Patron
Post Patron

Remove duplicates without breaking query folding?

Is there a way to remove duplicates without breaking query folding? I need the folding (for incremental refresh), but I have a ton of duplicates (I go from 3 0378 271 rows to 10 472 after removing duplicates)...

7 REPLIES 7
Toddmcdermid
New Member

Use a Group By transform instead - that will fold on SQL data sources.

OPTIONAL - If you don't want an aggregate column added to the rowset (so you want it to behave functionally exactly like a remove duplicates), then edit the M that gets generated to remove the aggregate. For example, turn this:

= Table.Group(Source, {"Grouping Column 1","Grouping Column 2"}, {{"Count", each 1, type Int64}})

into this:

= Table.Group(Source, {"Grouping Column 1","Grouping Column 2"}, {})

amitchandak
Super User
Super User

@grggmrtn , how are you deleting duplicate as of now, power Query- delete duplicate ?

Yeah, exactly.

Hi @grggmrtn ,

 

This can happen if the date ranges are set NOT up to as follows: [Date] >= RangeStart and [Date] < RangeEnd. Is your upper bound set to be equal to or less than or just less than?

Refer to this thread:

https://community.powerbi.com/t5/Desktop/Problem-with-incremental-refresh-on-power-bi-premium/td-p/7... 

 

Best Regards,
Liang
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

Hi @V-lianl-msft thanks for the reply...

 

Your answer would make sense if I had actually set ranges, but I haven't gotten to setting the ranges yet, because query folding on my dataset breaks. The final step in my query (remove duplicates) breaks the folding.

 

So my question is, how can I remove the duplicates without breaking the folding?

@grggmrtn @amitchandak 

I could use this as well, did you ever find out how to do this? 

 

Table.Distinct() breaks the fold. How do we tell the servers to return a distinct list after all our previous steps? 

Never mind. Table.Distinct() only breaks the fold if you use it to remove duplicates from a subset of the tables columns, instead of the whole table. Using Table.Distinct() on the entire table does not break the query folding.

Helpful resources

Announcements
Fabcon_Europe_Social_Bogo

Europe’s largest Microsoft Fabric Community Conference

Join the community in Stockholm for expert Microsoft Fabric learning including a very exciting keynote from Arun Ulag, Corporate Vice President, Azure Data.

Power BI Carousel June 2024

Power BI Monthly Update - June 2024

Check out the June 2024 Power BI update to learn about new features.

RTI Forums Carousel3

New forum boards available in Real-Time Intelligence.

Ask questions in Eventhouse and KQL, Eventstream, and Reflex.