This time we’re going bigger than ever. Fabric, Power BI, SQL, AI and more. We're covering it all. You won't want to miss it.
Learn moreDid you hear? There's a new SQL AI Developer certification (DP-800). Start preparing now and be one of the first to get certified. Register now
It would be useful to have an M function to keep duplicates based on selected column(s). This would allow the users to see the user to recognise where the duplicates occur and to choose which row to keep in the Table (from the duplicates). At this moment to find the duplicates I am using GroupBy, Allrows, Count rows, Select the groups that has more than 2 in count. For large dataset that has over 500000 rows having few duplicates, this approach creates unwanted too many groups a Thus, this is not an efficient method I feel. A Table.NonDistinct function in M may help. Is that possible?
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
We actually have an option out of the box today. You can find it inside of the Home tab inside the "Reduce rows" group, and with the label "Keep duplicates" as shown in the screenshot below