Advance your Data & AI career with 50 days of live learning, dataviz contests, hands-on challenges, study groups & certifications and more!
Get registeredGet Fabric Certified for FREE during Fabric Data Days. Don't miss your chance! Learn more
It would be useful to have an M function to keep duplicates based on selected column(s). This would allow the users to see the user to recognise where the duplicates occur and to choose which row to keep in the Table (from the duplicates). At this moment to find the duplicates I am using GroupBy, Allrows, Count rows, Select the groups that has more than 2 in count. For large dataset that has over 500000 rows having few duplicates, this approach creates unwanted too many groups a Thus, this is not an efficient method I feel. A Table.NonDistinct function in M may help. Is that possible?
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
We actually have an option out of the box today. You can find it inside of the Home tab inside the "Reduce rows" group, and with the label "Keep duplicates" as shown in the screenshot below