Solved: Re: Removing rows with unique values

cvegter · ‎05-19-2025

Hi,

I have a data set (~30 000 rows) where instead of removing the rows with duplicate values, I would like to remove the rows that have a value that's unique. There are too many rows to toggle off manually all the values that are only mentioned once. I can't seem to work out how to do this but surely there must be a way. This is because we need a list with solely the duplicates.

Edit: came here with this question because using the keep duplicate rows in the query editor also keeps the singular values.

Thanks!

p45cal · ‎05-19-2025

There'll probably be many ways but there are 2 in the attached Excel workbook below.

In the Power Query editor after loading the data:

Version 1

1. Add an index column

2. Group the rows on the fields you want to include as duplicates (not the index column!) creating All Rows and Count

3. Filter out the 1s form the Count column

4. Expand the Tables column but only the index column

5. Optional: Sort on the Index column to retain original source data row order

6. Remove the Count and Index columns.

Version 2

1. Group the rows on the fields you want to include as duplicates creating a Count column

2. Filter the Count column to keep the 1s

3. Remove the Count column (leaves you a table with only rows that occur once)

4. Merge the Source table with the table in step 3

5. Expand this new column choosing any one of the fields (the field chosen must have no nulls)

6. Filter this expanded column for only nulls

7. Remove this expanded column.

View solution in original post

v-karpurapud · ‎05-20-2025

Hi @cvegter

Could you please confirm if your query have been resolved the solution provided by @p45cal ? If they have, kindly mark the helpful response and accept it as the solution. This will assist other community members in resolving similar issues more efficiently.

Thank you

p45cal · ‎05-19-2025