Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Get Fabric Certified for FREE during Fabric Data Days. Don't miss your chance! Learn more

Reply
Anonymous
Not applicable

Trying to eliminate cross column duplicates

Hi,

 

I am building a check for my database to eliminate duplicates. Straight up duplicates are easy to remove but sometimes people make typos or something, and I need to remove those too. 

 

This is a rough example of my data:

ReferenceDateDescription
ref11-1-2020Blub
ref23-1-2020Blob
ref34-1-2020Blab
ref3-14-1-2020Blab
ref11-1-2020Blib

 

Now I fuzzy matched it on itself to create this: 

ReferenceDateDescriptionDup.ReferenceDup.DateDup.Description
ref11-1-2020Blubref11-1-2020Blib
ref23-1-2020Blobref23-1-2020Blob
ref34-1-2020Blabref3-14-1-2020Blab
ref3-14-1-2020Blabref34-1-2020Blab
ref11-1-2020Blibref11-1-2020Blub

 

As you can see the same record effectively is still double, appearing once on the left as x-y and once on the right as y-x.
Now I need to find a way to eliminate these cross column duplicates and end up with this:

 

ReferenceDateDescriptionDup.ReferenceDup.DateDup.Description
ref11-1-2020Blubref11-1-2020Blib
ref34-1-2020Blabref3-14-1-2020Blab

 

I need both sides to check whether the double record really is double or intended that way. 

 

Thank you!

 

2 REPLIES 2
Greg_Deckler
Community Champion
Community Champion

Power Query or DAX?

 

In DAX you can remove duplicates by using SELECTCOLUMNS to select each column or columns into a table VAR. Then UNION and then DISTINCT.

 



Follow on LinkedIn
@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
DAX For Humans

DAX is easy, CALCULATE makes DAX hard...
Anonymous
Not applicable

Hej,

 

No M in Power Query.

 

And I'm not sure how that would work... 

 

Thanks though 🙂

 

Helpful resources

Announcements
Fabric Data Days Carousel

Fabric Data Days

Advance your Data & AI career with 50 days of live learning, contests, hands-on challenges, study groups & certifications and more!

October Power BI Update Carousel

Power BI Monthly Update - October 2025

Check out the October 2025 Power BI update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.

Top Solution Authors