Amazeroth
Regular Visitor

Unique value of each column

Hello,


To identify the cleanups I still have to do on a fairly large dataset, I would like to generate a new table in which each column contains the unique values of the corresponding column in my original table.

 

I have tried removing duplicates, but that is applied across all the columns together. I want each column to be deduplicated independently.

I don't know if Power BI is the most appropriate tool for this, so I'm open to other solutions if they are more suitable.

 

Thank you in advance,

 

 

1 ACCEPTED SOLUTION

Hi @Amazeroth ,

After researching, I think the only option is to turn each column into its own table, remove duplicates from it, and then merge the tables back together. For your fairly large dataset, that approach apparently isn't practical. I don't know of a solution that does this directly.

 

Can I ask why you want to do this?  After the operation, the data doesn't make sense.

 

Best Regards,

Xue Ding

If this post helps, then please consider accepting it as the solution to help other members find it more quickly.


4 REPLIES
Anonymous
Not applicable

I think I understand what you are trying to do but if you have examples that would be great.

Have you tried group by?

 

The downside is that if you have multiple columns, it groups on all of them together, like this:

1  2  3  4
A  C  C  C
A  A  C  C
A  A  A  C
B  A  A  A
B  B  A  A
B  B  B  A
C  B  B  B
C  C  B  B
C  C  C  B

Thank you for your help. I'll look into this lead, but it doesn't seem to fit my case. In the meantime, here are some more explanations:

 

Here's an example of my starting table:

Columns 1  Columns 2  Columns 3
A          12         John
A          14         Bob
B          10         Bob
C          12         Bob
B          12         John

 

And this is the table I'd like to create:

Columns 1  Columns 2  Columns 3
A          12         John
B          14         Bob
C          10

 

This would allow me to easily detect inconsistent values that I still need to correct in my queries.
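
For readers looking for a way to do this outside Power BI, here is a minimal Python sketch of the transformation described above, using only the standard library. The function name `unique_per_column` and the empty-string padding are assumptions made for illustration; they do not come from the thread. It deduplicates each column independently, keeps values in first-seen order, and pads the shorter columns so the result is still rectangular.

```python
from itertools import zip_longest


def unique_per_column(header, rows, fill=""):
    """Return a table whose columns hold the distinct values of each input column."""
    # zip(*rows) transposes the rows into columns;
    # dict.fromkeys drops repeats while keeping first-seen order.
    columns = [list(dict.fromkeys(col)) for col in zip(*rows)]
    # The deduplicated columns can differ in length, so pad the short ones.
    body = [list(r) for r in zip_longest(*columns, fillvalue=fill)]
    return [list(header)] + body


# The starting table from the post above.
header = ["Columns 1", "Columns 2", "Columns 3"]
rows = [
    ("A", 12, "John"),
    ("A", 14, "Bob"),
    ("B", 10, "Bob"),
    ("C", 12, "Bob"),
    ("B", 12, "John"),
]

result = unique_per_column(header, rows)
for line in result:
    print(line)
```

With the sample data, the rows after the header match the target table: A / 12 / John, B / 14 / Bob, and C / 10 with an empty third cell. Note that the values on a given output row are unrelated to each other; each column should be read on its own.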

Hi @v-xuding-msft , 

Thanks for your feedback, I came to the same conclusion on my side... I'm thinking of using Python instead.

 

There were two goals behind this request:
1. Because our dataset is large, comes from several sources, and is very dirty, this table would let me discuss the required cleaning and format changes efficiently with the business teams.
2. Once the dataset is cleaned, it would let me quickly generate documentation on each column and the values it can contain.
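
As a rough illustration of the second goal, the sketch below builds a small per-column summary in plain Python. The function name `describe_columns` is hypothetical, and the occurrence counts are an addition made here as an assumption (the post only asks for the distinct values); they are included because rare values are often the ones that need cleaning.

```python
from collections import Counter


def describe_columns(header, rows):
    """Map each column name to its distinct values and how often each occurs."""
    # zip(*rows) transposes rows into columns; Counter tallies each value
    # and keeps them in first-seen order.
    return {name: dict(Counter(col)) for name, col in zip(header, zip(*rows))}


# Sample data taken from the example table earlier in the thread.
header = ["Columns 1", "Columns 2", "Columns 3"]
rows = [
    ("A", 12, "John"),
    ("A", 14, "Bob"),
    ("B", 10, "Bob"),
    ("C", 12, "Bob"),
    ("B", 12, "John"),
]

doc = describe_columns(header, rows)
for name, values in doc.items():
    # One line per column: the column name, then each value with its count.
    listing = ", ".join(f"{value} ({count})" for value, count in values.items())
    print(f"{name}: {listing}")
```

For a real dataset the rows would come from a file or database rather than a literal list; that loading step is left out here because the thread does not say where the data lives.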

 

I will close the post, thanks for your help.

Regards,

 

Amazeroth,
