Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Get certified in Microsoft Fabric—for free! For a limited time, get a free DP-600 exam voucher to use by the end of 2024. Register now

Reply
maxon
Frequent Visitor

Count Distinct per each column based on unique IDs

Hello,

 

I have the raw data that looks like the one below:

 

UIDQuer_1Query_2...Query_40
AYYYN
BNYNY
CYNNN
AYNYN
DYYYY
ANYYN

 

And I need to transfotm the data into:

 

Query_1Number of Y per each unique ID
Query_2Number of Y per each unique ID
Query_3Number of Y per each unique ID
...Number of Y per each unique ID
Query_40Number of Y per each unique ID

 

I did this by removing duplicates from UID column and then calculating Y, but I noticed that is the wrong order and I have wrong number of Y, as I should only count Y per each Query, but I imagine that it's possible to create a universal measure instead of creating measure per query. 

 

My question is, how to count Y but considering to not duplicate to UID? From VBA perspective I could do it using Collection or Dictionary, but how to achive it using DAX?

4 REPLIES 4
vanessafvg
Super User
Super User

you could probably do something like this

Total Queries Query 1 = CALCULATE(sumx(values(test[UID]), COUNTROWS(test)), test[Quer_1] ="Y")
 
however that wouldn't solve the writing a measure per column.
 
you would probably need to transpose your data in power query  and if you plot the query on visual then you could probably create a generic measure 
see attached
pivoted is in the format you have  now the measure looks like this
test
Total Queries Query 1 = CALCULATE(sumx(values(Pivoted[UID]), COUNTROWS(Pivoted)), Pivoted[Quer_1] ="Y")
 
more generically would be  (unpivoted)
Total Queries for Unpivoted = CALCULATE(SUMX(VALUES('Unpivoted'[UID]), COUNTROWS('Unpivoted')), 'Unpivoted'[Value] = "y")
 
see attatched
 




If I took the time to answer your question and I came up with a solution, please mark my post as a solution and /or give kudos freely for the effort 🙂 Thank you!

Proud to be a Super User!




Hello, thank you for your solution. I have unpivoted the data, but the result of the measure is not right. For example for Query 1 the 'A' should be counted only once. Unfortunatelty I cannot remove duplicates in UID as for each query there different Y or N. 
----

----------

Maybe adding custom column:

Column = [UID] & [Query] &[Value])

then something like this:

CALCULATE(COUNTROWS(VALUES(Table[Column])), 'Table'[Value] = "Y"), what do think about it?

bcdobbs
Super User
Super User

I'd start by using the unpivot transform in Power Query to get it in form:

 

UID, Query Name, Value

 

Then add a custom column that has 1 if Y and 0 if N

 

At that point you can use AVERAGEX in a measure to do something like:

 

AVERAGEX (

 VALUES ( Table[UID] ),

 CALCULATE ( 

   SUM ( Table[CustomColumn]
)

)


Yoi can then use that measure in a matrix with Query Name in the rows.

 

That might need a bit of tweaking but I think the principal is sound.



Ben Dobbs

LinkedIn | Twitter | Blog

Did I answer your question? Mark my post as a solution! This will help others on the forum!
Appreciate your Kudos!!
maxon
Frequent Visitor

Hello, thank you for your solution, but for Query 1 the 'A' should be counted only once, not twice, how to modify it? 

----------

Maybe adding custom column:

Column = [UID] & [Query] &[Value])

then something like this:

CALCULATE(COUNTROWS(VALUES(Table[Column])), 'Table'[Value] = "Y"), what do think about it?

Helpful resources

Announcements
November Carousel

Fabric Community Update - November 2024

Find out what's new and trending in the Fabric Community.

Live Sessions with Fabric DB

Be one of the first to start using Fabric Databases

Starting December 3, join live sessions with database experts and the Fabric product team to learn just how easy it is to get started.

Las Vegas 2025

Join us at the Microsoft Fabric Community Conference

March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early Bird pricing ends December 9th.

Nov PBI Update Carousel

Power BI Monthly Update - November 2024

Check out the November 2024 Power BI update to learn about new features.