I have a table with an overall row count of 2304 and a column that should be unique. When I enable column distribution, it shows 2323 distinct values, while the unique count shows 2304. I can't figure out where this difference comes from. Why does the column show more distinct values than there are rows in the table?
Hi @hitank13 ,
Try debugging.
Create two measures:
DC = DISTINCTCOUNT('Table'[ColumnName])
CR = COUNTROWS('Table')
Then see which value turns up twice.
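One way to find values that turn up more than once is a DAX query like the sketch below, run from a DAX query window. 'Table' and [ColumnName] are the same placeholder names used in the measures, and "Occurrences" is just an illustrative column name, so adjust them to your model.

```dax
// Sketch only: list every value of the column that appears on more than one row.
// 'Table', [ColumnName] and "Occurrences" are placeholder names.
EVALUATE
FILTER (
    ADDCOLUMNS (
        VALUES ( 'Table'[ColumnName] ),
        // CALCULATE triggers context transition, so rows are counted per value
        "Occurrences", CALCULATE ( COUNTROWS ( 'Table' ) )
    ),
    [Occurrences] > 1
)
```

If the query returns no rows, the model holds no duplicates in that column.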
Regards,
Harsh Nathani
I created the measures. Both return the same count, which also matches the unique value count in column distribution. The distinct value count in column distribution, however, is higher than the total number of rows in the table.
DC Measure count - 2302
CR Measure count - 2302
Unique value count in Column distribution- 2302
Distinct value count in Column distribution- 2323
Hi @hitank13
Distinct values is the number of different values in the column, with each value counted once no matter how often it repeats.
Unique values is the number of values that appear exactly once.
COUNTROWS counts all rows in a table.
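As a rough sketch, the three counts can be reproduced as measures. The measure names are illustrative, and 'Table' and [ColumnName] are placeholders for your own table and column.

```dax
// Sketch only: 'Table' and [ColumnName] are placeholder names.

// All rows in the table
Row Count = COUNTROWS ( 'Table' )

// Number of different values in the column
Distinct Count = DISTINCTCOUNT ( 'Table'[ColumnName] )

// Number of values that appear on exactly one row
Unique Count =
COUNTROWS (
    FILTER (
        VALUES ( 'Table'[ColumnName] ),
        // context transition: count the rows that hold the current value
        CALCULATE ( COUNTROWS ( 'Table' ) ) = 1
    )
)
```

For a hypothetical column holding A, A, A, B, B, C, D, E, these give Row Count = 8, Distinct Count = 5 (A, B, C, D, E) and Unique Count = 3 (C, D, E).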
Best Regards
Maggie
Community Support Team _ Maggie Li
If this post helps, please consider accepting it as the solution to help other members find it more quickly.
Hi Maggie,
Thanks for explaining the difference between Unique and Distinct, though I already understand it.
Let me explain the issue using the image you posted. In the image, the row count is 8, the distinct count is 5 and the unique count is 3. In my case, the row count is 2302 and the unique count is 2302, but the distinct count is 2319 instead of 2302. (This is on a primary key column, which has no duplicate records.) How can the distinct count be higher than the total row count?