Microsoft Fabric Community Conference 2025, March 31 - April 2, Las Vegas, Nevada. Use code FABINSIDER for a $400 discount.
Register nowGet inspired! Check out the entries from the Power BI DataViz World Championships preliminary rounds and give kudos to your favorites. View the vizzies.
Hello All,
I have below calculated table..can you please let me know how to remove duplicate ID rows in below table.
For example 123 ID is duplicated in below table and I need to remove 123 ID row with 10/19/2022 date where link column is empty and keep 123 ID with link value present for 10/24/2022 date.
we would like to remove duplicate id's based on above condition
ID | link | Date |
123 | 10/19/2022 | |
456 | https://www.wikipedia.org | |
789 | ||
123 | https://www.wikipedia.org | 10/24/2022 |
568 | ||
897 |
thank you!
regards,
Ashwini
Solved! Go to Solution.
Hi @ashuaswinireddy ,
Here are the steps you can follow:
1. In Power query. Add Column – Index Column – From 1.
2. Create calculated column.
Rank =
RANKX(FILTER(ALL('Table'),'Table'[ID]=EARLIER('Table'[ID])),[Index],,ASC)
Flag =
var _maxdate=MAXX(FILTER(ALL('Table'),
'Table'[ID]=EARLIER('Table'[ID])),[Date])
var _count=COUNTX(FILTER(ALL('Table'),'Table'[ID]=EARLIER('Table'[ID])),[ID])
return
IF(
_count=1&&[Rank]=1,
1,
IF(
_count >1&&'Table'[Date]=_maxdate,1,0)
)
3. Create calculated table.
Table 2 =
var _table1=
FILTER('Table',[Flag]=1)
return
SUMMARIZE(
_table1,[ID],[link],[Date])
4. Result:
Best Regards,
Liu Yang
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly
Hi again,
Sorry for the late answer, but you can actually solve this in one DAX expression and one step:
Hi again,
Sorry for the late answer, but you can actually solve this in one DAX expression and one step:
I had a similar issue with duplicate Document Number - 1 with Blank Transmittal No and 1 with valid Transmittal No. I used this method to remove the blank Tr no! I like this 1 DAX query to build the desired table! Many thanks. 🙂
Thank you for your time and response!
Hi @ashuaswinireddy ,
Here are the steps you can follow:
1. In Power query. Add Column – Index Column – From 1.
2. Create calculated column.
Rank =
RANKX(FILTER(ALL('Table'),'Table'[ID]=EARLIER('Table'[ID])),[Index],,ASC)
Flag =
var _maxdate=MAXX(FILTER(ALL('Table'),
'Table'[ID]=EARLIER('Table'[ID])),[Date])
var _count=COUNTX(FILTER(ALL('Table'),'Table'[ID]=EARLIER('Table'[ID])),[ID])
return
IF(
_count=1&&[Rank]=1,
1,
IF(
_count >1&&'Table'[Date]=_maxdate,1,0)
)
3. Create calculated table.
Table 2 =
var _table1=
FILTER('Table',[Flag]=1)
return
SUMMARIZE(
_table1,[ID],[link],[Date])
4. Result:
Best Regards,
Liu Yang
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly
Thank you for your time and response!
I have another scenario where link is available for the previous date but not for the latest date.
For example- ID 897 link is available on 10/18/2022 and not on 10/26/2022. In this case I need to select ID 897 with link available row on 10/18/2022. Can you please let me know how to handle this scenario.
ID | link | Date |
123 | 10/19/2022 | |
456 | https://www.wikipedia.org | |
789 | ||
123 | https://www.wikipedia.org | 10/24/2022 |
568 | ||
897 | https://www.youtube.com | 10/18/2022 |
897 | 10/26/2022 | |
Once again thank you for your time!
Regards,
Ashwini
If you want to remove the rows completely from the report, it's more efficient to do this in Power Query than DAX.
If you just want to keep the non blank links, then click 'Transform Data' to open power query. Filter the Link column to exclude null and blank values, then Close and Apply changes.
If you want to remove duplicates and keep only the latest date, then Sort by Date in Power Query first. You'll need to add Table.Buffer to ensure the sort stays for the next step as per this answer: Solved: How do I buffer my table in Power Query? - Microsoft Power BI Community
Then right click on the ID column and remove duplicates. Finally, close and Apply.
Copying DAX from this post? Click here for a hack to quickly replace it with your own table names
Has this post solved your problem? Please Accept as Solution so that others can find it quickly and to let the community know your problem has been solved.
If you found this post helpful, please give Kudos C
I work as a Microsoft trainer and consultant, specialising in Power BI and Power Query.
www.excelwithallison.com
Hi,
If you want to keep certain rows duplicated on the ID you need to use rank function:
If you need more help, Please let me know How to rank (wich rules do you want to use to keep the correct row)
Br
Marius
March 31 - April 2, 2025, in Las Vegas, Nevada. Use code FABINSIDER for a $400 discount!
Check out the February 2025 Power BI update to learn about new features.
User | Count |
---|---|
89 | |
84 | |
66 | |
52 | |
31 |
User | Count |
---|---|
121 | |
113 | |
73 | |
65 | |
46 |