Join us at FabCon Atlanta from March 16 - 20, 2026, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.
Register now!Calling all Data Engineers! Fabric Data Engineer (Exam DP-700) live sessions are back! Starting October 16th. Sign up.
Hello everyone, a have a folder query wich I'll be updating every week, I need to delete duplicates and keep the lastest by "date modified" which is the reference date when the file was loaded in the folder. So I think im gonna need a new column in the table with the "Date modified" and then sort by this and remove duplicates, I don't know if this will work, any ideas?
Hi @Anonymous ,
I'm not sure what your table structure looks like, suppose your table are as shown in the screenshot below . It includes Field 1~Field 5 and field Date modified . The rows of the table display the same values except for the value of field date modified .
Field 1 | Field 2 | Field 3 | Field 4 | Field 5 | Date modified |
1 | AA | OK | 5 | 8 | 2020/2/16 |
1 | AA | OK | 5 | 8 | 2020/2/18 |
1 | AA | OK | 5 | 8 | 2020/2/20 |
2 | BB | FAIL | 7 | 6 | 2020/1/13 |
2 | BB | FAIL | 7 | 6 | 2020/1/14 |
3 | CC | OK | 4 | 2 | 2020/2/16 |
Now you want to remove the duplicate rows and only show the rows of latest date modified ? Whether your expected result should be like below screenshot?
Field 1 | Field 2 | Field 3 | Field 4 | Field 5 | Date modified |
1 | AA | OK | 5 | 8 | 2020/2/20 |
2 | BB | FAIL | 7 | 6 | 2020/1/14 |
3 | CC | OK | 4 | 2 | 2020/2/16 |
If the above understanding is incorrect, please correct and provide more details(include table structure and some sample data). Thank you.
Best Regards
Rena