Anonymous
Not applicable

Remove duplicates and keep the latest

Hello everyone, I have a folder query which I'll be updating every week. I need to delete duplicates and keep the latest row by "Date modified", which is the reference date when the file was loaded into the folder. So I think I'm going to need a new column in the table with the "Date modified", then sort by it and remove duplicates. I don't know if this will work. Any ideas?

2 REPLIES
Anonymous
Not applicable

Hi @Anonymous ,

I'm not sure what your table structure looks like, so suppose your table looks like the one below. It includes Field 1 to Field 5 and the field Date modified. Within each group, the rows have the same values in every field except Date modified.

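Field 1 | Field 2 | Field 3 | Field 4 | Field 5 | Date modified
1       | A       | A       | OK      | 58      | 2020/2/16
1       | A       | A       | OK      | 58      | 2020/2/18
1       | A       | A       | OK      | 58      | 2020/2/20
2       | B       | B       | FAIL    | 76      | 2020/1/13
2       | B       | B       | FAIL    | 76      | 2020/1/14
3       | C       | C       | OK      | 42      | 2020/2/16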

Do you want to remove the duplicate rows and show only the row with the latest Date modified for each group? If so, should your expected result look like the table below?

Field 1 | Field 2 | Field 3 | Field 4 | Field 5 | Date modified
1       | A       | A       | OK      | 58      | 2020/2/20
2       | B       | B       | FAIL    | 76      | 2020/1/14
3       | C       | C       | OK      | 42      | 2020/2/16
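
If that is the expected result, the underlying logic can be sketched outside Power Query to check the idea. The Python below is only an illustration under assumptions: the function name `keep_latest`, the helper `make_row`, and the choice of Field 1 to Field 5 as the duplicate key are hypothetical and not taken from your actual query. It keeps, for each combination of key fields, the row with the greatest Date modified.

```python
from datetime import date

# Assumption: rows count as duplicates when Field 1..Field 5 all match.
KEY_FIELDS = ["Field 1", "Field 2", "Field 3", "Field 4", "Field 5"]
DATE_FIELD = "Date modified"


def make_row(f1, f2, f3, f4, f5, modified):
    """Hypothetical helper: build one sample row as a dict."""
    return dict(zip(KEY_FIELDS + [DATE_FIELD], [f1, f2, f3, f4, f5, modified]))


def keep_latest(rows, key_fields=KEY_FIELDS, date_field=DATE_FIELD):
    """Return one row per key: the one with the latest date_field."""
    latest = {}
    for row in rows:
        key = tuple(row[f] for f in key_fields)
        # Keep the row if the key is new or this row is more recent.
        if key not in latest or row[date_field] > latest[key][date_field]:
            latest[key] = row
    return list(latest.values())


# Sample data copied from the first table above.
rows = [
    make_row(1, "A", "A", "OK", 58, date(2020, 2, 16)),
    make_row(1, "A", "A", "OK", 58, date(2020, 2, 18)),
    make_row(1, "A", "A", "OK", 58, date(2020, 2, 20)),
    make_row(2, "B", "B", "FAIL", 76, date(2020, 1, 13)),
    make_row(2, "B", "B", "FAIL", 76, date(2020, 1, 14)),
    make_row(3, "C", "C", "OK", 42, date(2020, 2, 16)),
]

result = keep_latest(rows)
for row in result:
    print(row)
```

In Power Query itself, the same idea is commonly expressed along the lines you describe: sort descending on the date column, then remove duplicates on the key columns. Buffering the sorted table (for example with Table.Buffer) before removing duplicates is often recommended so that the sort order is respected, but please verify this against your own folder query.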

 

If my understanding is incorrect, please correct it and provide more details (including the table structure and some sample data). Thank you.

 

Best Regards

Rena
