Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Data Days is here! Join us now for 60+ days of learning, challenges, and connection. Learn more

Reply
Elisa112
Helper V
Helper V

Copy data de duplication

Hello, I am very new to synapse and am tasked with getting data from cloud azure database to a dedicated pool.  So far I have created a copy task to bring the data in, so far so good, the copy task is now populating tables (around 20 tables) , however now the issue is duplication and increasing data.  What is the best option for deduplicating the tables, should I now create pipeline tasks to drop the tables each day, since a full data refresh is needed daily and if so how should this be done in the simplest and most efficient way? 

All assistance greatly received.

Thank you

1 ACCEPTED SOLUTION
Anonymous
Not applicable

HI @Elisa112,

Perhaps you can try to invoke the query editor for further data cleanup in the data pipeline if it suitable for your requirements:

Use a dataflow in a pipeline - Microsoft Fabric | Microsoft Learn

Regards,

Xiaoxin Sheng

View solution in original post

2 REPLIES 2
Element115
Memorable Member
Memorable Member

@Elisa112 You need to implement an incremental copy algorithm so you only ingest new data every day instead of ingesting everything from scratch again, day after day.

Anonymous
Not applicable

HI @Elisa112,

Perhaps you can try to invoke the query editor for further data cleanup in the data pipeline if it suitable for your requirements:

Use a dataflow in a pipeline - Microsoft Fabric | Microsoft Learn

Regards,

Xiaoxin Sheng

Helpful resources

Announcements
Fabric Data Days is here Carousel

Fabric Data Days 2026

Don't miss out on Data Days, June 15 through August 7. Learn Fabric, Power BI, SQL, AI and more.

June Fabric Update Carousel

Fabric Monthly Update - June 2026

Check out the June 2026 Fabric update to learn about new features.