Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Enhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends September 15. Request your voucher.

Reply
Karthick_Balaje
New Member

Data Flow Gen 2 Issue

I used Dataflow Gen 2 to push the data from bronze to Silver with a little transformation (wrote a basic logic to perform Distinct Filtered Rows" operation - Filtering rows based on a condition (Primary Key not null and not 0)). When I select save and run, after validation, the dataflow gen 2 runs for a while, and it fails. Even after failing, I see most of the tables are pushed to Silver, and one table with 4 million rows is alone pushed. Why? 

Because of this, I am unable to run my main pipeline, and sometime I see there are duplicate rows at my target destination even after selecting "Replace" as an update method (whenver the dataflow gen 2 runs).

- Thanks,
Karthick

3 REPLIES 3
AntoineW
Responsive Resident
Responsive Resident

@Karthick_Balaje,

 

If I understand well, you have perform a transformation to deduplicate a column but you also have deplucate in the destination. 

 

- Because by selecting the update method as "Replace", it will normally replace all records. Make sure your 4M-row entity is truly set to Replace (and not “Append” on an existing table with manual settings)

 

- If two dataflows/pipelines write to the same table (or a new run starts before the previous one finishes), you can get duplicates—especially if any writer uses Append. Make sure in the pipeline or others artefacts that is using the table, there is nothing in the "append" mode.

 

- Otherwise, create a tickets to microsoft support team.

 

Best regards,

Antoine

BhaveshPatel
Community Champion
Community Champion

@Karthick_Balaje Thank you. You should not have to move data from bronze to silver.

There are two ways you can achieve this: One you should use Power BI Data flow Gen 2 or the second option is Use Notebooks and divide that into bronze, silver and gold layer.

 

If I use Power BI Dataflow Gen 2, then do all the transformations and save the data as a Fabric SQL Database. 

BhaveshPatel_0-1757307477544.png

one task at a time. ( One table at a time ). It can handle billions of rows at a time in a single table. 

 



 

Thanks & Regards,
Bhavesh

Love the Self Service BI.
Please use the 'Mark as answer' link to mark a post that answers your question. If you find a reply helpful, please remember to give Kudos.
Karthick_Balaje
New Member

Helpful resources

Announcements
August Fabric Update Carousel

Fabric Monthly Update - August 2025

Check out the August 2025 Fabric update to learn about new features.

August 2025 community update carousel

Fabric Community Update - August 2025

Find out what's new and trending in the Fabric community.

Top Kudoed Authors