Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Get certified in Microsoft Fabric—for free! For a limited time, get a free DP-600 exam voucher to use by the end of 2024. Register now

Reply
Tejinder
Helper I
Helper I

Dataflow vs Dataset incremental refresh times are very different, Help ?

I have scheduled an incremental refresh from an SQL server through PBi dataflows, a full refresh takes about 12 minutes but an incremental refresh takes 50+ minutes, why would that be the case? 

When I try the same native query and schedule the incremental refresh on the dataset instead of dataflow, it's much faster, under 3 minutes for incremental refresh. but I want to use dataflows to get my data as I have multiple reports from the same DB. 

here are the partitions of my dataset:

Tejinder_0-1696280004653.png

 



Also, My query is fairly complex with 10+ JOINS. would doing these JOINS  in power query instead, speed up incremental in dataflows.

Thanks 

1 ACCEPTED SOLUTION
lbendlin
Super User
Super User

Refresh times for incremental refresh depend heavily on the housekeeping activities (partition consolidation, starting over etc) as well on the current performance of your data source.

 

 I want to use dataflows to get my data as I have multiple reports from the same DB. 

That's not necessarily an argument for dataflows over datasets.  Dataflows are there to shield you ( the developer ) from slow data sources.  If your data source is not slow then use datasets.

 

Also, My query is fairly complex with 10+ JOINS. would doing these JOINS  in power query instead, speed up incremental in dataflows.

Not likely.  Spend your energy on making the SQL query faster through correct indexes and statistics.

 

View solution in original post

1 REPLY 1
lbendlin
Super User
Super User

Refresh times for incremental refresh depend heavily on the housekeeping activities (partition consolidation, starting over etc) as well on the current performance of your data source.

 

 I want to use dataflows to get my data as I have multiple reports from the same DB. 

That's not necessarily an argument for dataflows over datasets.  Dataflows are there to shield you ( the developer ) from slow data sources.  If your data source is not slow then use datasets.

 

Also, My query is fairly complex with 10+ JOINS. would doing these JOINS  in power query instead, speed up incremental in dataflows.

Not likely.  Spend your energy on making the SQL query faster through correct indexes and statistics.

 

Helpful resources

Announcements
November Carousel

Fabric Community Update - November 2024

Find out what's new and trending in the Fabric Community.

Live Sessions with Fabric DB

Be one of the first to start using Fabric Databases

Starting December 3, join live sessions with database experts and the Fabric product team to learn just how easy it is to get started.

Las Vegas 2025

Join us at the Microsoft Fabric Community Conference

March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early Bird pricing ends December 9th.

Nov PBI Update Carousel

Power BI Monthly Update - November 2024

Check out the November 2024 Power BI update to learn about new features.