Tejinder
Helper I

Dataflow vs. dataset incremental refresh times are very different. Help?

I have scheduled an incremental refresh from a SQL Server database through Power BI dataflows. A full refresh takes about 12 minutes, but an incremental refresh takes 50+ minutes. Why would that be the case?

When I use the same native query and schedule the incremental refresh on the dataset instead of the dataflow, it is much faster: under 3 minutes for an incremental refresh. However, I want to use dataflows to get my data, as I have multiple reports built on the same database.

Here are the partitions of my dataset:

[Image: Tejinder_0-1696280004653.png]

Also, my query is fairly complex, with 10+ JOINs. Would doing these JOINs in Power Query instead speed up incremental refresh in dataflows?

Thanks 

1 ACCEPTED SOLUTION
lbendlin
Super User

Refresh times for incremental refresh depend heavily on housekeeping activities (partition consolidation, starting over, etc.) as well as on the current performance of your data source.

 

 I want to use dataflows to get my data as I have multiple reports from the same DB. 

That's not necessarily an argument for dataflows over datasets. Dataflows are there to shield you (the developer) from slow data sources. If your data source is not slow, then use datasets.

 

Also, My query is fairly complex with 10+ JOINS. would doing these JOINS  in power query instead, speed up incremental in dataflows.

Not likely.  Spend your energy on making the SQL query faster through correct indexes and statistics.
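
To see why the speed of the SQL query matters so much here, a rough back-of-the-envelope sketch in Python may help. It assumes that each partition in the refresh window sends its own copy of the native query to the source with a different date range, and that the partitions are evaluated one after another. The function names, the daily partition size, and the timing numbers are all hypothetical and only for illustration; this is not how the service actually schedules its work, and it is not a diagnosis of the refresh times reported above.

```python
from datetime import date, timedelta


def refresh_windows(today: date, days_to_refresh: int):
    """Return (range_start, range_end) pairs for daily partitions, oldest first.

    Hypothetical helper: it mimics a policy that refreshes the last N days,
    with one partition per day and an exclusive end date.
    """
    windows = []
    for offset in range(days_to_refresh, 0, -1):
        start = today - timedelta(days=offset)
        windows.append((start, start + timedelta(days=1)))
    return windows


def estimated_minutes(partition_count: int, minutes_per_query: float) -> float:
    """Estimate total time if every partition query costs about the same.

    Assumption: the source cannot use the date filter efficiently (for
    example, no suitable index), so a one-day query costs roughly as much
    as any other run of the same multi-join statement.
    """
    return partition_count * minutes_per_query


# Illustrative numbers only: a 3-day refresh window and 2.5 minutes per query.
windows = refresh_windows(date(2023, 10, 2), days_to_refresh=3)
for range_start, range_end in windows:
    print(f"query for {range_start} <= date < {range_end}")
print("estimated minutes:", estimated_minutes(len(windows), 2.5))
```

Under those assumptions the total grows with the number of partitions refreshed, which is why an index that lets the source filter on the date column cheaply tends to pay off more than moving the JOINs into Power Query.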

 
