Tejinder
Helper I

Dataflow vs. dataset incremental refresh times are very different. Help?

I have scheduled an incremental refresh from a SQL Server source through Power BI dataflows. A full refresh takes about 12 minutes, but an incremental refresh takes 50+ minutes. Why would that be the case?

When I use the same native query and schedule the incremental refresh on the dataset instead of the dataflow, it's much faster: under 3 minutes for an incremental refresh. But I want to use dataflows to get my data, as I have multiple reports from the same DB.

Here are the partitions of my dataset:

(screenshot: Tejinder_0-1696280004653.png)

Also, my query is fairly complex, with 10+ JOINs. Would doing these JOINs in Power Query instead speed up the incremental refresh in dataflows?

Thanks 

1 ACCEPTED SOLUTION
lbendlin
Super User

Refresh times for incremental refresh depend heavily on the housekeeping activities (partition consolidation, starting over, etc.) as well as on the current performance of your data source.
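
To make the partition point concrete, here is a minimal Python sketch of how a refresh window maps onto partitions, assuming monthly partitions and a policy that re-queries only the most recent few. The function names, the `order_date` column, and the dates are all hypothetical; this is not how the Power BI service is implemented, only an illustration that each refreshed partition means one more filtered run of the source query.

```python
from datetime import date

def month_partitions(start: date, end: date):
    """Yield (range_start, range_end) month windows covering [start, end)."""
    current = date(start.year, start.month, 1)
    while current < end:
        # Roll over to January of the next year after December.
        nxt = date(current.year + (current.month == 12), current.month % 12 + 1, 1)
        yield current, nxt
        current = nxt

def partitions_to_refresh(partitions, refresh_count):
    """Hypothetical policy: only the newest `refresh_count` partitions are re-queried."""
    return partitions[-refresh_count:]

parts = list(month_partitions(date(2023, 1, 1), date(2023, 10, 1)))
to_refresh = partitions_to_refresh(parts, 3)

# Each refreshed partition issues its own date-filtered query against the source.
for range_start, range_end in to_refresh:
    print(f"WHERE order_date >= '{range_start}' AND order_date < '{range_end}'")
```

If the source cannot apply that date filter cheaply, a 10+ JOIN query run once per partition could plausibly end up slower than a single full load, which would be consistent with the timings described in the question.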

 

"I want to use dataflows to get my data as I have multiple reports from the same DB."

That's not necessarily an argument for dataflows over datasets. Dataflows are there to shield you (the developer) from slow data sources. If your data source is not slow, then use datasets.

 

"Also, my query is fairly complex with 10+ JOINs. Would doing these JOINs in Power Query instead speed up incremental in dataflows?"

Not likely.  Spend your energy on making the SQL query faster through correct indexes and statistics.
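
As a rough illustration of why indexing matters for the date-range filter that incremental refresh applies, here is a small Python sketch using the standard-library `sqlite3` module. SQLite only stands in for SQL Server here, and the `orders` table, its columns, and the index name are assumptions made up for the example. The idea carries over: check the query plan before and after adding an index on the column used for the range filter.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, order_date TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders (order_date, amount) VALUES (?, ?)",
    [(f"2023-{m:02d}-15", 10.0 * m) for m in range(1, 13)],
)

# The kind of range filter an incremental refresh partition would apply.
query = "SELECT id, amount FROM orders WHERE order_date >= ? AND order_date < ?"
window = ("2023-09-01", "2023-10-01")

def plan(sql, params):
    """Return the plan detail text that SQLite reports for a query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return " | ".join(row[3] for row in rows)

plan_before = plan(query, window)   # no index on order_date yet
conn.execute("CREATE INDEX idx_orders_date ON orders (order_date)")
plan_after = plan(query, window)    # index on the filter column is now available

print(plan_before)
print(plan_after)
```

On SQL Server the equivalent check would be to look at the actual execution plan for the native query with the date filter applied, and to confirm that statistics on the filtered and joined columns are up to date.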

 
