amolt
Resolver I

Dataflow runs for hours

Hello,
I regularly have Gen2 dataflows that get stuck in refresh mode for hours on end, consuming all the resources of my capacity. Under normal circumstances, these dataflows take only a few minutes (each loads a single table from a CSV file).
I tried restarting the gateway, without success.
What can I do?

Thanks for your help

9 REPLIES
v-cboorla-msft
Community Support

Hi @amolt

Following up to check whether you have found a resolution yet. If you have, please share it with the community, as it may be helpful to others.
Otherwise, please reply with more details and we will try to help.

Thanks

Hi @v-cboorla-msft ,

Unfortunately, the only solution I have found is to stop using pipelines to run Gen2 dataflows.

Thanks

DataPne
Frequent Visitor

I am having the same issue - support hasn't been able to help other than suggesting not to use Gen2 dataflows.

Not really feasible - surprising how bad this product is.

Joshrodgers123
Advocate V

I have the same issue trying to load from an on-prem SQL Server with the latest gateway.

With a Gen1 dataflow, the table loads in a couple of seconds. With Gen2 and staging enabled, it takes a few minutes. With staging turned off, which I'd like to do since the query can fold, it takes an hour and a half and fails 9 times out of 10.

HimanshuS-msft
Community Support

Hello @amolt 
Thanks for using the Fabric community.

As I understand the current inquiry, the question is why Dataflow Gen2 is taking more time than usual. Please correct me if I'm mistaken.

You mentioned that it's 'consuming all the resources of my capacity,' which seems unusual and suggests that other activities may be running concurrently, such as another scheduled dataflow. If you're uncertain, you could try rescheduling the dataflow to run during off-hours to see whether that results in fewer failures.

Thanks
Himanshu

Hello @HimanshuS-msft ,
Thank you for your reply.
No, I don't have any other processing running in parallel. During the last execution, the dataflow ran for more than 5 hours, compared with the usual few minutes.

Thanks

Hi @DataPne , @Joshrodgers123 ,

Do you use a pipeline to execute your dataflows?
In my case, I noticed that my problems occurred when I used a pipeline to schedule the execution of the Gen2 dataflow.

DataPne
Frequent Visitor

Hi @amolt ,

The issue seems to occur even when executing the pipelines manually from the UI.

It runs successfully maybe 1 time in 10.

Have you managed to find any workarounds on your side?

Hi @DataPne ,

I've found no other solution than to stop using pipelines to run dataflows. Instead, I set up an individual schedule for each dataflow. It's really not great.
If anyone has a better solution, I'm interested.
