Get certified for free when you join Fabric Data Days 2026 and dive into Fabric, Power BI, SQL, AI, and other essential data skills.
Join nowData Days is here! Join us now for 60+ days of learning, challenges, and connection. Learn more
Solved! Go to Solution.
Hi @Aventuran,
While dataflow gen 1 is still supported, no improvements or development effort are going into maintaining them, so it is a good idea to start thinking about a migration to gen 2. Microsoft has committed to giving at least 12 months notice before retiring gen 1 dataflows, and that notice has not yet been given, so you have some time to figure out your migration path.
There are some major architectural differences that you need to work with when using dataflow gen 2, mainly that dataflow gen 2 does not have internal storage like gen 1 does, making a data store like a lakehouse or a warehouse mandatory.
For your scheduling problems, dataflow gen 2 can be schedueld with a pipeline, or you can use the Scheduler API to trigger them from Power Automate and keep your existing scheduling: https://learn.microsoft.com/en-us/rest/api/fabric/core/job-scheduler/run-on-demand-item-job?tabs=HTT...
Copy activities and almost all other Fabric data factory items can be run the same way, so if you wanted to move to copy activities and the scheduling was your only road block, you can schedule them with power automate using this API.
The decision to move to pipelines instead of dataflows really depends on your data, how much transformation is needed, and what functionalities you are wanting.
As for the incremental data refresh, you will need to have some sort of watermark column to be able to determine when rows were added, removed, or changed for any sort of incremental load to work nicely. If the built in incremental loads of copy jobs or dataflow gen 2 don't work for you, then it might be worth looking at switching your incremental loads to use notebooks instead, which will give you far more control over how the data is loaded and how that incremental window is determined.
Proud to be a Super User! | |
Hi @Aventuran,
May I check if this issue has been resolved? If not, Please feel free to contact us if you have any further questions.
Thank you
Hi @Aventuran,
Thank you for posting your query in the Microsoft Fabric Community Forum, and thanks to @lbendlin & @stoic-harsh & @tayloramy for sharing valuable insights.
Could you please confirm if your query has been resolved by the provided solutions? This would be helpful for other members who may encounter similar issues.
Thank you for being part of the Microsoft Fabric Community.
Think about what incremental refresh is - automated partition management based on a temporal axis
Think about what Dataflows Gen1 are - CSV files in Azure Blob storage.
So your process amounts to a round of musical chairs where you reshape existing partitions (your SPs) and then do incremental refresh just to end up with another set of "partitions" (the CSV files).
A simple approach would be to do your own partition management (as you are already doing with the SPs) and then stuff these partitions into a lakehouse (where you can refresh/overwrite them individually) . Then in your warehouse and/or semantic model you can recombine the partitions as needed.
Hey @Aventuran,
Gen1 should work just fine, but since it won't receive any updates, you should plan a gradual migration toward Gen2 / Fabric-native architecture rather than building new dependencies on Gen1. And, no need to consider pipelines, if dataflows suffice your requirement.
A good setup, I'd suggest:
Keep Gen1 temporarily for critical workloads, in the beginning.
Convert Stored Procedures to views, wherever possible. Views/tables support query folding in Gen2.
Hope this helps!
Best,
Harshit
Hi @Aventuran,
While dataflow gen 1 is still supported, no improvements or development effort are going into maintaining them, so it is a good idea to start thinking about a migration to gen 2. Microsoft has committed to giving at least 12 months notice before retiring gen 1 dataflows, and that notice has not yet been given, so you have some time to figure out your migration path.
There are some major architectural differences that you need to work with when using dataflow gen 2, mainly that dataflow gen 2 does not have internal storage like gen 1 does, making a data store like a lakehouse or a warehouse mandatory.
For your scheduling problems, dataflow gen 2 can be schedueld with a pipeline, or you can use the Scheduler API to trigger them from Power Automate and keep your existing scheduling: https://learn.microsoft.com/en-us/rest/api/fabric/core/job-scheduler/run-on-demand-item-job?tabs=HTT...
Copy activities and almost all other Fabric data factory items can be run the same way, so if you wanted to move to copy activities and the scheduling was your only road block, you can schedule them with power automate using this API.
The decision to move to pipelines instead of dataflows really depends on your data, how much transformation is needed, and what functionalities you are wanting.
As for the incremental data refresh, you will need to have some sort of watermark column to be able to determine when rows were added, removed, or changed for any sort of incremental load to work nicely. If the built in incremental loads of copy jobs or dataflow gen 2 don't work for you, then it might be worth looking at switching your incremental loads to use notebooks instead, which will give you far more control over how the data is loaded and how that incremental window is determined.
Proud to be a Super User! | |