I have to load data from an SAP HANA DB to a Fabric warehouse.
There are more than 15 tables to be loaded, each with a large number of rows.
Is it better to create one pipeline with 15+ Dataflow Gen 2 items, one per table, or to create a separate pipeline for each table?
Ideally, the best way would have been a single pipeline with a single Dataflow Gen 2 driven by a metadata framework, but as of today Dataflow Gen 2 cannot be parameterized.
So owing to that, you would have to create a separate dataflow per table.
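For reference, "metadata-driven" just means keeping the list of source tables in a small control structure and looping over it, so adding a table becomes a config change instead of a new dataflow. Below is a minimal sketch of that idea in Python; the schema/table names and the load_table() helper are purely hypothetical placeholders, since Dataflow Gen 2 itself cannot be driven this way today.

```python
# Illustrative control metadata: one entry per SAP HANA table to load.
# Schema and table names here are hypothetical placeholders.
TABLES_TO_LOAD = [
    {"source_schema": "SAPHANADB", "source_table": "MARA", "target_table": "dim_material"},
    {"source_schema": "SAPHANADB", "source_table": "KNA1", "target_table": "dim_customer"},
    {"source_schema": "SAPHANADB", "source_table": "VBAK", "target_table": "fact_sales_header"},
]

def load_table(entry: dict) -> None:
    """Placeholder for whatever copies one table into the warehouse
    (a Copy activity, a notebook, or a per-table dataflow)."""
    print(f"Loading {entry['source_schema']}.{entry['source_table']} "
          f"-> {entry['target_table']}")

# A metadata-driven load is simply a loop over the control list.
for entry in TABLES_TO_LOAD:
    load_table(entry)
```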
Now the question is whether those dataflows should be triggered from (integrated within) a single pipeline or from separate pipelines.
That depends on your requirements:
1) Is the load for all tables supposed to be scheduled at the same time?
2) Is there any dependency between the tables, i.e. do some flows need to run sequentially one after another, or are all tables independent of each other?
If the tables are to be loaded at the same time, the best way would be to create a single pipeline and integrate all the dataflows within it.
I really appreciate your prompt response. Thank you for that.
Let's keep the dependency aside for a minute.
Will it still be OK to have multiple dataflows, one per table, inside a single pipeline if those tables have millions of records?
The Dataflow Gen 2 items would use the same capacity tied to the workspace.
So whether you run them within the same pipeline or in parallel via different pipelines, the capacity being utilized is the same, so it won't matter. But if the capacity is getting throttled by so many runs, keeping them all within the same pipeline at least lets you make them sequential or dependent and manage the capacity utilization.
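To make the difference concrete, here is a minimal sketch of running the dataflows one after another instead of all at once. run_dataflow() is a hypothetical stand-in for however each Dataflow Gen 2 refresh is actually triggered (a Dataflow activity in the pipeline, or an API call); the point is only that sequential execution spreads the load on the capacity over time.

```python
import time

# Hypothetical dataflow names, one per source table.
DATAFLOWS = ["df_load_mara", "df_load_kna1", "df_load_vbak"]

def run_dataflow(name: str) -> None:
    """Placeholder for triggering one Dataflow Gen 2 refresh and
    waiting for it to finish."""
    print(f"Refreshing {name} ...")
    time.sleep(1)  # simulate the refresh duration

# Sequential execution: only one dataflow hits the capacity at a time,
# which is the safer option when parallel runs cause throttling.
for name in DATAFLOWS:
    run_dataflow(name)
```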
Okay, got it. 👍
Thank you once again for the prompt responses.
Hi @ManasiL ,
Glad to know that your query got resolved. Please continue using the Fabric Community for your further queries.
If I need to integrate multiple dataflows with very little data, and not dependent on each other, into one pipeline, how can I implement that? By running them in sequence? Can I run them in parallel? If yes, which would be the best approach?
The major thing to take into consideration is that all the dataflows use the same workspace capacity.
If the dataflows are independent, executing them in parallel would be the best approach, but executing all of them in parallel might choke up the workspace capacity, so you might have to run them in parallel in batches rather than all at the same time.
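One common way to express "parallel in batches" is to fan the dataflows out in fixed-size groups so only a few refreshes hit the capacity at once. A rough sketch of that pattern is below; run_dataflow() is again a hypothetical trigger for one Dataflow Gen 2 refresh, and the batch size of 3 is an assumed value you would tune to your capacity.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical dataflow names; in practice, one per source table.
DATAFLOWS = [f"df_table_{i:02d}" for i in range(1, 16)]
BATCH_SIZE = 3  # assumed limit; tune to what your capacity can absorb

def run_dataflow(name: str) -> str:
    """Placeholder for triggering one Dataflow Gen 2 refresh and
    blocking until it completes."""
    return f"{name} finished"

# Each batch runs in parallel, and the next batch only starts once
# the previous one has finished, which caps concurrent capacity usage.
for start in range(0, len(DATAFLOWS), BATCH_SIZE):
    batch = DATAFLOWS[start:start + BATCH_SIZE]
    with ThreadPoolExecutor(max_workers=BATCH_SIZE) as pool:
        for result in pool.map(run_dataflow, batch):
            print(result)
```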
Yes, they are within the same workspace. Can you please share a link showing how to implement them in parallel in batches?