Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

To celebrate FabCon Vienna, we are offering 50% off select exams. Ends October 3rd. Request your discount now.

Reply
tan_thiamhuat
Post Patron
Post Patron

Notebook vs Dataflow Gen 2

recently I was doing some coding work in Notebook, dealing with files in SFTP server stuffs.. some manipulation and checks and sending back the file to SFTP Server. the logic itself is not as simple, and its big file size needs to be overcome.. but now everything is OK with the notebook script which I feel satisfied.

 

question to ask is if that Notebook script is able to be replicated with Dataflow Gen 2. To me it seems not possible, but maybe the experts here can advise better...

 

I know in terms of cost, Dataflow Gen 2 is more costly than the Notebook. Also need to consider its maintainability.. does Dataflow Gen 2 helps in its maintability, as it is supposed to be low-code.

 

Thanks.

1 ACCEPTED SOLUTION
Vinodh247
Solution Supplier
Solution Supplier

I would suggest you to stick with notebooks for your scenario.


Use Dataflow Gen 2 if....

  • Sources/destinations are natively supported (OneLake, SharePoint, etc.)

  • The logic is simple (joins, filters, merges)

  • You want to expose transformations to citizen developers or analysts

If you need better orchestration or reuse then you have toconsider:

  • Wrapping your notebook logic in a pipeline

  • Using parameters and dynamic branching

  • Logging for better visibility

and yes you are right that dataflowgen2 is costly as it depends on the refresh frequency, size of data, and compute capacity it utilizes. It is tied more to capacity planning and licensing

 

On maintainablity,

  • dataflowgen2 would be easier for citizen developers as it comes with simple ETL pipelines, readable steps, versioning via UI
  • notebook will be harder to onboard non developers but much more maintainable for advanced users if you plan to modularize & use clear functions with logging and comments

Please kudo and 'Mark' as answer if the reply was helpful. This will be benefitting other community members who face the same issue.

 

Please 'Kudos' and 'Accept as Solution' if this answered your query.

Regards,
Vinodh
Microsoft MVP [Fabric]

View solution in original post

1 REPLY 1
Vinodh247
Solution Supplier
Solution Supplier

I would suggest you to stick with notebooks for your scenario.


Use Dataflow Gen 2 if....

  • Sources/destinations are natively supported (OneLake, SharePoint, etc.)

  • The logic is simple (joins, filters, merges)

  • You want to expose transformations to citizen developers or analysts

If you need better orchestration or reuse then you have toconsider:

  • Wrapping your notebook logic in a pipeline

  • Using parameters and dynamic branching

  • Logging for better visibility

and yes you are right that dataflowgen2 is costly as it depends on the refresh frequency, size of data, and compute capacity it utilizes. It is tied more to capacity planning and licensing

 

On maintainablity,

  • dataflowgen2 would be easier for citizen developers as it comes with simple ETL pipelines, readable steps, versioning via UI
  • notebook will be harder to onboard non developers but much more maintainable for advanced users if you plan to modularize & use clear functions with logging and comments

Please kudo and 'Mark' as answer if the reply was helpful. This will be benefitting other community members who face the same issue.

 

Please 'Kudos' and 'Accept as Solution' if this answered your query.

Regards,
Vinodh
Microsoft MVP [Fabric]

Helpful resources

Announcements
September Fabric Update Carousel

Fabric Monthly Update - September 2025

Check out the September 2025 Fabric update to learn about new features.

August 2025 community update carousel

Fabric Community Update - August 2025

Find out what's new and trending in the Fabric community.