Noeleke1301
Helper II

Copy gen1 dataflow to lakehouse/warehouse

Hi all,

 

The data output option in gen2 dataflow is great, but it's missing incremental refresh (for now). So I need to use gen1 for our purpose.

How can I copy data from a gen1 dataflow into my lakehouse/warehouse?

 

Thanks.

1 ACCEPTED SOLUTION

I know append is not incremental, but it's the tool to build it with. If you create a dataflow that only brings in yesterday's data, it will append just the new data. The difference between append and incremental refresh is that you have to think through the incremental logic yourself and prepare your source for it. That's the way gen2 can do it for now.
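To make the "append only yesterday's data" idea concrete, here is a minimal sketch in plain Python. It is not dataflow code: the `updated_at` field, the function names, and the sample rows are all assumptions made up for illustration. It only shows the logic you would have to build yourself in the query.

```python
from datetime import date, datetime, timedelta


def rows_for_previous_day(source_rows, today):
    """Return only the rows whose timestamp falls on the day before `today`."""
    start = datetime.combine(today - timedelta(days=1), datetime.min.time())
    end = datetime.combine(today, datetime.min.time())
    return [row for row in source_rows if start <= row["updated_at"] < end]


def append_refresh(destination, source_rows, today):
    """Append yesterday's rows to the destination; existing rows are not checked."""
    new_rows = rows_for_previous_day(source_rows, today)
    destination.extend(new_rows)
    return len(new_rows)


# Hypothetical sample data: `updated_at` is an assumed column name.
source = [
    {"id": 1, "updated_at": datetime(2023, 7, 2, 9, 0)},
    {"id": 2, "updated_at": datetime(2023, 7, 3, 14, 30)},
    {"id": 3, "updated_at": datetime(2023, 7, 4, 8, 15)},
]
table = [{"id": 1, "updated_at": datetime(2023, 7, 2, 9, 0)}]

# Refresh run on 2023-07-04: only the row from 2023-07-03 is appended.
appended = append_refresh(table, source, today=date(2023, 7, 4))
```

Note that nothing here checks what is already in the destination, so running the same refresh twice for the same day would append the same rows twice. That is the part of the logic you have to take care of yourself.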

Another alternative is to have dataflow gen1 store its data in Azure Data Lake gen2, then create a shortcut to it from the Fabric lakehouse.

Your alternative also works: create the gen1 dataflow first, then read it with gen2 to insert the data into the lakehouse.

I hope that makes sense.


If this post helps, then please consider accepting it as the solution to help other members find it more quickly.

Happy to help!

LaDataWeb Blog

4 REPLIES
ibarrau
Super User

Hi. Have you tried dataflow gen2? I haven't tested it with that scenario yet, but when I was testing gen2 for simple solutions, it asked at the end of the configuration whether you want to replace the data or append to it. I know the docs say incremental refresh doesn't work, but this might be a different way to accomplish the objective.

[Screenshot: ibarrau_0-1688484649453.png]

I think it will let you make it incremental.

If you just can't, dataflow gen1 can't pick a destination other than an Azure Data Lake gen2 (a storage account with the hierarchical namespace setting enabled). You can sync a workspace with the lake, and dataflow gen1 will store its data there. That would be the only way to have the dataflow copy data to a destination like a lake. Otherwise the dataflow uses its own black-box storage, which can only be connected to from Power BI Desktop.

I hope that helps


Hi @ibarrau 

Appending data is not the same as incremental refresh, as it will append all data without checking existing data (I believe). The gen1 incremental refresh does do that check.

A workaround would be to add a timestamp in my query and later filter for the latest data. According to some documentation, that is what the bronze (or landing) layer is supposed to do.
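A rough sketch of that timestamp workaround, in plain Python rather than dataflow code. The `loaded_at` column, the `id` key, and the helper names are assumptions for illustration only: every load is stamped and appended to the landing data, and a later step keeps just the most recent row per key.

```python
from datetime import datetime


def stamp(rows, loaded_at):
    """Add a load timestamp to each row before it is appended to landing."""
    return [{**row, "loaded_at": loaded_at} for row in rows]


def latest_per_key(rows, key="id", ts="loaded_at"):
    """Keep only the most recently loaded row for each key."""
    latest = {}
    for row in rows:
        current = latest.get(row[key])
        if current is None or row[ts] > current[ts]:
            latest[row[key]] = row
    return list(latest.values())


# Hypothetical landing (bronze) data built up by two append-only loads.
landing = []
landing += stamp([{"id": 1, "name": "Ann"}, {"id": 2, "name": "Bob"}], datetime(2023, 7, 3))
landing += stamp([{"id": 2, "name": "Robert"}], datetime(2023, 7, 4))

# Downstream view: one row per id, taken from the latest load.
current = latest_per_key(landing)
```

The landing data keeps every version that was ever loaded (three rows here), while `current` holds one row per `id`. The trade-off is that the landing table keeps growing, and the filtering has to run somewhere downstream.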

 

For now I've created a gen1 dataflow with incremental refresh plus a gen2 dataflow with the automatic sink, and I run them sequentially in a Data Factory pipeline. That works as intended, but is not optimal.

I have a list of Customers in a Snowflake data source, in a dimension table that has a Time Updated column. Is there a method using dataflow gen2 that would support selecting just the rows with a Time Updated value after the last refresh, and updating just those rows that changed?
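For anyone reading along, the logic being asked about here is often described as a watermark plus an upsert. The sketch below is plain Python and only illustrates that pattern. It does not show, and should not be read as, anything dataflow gen2 does out of the box. The `customer_id` and `time_updated` names and the sample rows are assumptions.

```python
from datetime import datetime


def changed_since(source_rows, watermark, ts="time_updated"):
    """Select only the source rows updated after the last refresh."""
    return [row for row in source_rows if row[ts] > watermark]


def upsert(target, changed_rows, key="customer_id"):
    """Update rows that already exist in `target` and insert the rest."""
    inserted = updated = 0
    for row in changed_rows:
        if row[key] in target:
            updated += 1
        else:
            inserted += 1
        target[row[key]] = row
    return inserted, updated


def next_watermark(changed_rows, previous, ts="time_updated"):
    """New watermark is the latest update time seen, or the old one if nothing changed."""
    return max((row[ts] for row in changed_rows), default=previous)


# Hypothetical destination table, keyed by customer_id.
target = {
    1: {"customer_id": 1, "name": "Ann", "time_updated": datetime(2024, 4, 1)},
    2: {"customer_id": 2, "name": "Bob", "time_updated": datetime(2024, 4, 2)},
}
# Hypothetical source: customer 2 changed and customer 3 is new since the last refresh.
source = [
    {"customer_id": 1, "name": "Ann", "time_updated": datetime(2024, 4, 1)},
    {"customer_id": 2, "name": "Robert", "time_updated": datetime(2024, 4, 10)},
    {"customer_id": 3, "name": "Cleo", "time_updated": datetime(2024, 4, 11)},
]

last_refresh = datetime(2024, 4, 5)
changed = changed_since(source, last_refresh)
inserted, updated = upsert(target, changed)
last_refresh = next_watermark(changed, last_refresh)
```

Only the two rows changed after the previous refresh are touched, and the watermark moves forward to the latest `time_updated` value seen, so the next run starts from there.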
