I've just created a Dataflow Gen2 and want to load the data directly into my lakehouse.
It asks me to select an 'update method', but how does this work exactly?
For example, what if I get the same data with just a different DateTimeReceived? How does it know which row to update? Should I do anything with the column mapping? Shouldn't I set a unique key somewhere?
Thanks.
Hi @Noeleke1301 ,
The Replace and Append update methods do not update individual rows or merge modified data into the existing table.
With Replace, each refresh overwrites all the data in the destination table. With Append, each refresh adds its entire output to the data that was saved previously.
For more information, you can refer to:
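To illustrate the difference between the two methods, here is a minimal Python sketch that models the destination table as a list of rows. It is not how Fabric implements the refresh; the function names, the `Id` and `Status` columns, and the sample values are made up for the example. It only shows that neither method matches rows on a key, so the same record arriving with a different DateTimeReceived is either the only copy left (Replace) or a second copy (Append).

```python
from datetime import datetime


def refresh_replace(destination, incoming):
    """Replace: discard what the table held and keep only the new load."""
    return list(incoming)


def refresh_append(destination, incoming):
    """Append: keep the existing rows and add every incoming row, with no key matching."""
    return list(destination) + list(incoming)


# Hypothetical data: the same record arrives twice with a different DateTimeReceived.
first_load = [{"Id": 1, "Status": "open", "DateTimeReceived": datetime(2024, 1, 1, 8, 0)}]
second_load = [{"Id": 1, "Status": "open", "DateTimeReceived": datetime(2024, 1, 2, 8, 0)}]

replaced = refresh_replace(first_load, second_load)
appended = refresh_append(first_load, second_load)

print(len(replaced))  # one row: only the latest load survives
print(len(appended))  # two rows: the record now appears twice
```

So if you need one row per record with Append, you would have to deduplicate yourself, for example in a later query or notebook step.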
Create your first Microsoft Fabric dataflow - Microsoft Fabric | Microsoft Learn
If you need to refresh only part of the data, take a look at incremental refresh:
Using incremental refresh with dataflows - Power Query | Microsoft Learn
Best Regards,
Liu Yang
If this post helps, please consider accepting it as the solution to help other members find it more quickly.
I believe this uses some kind of Delta Parquet difference log in the background. I ran an experiment: I defined a table with no primary keys and subjected it to a barrage of updates and inserts, with the "Replace" option chosen. The Dataflow Gen2 handled every case, as long as the specified Timestamp column was updated correctly at the same time as the data.
While all of this is very powerful, the documentation is poor. The explanation of "buckets" would be better with an example, or with a clearer account of where the "buckets" of change are built from.
https://learn.microsoft.com/en-us/fabric/data-factory/dataflow-gen2-incremental-refresh#how-incremen...
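For what it's worth, here is how I picture the "buckets", as a simplified Python model based on my reading of that documentation page. It is not Microsoft's implementation. I am assuming a bucket size of one day, that a bucket counts as changed when the maximum value of the change-detection column inside it differs from the previous refresh, and that a changed bucket is removed from the destination and reloaded as a whole. The column names `OrderDate` and `ModifiedAt` and all function names are made up for the example.

```python
from datetime import datetime


def bucket_key(ts):
    # Assumed bucket size: one calendar day of the filter column.
    return ts.date()


def max_change_per_bucket(rows, filter_col, change_col):
    """Highest change-detection value seen in each bucket."""
    maxima = {}
    for row in rows:
        key = bucket_key(row[filter_col])
        if key not in maxima or row[change_col] > maxima[key]:
            maxima[key] = row[change_col]
    return maxima


def incremental_refresh(destination, source, previous_maxima, filter_col, change_col):
    """Reload only the buckets whose maximum change value moved since the last refresh."""
    current = max_change_per_bucket(source, filter_col, change_col)
    changed = {k for k, v in current.items() if previous_maxima.get(k) != v}
    # Rows in unchanged buckets stay; changed buckets are replaced wholesale.
    kept = [r for r in destination if bucket_key(r[filter_col]) not in changed]
    reloaded = [r for r in source if bucket_key(r[filter_col]) in changed]
    return kept + reloaded, current, changed


FILTER, CHANGE = "OrderDate", "ModifiedAt"  # hypothetical column names

source_v1 = [
    {"Id": 1, "Amount": 10, "OrderDate": datetime(2024, 1, 1, 9), "ModifiedAt": datetime(2024, 1, 1, 9)},
    {"Id": 2, "Amount": 20, "OrderDate": datetime(2024, 1, 2, 9), "ModifiedAt": datetime(2024, 1, 2, 9)},
]
dest, maxima, changed_1 = incremental_refresh([], source_v1, {}, FILTER, CHANGE)

# Row 2 is updated in the source and its ModifiedAt timestamp moves forward.
source_v2 = [
    source_v1[0],
    {"Id": 2, "Amount": 25, "OrderDate": datetime(2024, 1, 2, 9), "ModifiedAt": datetime(2024, 1, 3, 12)},
]
dest, maxima, changed_2 = incremental_refresh(dest, source_v2, maxima, FILTER, CHANGE)

print(sorted(changed_2))
print([(r["Id"], r["Amount"]) for r in dest])
```

In this model no primary key is needed because rows are never matched one by one: the whole bucket is swapped out. It would also explain what I saw in my experiment, since an update that does not move the Timestamp column leaves the bucket's maximum unchanged and the bucket is skipped.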