Greetings, all. I'm exploring using Microsoft Fabric for an enterprise-scale data warehousing solution but have a question. We have a lot of raw data files in CSV format that we want to load into staging and then transform in a Data Warehouse.
My questions are:
I've seen the following pattern presented as the way to approach this kind of medallion architecture (use Lakehouse as Bronze, DW as Silver, etc.):
My concern with #2 is that new columns can be added to the files, which would require the Delta tables to change. Plus, it sounds like copying into the Lakehouse and then copying into a DW creates two "copies" of the same data.
Anyone have ideas or helpful suggestions on this?
Hi @arpost ,
Thanks for using Fabric Community.
Whether to load directly to the data warehouse or to load to a lakehouse first and then copy to the data warehouse depends on a few factors, including:
Here are some additional things to consider:
Yes, loading into a lakehouse and then loading into a data warehouse does create separate copies of the same data. This is because the lakehouse and the data warehouse are two separate systems. The lakehouse is typically used for storing and processing raw data, while the data warehouse is typically used for storing and analyzing structured data.
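On the concern raised in the question about new columns appearing in the CSV files, one option is to compare each file's header against the columns the staging table already has before loading it. Here is a minimal sketch in plain Python. It is not tied to any Fabric API, and `known_columns`, the sample data, and the function names are hypothetical, for illustration only:

```python
import csv
import io


def read_header(csv_text: str) -> list[str]:
    """Return the column names from the first row of CSV text (empty list if no rows)."""
    reader = csv.reader(io.StringIO(csv_text))
    return next(reader, [])


def diff_columns(known: list[str], incoming: list[str]) -> dict[str, list[str]]:
    """Report columns that are new in the file and columns the file no longer has."""
    known_set = set(known)
    incoming_set = set(incoming)
    return {
        "added": [c for c in incoming if c not in known_set],
        "missing": [c for c in known if c not in incoming_set],
    }


# Hypothetical: columns the staging table currently has.
known_columns = ["order_id", "customer_id", "amount"]

# Hypothetical: a newly arrived file with one extra column.
sample = "order_id,customer_id,amount,currency\n1,42,9.99,USD\n"

drift = diff_columns(known_columns, read_header(sample))
print(drift)  # {'added': ['currency'], 'missing': []}
```

If `added` is non-empty, the load step could pause and alter the staging table first, or route the file for review, depending on how strict you want staging to be.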
Please refer to these links for more information:
Link1
Link2
Link3
Hope this helps. Please let us know if you have any further queries.
Hi @arpost ,
Glad that your issue got resolved. Please continue using Fabric Community for any help regarding your queries.