Check your eligibility for this 50% exam voucher offer and join us for free live learning sessions to get prepared for Exam DP-700.
Get StartedJoin us at the 2025 Microsoft Fabric Community Conference. March 31 - April 2, Las Vegas, Nevada. Use code FABINSIDER for $400 discount. Register now
Greetings, all. I'm exploring using Microsoft Fabric for an enterprise-scale data warehousing solution but have a question. We have a lot of raw data files in CSV format that we want to load into staging and then transform in a Data Warehouse.
My questions are:
I've seen the following pattern presented as the way to approach this kind of medallion architecture (use Lakehouse as Bronze, DW as Silver, etc.):
My concern with #2 is that new columns can be added to the files, which would require the delta tables change. Plus, it sounds like copying into the Lakehouse and then copying into a DW creates two "copies" of the same data.
Anyone have ideas or helpful suggestions on this?
Solved! Go to Solution.
Hi @arpost ,
Thanks for using Fabric Community.
Whether to load directly to the data warehouse or to load to a lakehouse first and then copy to the data warehouse depends on a few factors, including:
Here are some additional things to consider:
Yes, loading into a lakehouse and then loading into a data warehouse does create separate copies of the same data. This is because the lakehouse and the data warehouse are two separate systems. The lakehouse is typically used for storing and processing raw data, while the data warehouse is typically used for storing and analyzing structured data.
Please refer to these links for more information:
Link1
Link2
Link3
Hope this helps. Please let us know if you have any further queries.
Hi @arpost ,
Thanks for using Fabric Community.
Whether to load directly to the data warehouse or to load to a lakehouse first and then copy to the data warehouse depends on a few factors, including:
Here are some additional things to consider:
Yes, loading into a lakehouse and then loading into a data warehouse does create separate copies of the same data. This is because the lakehouse and the data warehouse are two separate systems. The lakehouse is typically used for storing and processing raw data, while the data warehouse is typically used for storing and analyzing structured data.
Please refer to these links for more information:
Link1
Link2
Link3
Hope this helps. Please let us know if you have any further queries.
Hi @arpost ,
Glad that your issue got resolved. Please continue using Fabric Community for any help regarding your queries.
March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount!
Check out the February 2025 Fabric update to learn about new features.
User | Count |
---|---|
34 | |
17 | |
3 | |
3 | |
2 |
User | Count |
---|---|
41 | |
16 | |
14 | |
10 | |
7 |