Hi,
I am implementing a Medallion Architecture in "Microsoft Fabric" and have a few questions about it.
1. One option is to bring data into a "Data Warehouse" using "Data Factory".
2. I can use a Lakehouse (for handling large volumes of data) or a Warehouse (for analytical workloads).
3. For the Bronze, Silver, and Gold layers, a few options are:
(i) Using a single workspace (with different notebooks that save tables as Delta tables, then running the notebooks at intervals to do incremental loads).
(ii) Having a separate workspace for the Bronze layer and another for the Silver and Gold layers.
(iii) Using "Data Factory" for Bronze, "Data Warehouse" for Silver, and "Data Engineering" for Gold.
I have gone through different resources and still have a few doubts about this.
My data source is SQL Server.
My approach is:
1. Create a workspace (I need to check whether I need multiple workspaces for the different layers).
2. Use the "Data Factory" persona (with Dataflow Gen2) to load data into either a Warehouse or a Lakehouse.
I would like to get the following questions resolved:
1. How to figure out the number of workspaces needed.
2. The number of lakehouses needed.
3. Whether a Warehouse is required (maybe for analytical loads), etc.
Any assistance on this would be very helpful, since Fabric is new to me. (Any references would also be helpful.)
Thanks in advance.
Hi @Ashwath_Bala_S ,
Thanks for using Fabric Community. The right approach depends entirely on your use case. I would like to offer a suggestion based on my understanding.
Number of Workspaces:
Number of Lakehouses:
Warehouse vs. Lakehouse:
Recommendations:
Additional Docs to refer -
Medallion architecture in Microsoft Fabric | by Valentin Loghin | Feb, 2024 | Medium
MS Fabric - The Medallion Architecture
Hope this is helpful. Please let me know in case of further queries.
Hi @Anonymous ,
Yeah it helped me.
Thank You!
Hello @Ashwath_Bala_S ,
We haven't heard back from you on the last response and are just checking whether you have a resolution yet.
If not, please reply with more details and we will try to help.
Hi @Anonymous ,
My data is in SQL Server.
One approach I have is:
1. Loading the data into a Lakehouse.
2. Creating notebooks to build the different layers and automating them with a Pipeline. (In this method the initial tables are also imported into OneLake.)
Is there any other way that avoids loading the source tables into OneLake and keeps only the tables for the different layers?
Any guidance on this will be highly helpful.
Thanks in Advance!
Hi @Ashwath_Bala_S ,
To avoid the initial load, you can use either of these two approaches:
1. Using Notebooks - You can connect directly to SQL Server using Spark, read the table values, perform transformations, and then create a table in the lakehouse.
Link for reference -
Apache Spark connector for SQL Server - Spark connector for SQL Server | Microsoft Learn
2. Using a Data Factory Pipeline - You can connect to SQL Server using a SQL Server connection in the Pipeline and then perform basic transformations using Pipeline activities and Dataflow Gen2.
After the transformations, you can load the data directly into lakehouse tables.
Both methods eliminate the initial load into OneLake and avoid data duplication.
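As a rough illustration of the notebook approach, here is a minimal sketch rather than a definitive implementation. It uses Spark's generic JDBC data source, not the dedicated connector from the link above, and it assumes the SQL Server instance is reachable from the notebook and that the SQL Server JDBC driver is available in the Spark runtime. The host, database, table, and column names, as well as the three helper functions, are hypothetical placeholders. The optional watermark filter is one simple way to read only changed rows for an incremental load.

```python
# Sketch for a Fabric notebook, where a SparkSession named `spark` already exists.
# All server, database, table, and column names are hypothetical placeholders.

def build_jdbc_url(host, database, port=1433):
    """Build a SQL Server JDBC URL for Spark's generic JDBC data source."""
    return f"jdbc:sqlserver://{host}:{port};databaseName={database};encrypt=true"


def build_source_query(table, watermark_column=None, last_value=None):
    """Return the query that is pushed down to SQL Server.

    With a watermark, only rows changed after `last_value` are read.
    Inputs are assumed to be trusted; this sketch does no escaping.
    """
    query = f"SELECT * FROM {table}"
    if watermark_column and last_value is not None:
        query += f" WHERE {watermark_column} > '{last_value}'"
    return query


def load_to_lakehouse(spark, url, query, user, password, target_table, mode="append"):
    """Read from SQL Server over JDBC and save the result as a Delta table."""
    df = (
        spark.read.format("jdbc")
        .option("url", url)
        .option("query", query)
        .option("user", user)
        .option("password", password)
        .option("driver", "com.microsoft.sqlserver.jdbc.SQLServerDriver")
        .load()
    )
    # Layer-specific transformations would go here, before the write.
    df.write.format("delta").mode(mode).saveAsTable(target_table)
    return df
```

In a notebook, this could be called as `load_to_lakehouse(spark, build_jdbc_url("myserver", "Sales"), build_source_query("dbo.Orders", "ModifiedDate", "2024-01-01"), user, password, "silver_orders")`, so that only the transformed table lands in the lakehouse. In practice the credentials would come from a secret store rather than being typed into the notebook.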
Hope this gives you some insights. Please let me know in case of further queries.
Hi @Anonymous ,
Yeah it helps!
Thank You!
Glad to know that you got some insights. Please continue using Fabric Community for your further queries.