The ultimate Fabric, Power BI, SQL, and AI community-led learning event. Save €200 with code FABCOMM.
Get registeredEnhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.
Hello Team / Community,
If a large organization is migrating towards Azure Fabric and establishing a data mesh / domain-oriented architecture structured as a hub-and-spoke model. We currently possess a significant amount of data in various existing Azure Storage Accounts (ADLS Gen2), often organized by source system or business domain.
A core requirement is enabling domain owners to integrate relevant data from their existing source ADLS into their respective Fabric domains. We intend to use Fabric shortcuts to link directly to specific paths within the source ADLS, facilitating a controlled and managed way to access this external data.
Furthermore, within each domain, we may plan to adopt the Medallion Architecture concept (Bronze, Silver, Gold layers) to structure data processing and refinement.
We are now designing how to map these concepts – the external ADLS sources, the Fabric Domains, Workspaces, and Lakehouses – together to realize this architecture. We are seeking architectural patterns and practical guidance on this mapping and the resulting data flow.
Specifically, we'd deeply appreciate insights on:
We are looking for practical, pattern-based guidance on how to align our existing ADLS sources, the Fabric hub-and-spoke domain structure, and the internal Medallion layering using Workspaces and Lakehouses effectively and scalably.
Any shared experiences, architectural diagrams, or specific patterns implemented in similar scenarios would be extremely valuable.
Thank you for your time and expertise!
Solved! Go to Solution.
Hi @TwoCents ,
Thank you for reaching out to Microsoft Fabric Community.
When you are building a Fabric hub and spoke model with domains (like Finance, Sales, etc.), and you have a lot of existing data sitting in Azure Data Lake (ADLS Gen2), the best way to bring that data into Fabric is by using shortcuts.
Each domain should create shortcuts inside their own Lakehouse pointing directly to just the folders they need from ADLS. This becomes the Bronze layer (raw data).
For organizing your Medallion Architecture (Bronze, Silver, Gold), the simplest and cleanest way is to use one Lakehouse per domain and split it into folders like /Files/Bronze, /Tables/Silver, /Tables/Gold.
Only if your data grows massive (like many terabytes) you can consider separating Lakehouses for each layer, but that's rare early on.
Each domain usually needs just one workspace at the start. You don't need to split workspaces unless you grow really big.
Inside the domain workspace, you handle:
Data movement happens using Fabric Pipelines and Spark Notebooks. Pipelines help automate the steps and Notebooks are great if you have complex logic (like heavy data transformations or machine learning).
If this post helps, then please consider Accepting as solution to help the other members find it more quickly, don't forget to give a "Kudos" – I’d truly appreciate it!
Thank you!!
Hi @TwoCents ,
I hope this information provided is helpful. Feel free to reach out if you have any further questions or would like to discuss this in more detail. If responses provided answers your question, please accept it as a solution so other community members with similar problems can find a solution faster.
Thank you!!
Hi @TwoCents ,
I wanted to check if you had the opportunity to review the information provided. Please feel free to contact us if you have any further questions. If the responses has addressed your query, please accept it as a solution and give a 'Kudos' so other members can easily find it.
Thank you!!
Hi @TwoCents ,
May I ask if you have resolved this issue? If so, please mark the helpful reply and accept it as the solution. This will be helpful for other community members who have similar problems to solve it faster.
Thank you.
Hi @TwoCents ,
Thank you for reaching out to Microsoft Fabric Community.
When you are building a Fabric hub and spoke model with domains (like Finance, Sales, etc.), and you have a lot of existing data sitting in Azure Data Lake (ADLS Gen2), the best way to bring that data into Fabric is by using shortcuts.
Each domain should create shortcuts inside their own Lakehouse pointing directly to just the folders they need from ADLS. This becomes the Bronze layer (raw data).
For organizing your Medallion Architecture (Bronze, Silver, Gold), the simplest and cleanest way is to use one Lakehouse per domain and split it into folders like /Files/Bronze, /Tables/Silver, /Tables/Gold.
Only if your data grows massive (like many terabytes) you can consider separating Lakehouses for each layer, but that's rare early on.
Each domain usually needs just one workspace at the start. You don't need to split workspaces unless you grow really big.
Inside the domain workspace, you handle:
Data movement happens using Fabric Pipelines and Spark Notebooks. Pipelines help automate the steps and Notebooks are great if you have complex logic (like heavy data transformations or machine learning).
If this post helps, then please consider Accepting as solution to help the other members find it more quickly, don't forget to give a "Kudos" – I’d truly appreciate it!
Thank you!!
User | Count |
---|---|
14 | |
9 | |
5 | |
3 | |
2 |
User | Count |
---|---|
45 | |
22 | |
17 | |
13 | |
12 |