The ultimate Fabric, Power BI, SQL, and AI community-led learning event. Save €200 with code FABCOMM.
Get registeredEnhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.
Hi all,
I’m looking for advice on cost-effective architectural patterns in Microsoft Fabric, specifically around Capacity Unit usage.
I’m standing up Fabric at a global post-production company (~2000 staff, multiple business units), but with limited resources, low data maturity, and a tight budget. We’re working to consolidate data from several systems, including:
AWS RDS for PostgreSQL (via gateway)
D365 F&O (Finance & HR)
UKG Pro (Payroll)
Dataverse (CRM Extensions via Model Driven App)
Given these sources, my goal is to build a solid medallion architecture, but I need help understanding which approach minimizes CU consumption while still enabling:
Frequent incremental refresh (hourly for some sources)
Archiving strategies (granular vs. summarized)
Support for ~60 tables, some with 10M+ rows
Lakehouse vs. Warehouse for transformed and semantic data
When to use Dataflows Gen2 vs. Pipelines vs. Notebooks
Where to store analysis-ready data for lowest ongoing CU impact
Any tips, design patterns, or lessons from similar orgs would be hugely appreciated—especially if you’ve prioritized cost control in your Fabric rollout.
Cheers!
Solved! Go to Solution.
Hi @Treichs,
Thank you for reaching out to Microsoft Fabric Community.
Thank you for sharing the detailed scenario. Based on your requirement, here is the most cost-efficient and scalable way to build the Fabric architecture:
This setup will give you a solid medallion architecture with controlled CU usage, support for large datasets, frequent refresh and future scalability.
If this post helps, then please consider Accepting as solution to help the other members find it more quickly, don't forget to give a "Kudos" – I’d truly appreciate it!
Thanks and regards,
Anjan Kumar Chippa
Hi @Treichs,
Thank you for reaching out to Microsoft Fabric Community.
Thank you for sharing the detailed scenario. Based on your requirement, here is the most cost-efficient and scalable way to build the Fabric architecture:
This setup will give you a solid medallion architecture with controlled CU usage, support for large datasets, frequent refresh and future scalability.
If this post helps, then please consider Accepting as solution to help the other members find it more quickly, don't forget to give a "Kudos" – I’d truly appreciate it!
Thanks and regards,
Anjan Kumar Chippa
Hi @v-achippa
Thanks so much for the response – much appreciated!
Quick follow-up question regarding two areas:
Archiving Raw Data in Bronze:
related to purging raw data older than a few months, what would be your recomendation of tool to use (i.e. pipeline/notebook/etc)
Custom Views for Excel Users:
Currently, I'm using Dataflows Gen1 within Power BI Pro workspaces (until FABRIC is capacity is further developed) to create curated views that users can connect to directly from Excel via Power Query. This works fine, but I’d love your thoughts on whether this is the best approach or if there’s a more optimal pattern for supporting ad hoc Excel access to curated datasets without incurring high compute costs.
Cheers,
Hi @Treichs ,
You’ve provided a great overview of your scenario and the challenges around cost-efficient architecture in Microsoft Fabric, especially with Capacity Unit (CU) usage. Here are some recommendations and considerations based on your requirements:
1. Medallion Architecture and CU Optimization
2. Incremental Refresh & Archiving
3. Lakehouse vs Warehouse
4. Dataflows Gen2 vs Pipelines vs Notebooks
5. Storing Analysis-Ready Data
6. General Cost Control Tips
Summary Table
Layer Purpose Storage Type CU Impact
Bronze | Raw data storage | Lakehouse | Low (storage only) |
Silver | Cleaned/transformed data | Lakehouse | Moderate |
Gold | Curated/analysis-ready | Lakehouse/Warehouse | Moderate/High (on demand) |
Warehouse | Reporting/semantic layer | Warehouse | Higher (per query) |
Let me know if you’d like more specific design patterns, sample architectures, or if you have more details about your data volumes and refresh needs. Happy to help further!
Hope this helps with your Fabric rollout!
User | Count |
---|---|
20 | |
17 | |
6 | |
2 | |
2 |
User | Count |
---|---|
51 | |
49 | |
17 | |
6 | |
4 |