Learn from the best! Meet the four finalists headed to the FINALS of the Power BI Dataviz World Championships! Register now
Hi,
I am starting to create a model. I have 2 approaches:
1. If I use the query route which has all the joins then I get about half a million rows (fact table and couple of dimension table).
2. If I go through the STAR SCHEMA route then there are about 9 million rows to begin with in fact table. I will join other dimension tables to get desired result but my question is : Where does the processing happens, if I go through STAR SCHEMA route, won't it take lot of time for queries to run?
Thanks,
Ritesh
Solved! Go to Solution.
The answer is as always - It depends.
Flat source: liked by the Vertipaq engine because it can be stored with nice compression. Creates processing cost at the source
Dimensions and facts: not very compressible depending on level of normalization. Should generally (not in your case) result in fewer bytes being transferred over the network. Can directly be converted to an in-memory data model and should be both fast and not consume as much memory.
You'll have to try it out and find the sweet spot between total denormalization and excessive normalization, taking into account the processing power at the source, the available memory on the desktop and in the service, as well as network performance.
And this all before you even start considering the ETL cost of Power Query and the measures cost in DAX 🙂 ...
A new Power BI DataViz World Championship is coming this June! Don't miss out on submitting your entry.
Share feedback directly with Fabric product managers, participate in targeted research studies and influence the Fabric roadmap.
| User | Count |
|---|---|
| 50 | |
| 34 | |
| 32 | |
| 18 | |
| 17 |
| User | Count |
|---|---|
| 64 | |
| 64 | |
| 41 | |
| 27 | |
| 24 |