Hi all.
Recently I found out that my dataset from SQL Server is missing some information. The dataset covers machine units and their daily performance. However, if a unit did not work on a particular day (maintenance or whatever), no entry is stored for that date. For example, a CAT unit didn't work on Jan 4th, and I would like that day to be stored as zero. Here is an example of my database and a modified one.
What is the best way to make these modifications?
The underlying problem is that the average monthly performance is calculated incorrectly: it divides total performance by only the January days that exist in the table, not by all 31 days. Calculating monthly performance in a separate table could make sense, but then I guess it would lose time intelligence and drill-downs, and there are more indicators than just performance. But if you think it is a better idea, let me know.
I really appreciate any help.
Hi @karimkz.
In your scenario, you can follow the steps below:
1. Duplicate the query that connects to the SQL Server database. Filter the original query to keep only the cat data and rename it as the cat table. Filter the duplicated query to keep only the hitachi data and rename it as the hitachi table.
2. Create a calendar table.
3. Merge the calendar table with the cat table and with the hitachi table separately. For the unit column, use Fill Down or Fill Up; for the value column, use Replace Values to replace null with 0.
4. Append the hitachi table to the cat table.
For more information, please check the attached .pbix file.
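As an alternative to the Power Query steps above, a similar zero-filled result could be sketched as a DAX calculated table. This is a different technique from the merge described above, not the method in the .pbix file. It assumes a fact table named 'Performance' with Unit, Date and Performance columns, and a 'Date' calendar table with one row per day that is related to 'Performance' on the date column. All of these names are placeholders.

```dax
Filled Performance =
ADDCOLUMNS (
    -- every unit paired with every calendar day
    CROSSJOIN ( VALUES ( 'Performance'[Unit] ), VALUES ( 'Date'[Date] ) ),
    -- CALCULATE evaluates the sum for the current unit and day;
    -- + 0 turns a day with no rows (blank) into 0
    "Performance", CALCULATE ( SUM ( 'Performance'[Performance] ) ) + 0
)
```

Because this materializes one row per unit per day, it may not suit very large tables.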
Best Regards,
Qiuyun Yu
Thank you for your answer!
I see your point, and I believe it will work out. I've got to learn how to do all those things, though.
I'm also thinking of creating a date table and then doing something like:
CALCULATE(SUM(performance); FILTER(date; date.month=EARLIEST(month))).
Do you think it will work?
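For reference, here is a minimal sketch of how the date-table idea might look as a measure. It is not a tested solution. It assumes a calendar table named 'Date' with one row per day, related to a fact table named 'Performance' on the date column; both names are placeholders.

```dax
Avg Daily Performance =
DIVIDE (
    SUM ( 'Performance'[Performance] ),  -- total over the days that have rows
    COUNTROWS ( 'Date' )                 -- all calendar days in the current filter context
)
```

With January selected, the denominator is 31 even if some days have no rows in the fact table. If several units are in context at once, this divides their combined total by the number of days, not by units times days.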
Hi
We have the same situation, but our tables are very large, so we can't duplicate them. Is there any way around this? Did you find a solution?
Creating a calendar table and merging it with the existing table is still the best option.
You need to dimensionalise your data. I would create a separate date table and a separate unit table (with the distinct list of unit types). You can then use a calculation like this:
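AVERAGEX ( CROSSJOIN ( VALUES ( 'Unit'[Unit] ), VALUES ( 'Date'[Date] ) ), CALCULATE ( SUM ( 'Performance'[Performance] ) ) + 0 )
CROSSJOIN gives you every combination of the dates and units in the current filter context. AVERAGEX then iterates over those combinations. The inner CALCULATE makes the SUM evaluate for each individual unit and date, and the + 0 turns a day with no rows into zero so that it still counts toward the average (AVERAGEX skips blanks). 'Performance'[Performance] is a placeholder for the performance column in your own fact table, which needs relationships to both the date and unit tables.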