Solved: Re: Standard Deviation from 4 measured columns.......

DEMDEJ · ‎03-01-2023

Hi all!

I've researched so many sites (and previous answers) concerning computing standard deviation in PowerBI. I never knew it would be so difficult for me to do since Excel makes it SIMPLE!

I have a matrix table that I want to resemble an excel table. The data shown are all aggregate values compiled from individual records (with some duplicates). All columns are measures.

It is important for the pivot to remain and that I only calculate the current month for each year. For an example, I want to calculate March for the current year and March for the previous 5 years (2022 - 2017). While the STDEVX.P function is fairly simple, I need it to only reference my previous 5 year columns (2017-2021). When I use the function, it references EVERYTHING since I don't have a DAX formula to add in the caveats.

If my vision is possible, please help me out or let me know of alternatives. I will give kudos and I don't mind reading recommendations!

PiEye · ‎03-02-2023

Hi Demdej

I have just realised that you wish to measure the standard deviation at a yearly granularity, not at the granularity of the data within. So over 5 data points

Ie the calculation would look like this if we were to draw it out:

SQRT( ( (2016prod-avg)²+(2017prod-avg)²+....+(2020prod-avg)²)/5 )

What we need for this are the "aggregateX" functions. These are probably easier to understand if you research "sumX" and "averageX" as they are better documented & used.

Essentially, these functions will allow you to iterate across rows that you define:

eg

AverageX( Values([customerID]), sum(Sales)) will group by customer, calculate the sales for each then work out the average of all of those results. Giving you the average sales per customer.

In your example we are going to use

Year STDEV = STDEVX.P( VALUES(APY[Crop_Year]), [x years production])

Where

x years production = CALCULATE(sum(APY[Production]),FILTER(APY,APY[Crop_Year]>= 2016))

Plugging this into the table gives the result as manually calculated in excel.

Result in PBI:

calculation in Excel:

Pi

View solution in original post

PiEye · ‎03-02-2023

Hi Demdej, it seems like you need the CALCULATE() function.

CALCULATE gives you the flexibility to apply a filter to any measure and change the context in which it is executed.

For example. I have a data set with crops, and production amounts.

To work out the total production for all years >2016:

The first parameter is the measure we want to use, the second is the filter. Very simple:

x years production Total = CALCULATE(sum(APY[Production]),APY[Crop_Year]>= 2016)

So whereever we use this measure, it will ignore the year and give us total prod after 2016:

You can see here it is just adding up the sum of production from years 2016+ and putting it in every cell, regardless of the year that the row is for!

Calculate also offers many options for types of filters - see reference here: https://learn.microsoft.com/en-us/dax/filter-functions-dax

If we want to only include the years 2016+ but also filter for the individual year, we can create this expression:

x years production = CALCULATE(sum(APY[Production]),FILTER(APY,APY[Crop_Year]>= 2016))

So it depends on exactly what you will be doing with Standard deviation, what filters you want to impact it etc, but at the simplest level some variation of this expression is what you need:

x years stdev Total = CALCULATE(STDEV.P(APY[Production]),APY[Crop_Year]>= 2016)

Looking at it in a table with years:

Let's perform a quick check by filtering the year and make sure it is correct:

The final table

Now we can apply this to the final table. Unfortunately, we cannot combine a pivot and a straight table in one, so I have used a straight table and created a measure per each year to get round this issue:

HTH,

Pi

DEMDEJ · ‎03-02-2023

@PiEye

Thanks so much for the thorough explanation and examples. They are always so helpful.

The fact that we can't use a pivot table to illustrate the st.dev is difficult because, like you, I created a measure for each year, but I still could not get an accurate SD.
For some reason, it's the same for you:

I just double check my data in Excel, because I'm far better at Excel than PowerBI. Since the column thing is such an issue, I decided to put my dates all in one column, but I STILL can't get an accurate count.

I'm truly stomped. 😞

PiEye · ‎03-02-2023

Hi Demdej

I have just realised that you wish to measure the standard deviation at a yearly granularity, not at the granularity of the data within. So over 5 data points

Ie the calculation would look like this if we were to draw it out:

SQRT( ( (2016prod-avg)²+(2017prod-avg)²+....+(2020prod-avg)²)/5 )

What we need for this are the "aggregateX" functions. These are probably easier to understand if you research "sumX" and "averageX" as they are better documented & used.

Essentially, these functions will allow you to iterate across rows that you define:

eg

AverageX( Values([customerID]), sum(Sales)) will group by customer, calculate the sales for each then work out the average of all of those results. Giving you the average sales per customer.

In your example we are going to use

Year STDEV = STDEVX.P( VALUES(APY[Crop_Year]), [x years production])

Where

x years production = CALCULATE(sum(APY[Production]),FILTER(APY,APY[Crop_Year]>= 2016))

Plugging this into the table gives the result as manually calculated in excel.

Result in PBI:

calculation in Excel:

Pi

DEMDEJ · ‎03-07-2023

@PiEye Thank you so much for your help!

PiEye · ‎03-07-2023

You're welcome - Glad it worked!

Standard Deviation from 4 measured columns.... DARN!

Helpful resources

Join our Fabric User Panel

Power BI Monthly Update - June 2025

Fabric Community Update - June 2025

Join the #PBI10 DataViz contest

Standard Deviation from 4 measured columns.... DARN!

Helpful resources

Join our Fabric User Panel

Power BI Monthly Update - June 2025

Fabric Community Update - June 2025