Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Be one of the first to start using Fabric Databases. View on-demand sessions with database experts and the Microsoft product team to learn just how easy it is to get started. Watch now

Reply
nickc_innova
Advocate I
Advocate I

SUMX returning blanks (BonusPoints: avoiding circular logic)

I am performing an analysis on referral patterns. 
I have a single table of data named SDP_OUTPUT that contains locations named [PSA_NAME] for which there are 13 distinct values. 
The table contains a column of IDs named [STOP_CODE]. For each [PSA_NAME] + [STOP_CODE] the row in the table is unique. 
Other fields are [REFERRAL_LOCATION], [REFERRAL_PERC] and [STOP_TOTAL_DEMAND].
The Referral Location represents where the demand for the STOP code is being sent. Each PSA+STOP code only sends its demand to one location, but a location can receive demand from multiple locations. The Referral_Perc is the percentage being referred on to the referral location (between 0% and 100%). And finally, the STOP_total_demand is the demand generated at the PSA location before any referral. 

My objective is to calculate all of the demand being referred out, referred in, and the final adjusted demand after referrals. 
I have to be careful to stage out the calculation or end up in a mathematical loop. 
I first created a measure to calculate all of the workload being referred out, but set a condition for the calculation to first check that for the given PSA+STOP code, no workload is being referred in. 

REFERRED_OUT =
VAR CurrentPSAName = MAX('SDP_OUTPUT'[PSA_NAME])
VAR CurrentStopCode = MAX('SDP_OUTPUT'[STOP_CODE])
VAR CurrentStopDemand = MAX('SDP_OUTPUT'[STOP_TOTAL_DEMAND])
VAR CurrentRefPerc = MAX('SDP_OUTPUT'[REFERRAL_PERC])
VAR ReferredInRowCount =
    CALCULATE(
        COUNTROWS('SDP_OUTPUT'),
        'SDP_OUTPUT'[REFERRAL_LOCATION] = CurrentPSAName &&
        'SDP_OUTPUT'[PSA_NAME] <> CurrentPSAName &&
        'SDP_OUTPUT'[STOP_CODE] = CurrentStopCode
    )
RETURN
    IF(
        ReferredInRowCount > 0,
        BLANK(),
        CurrentStopDemand * CurrentRefPerc
    )


Next, I created a 2nd measure named Adjusted Direct Demand. 

ADJUSTED_DIRECT_DEMAND =
SUM('SDP_OUTPUT'[STOP_TOTAL_DEMAND]) - [REFERRED_OUT]

Now I am trying to create a 3rd measure named Referred In, but this is where I am running into trouble. 
REFERRED_IN =
VAR CurrentPSAName = MAX('SDP_OUTPUT'[PSA_NAME])
VAR CurrentStopCode = MAX('SDP_OUTPUT'[STOP_CODE])
VAR ReferredInTable =
    FILTER (
        'SDP_OUTPUT',
        'SDP_OUTPUT'[STOP_CODE] = CurrentStopCode &&
        'SDP_OUTPUT'[REFERRAL_LOCATION] = CurrentPSAName
    )
RETURN
    SUMX(ReferredInTable, [REFERRED_OUT])

This measure is returning blank results. 

nickc_innova_0-1685999935009.png

PSA_NAMESTOP_CODEREFERRAL_LOCATIONREFERRAL_PERCSTOP_TOTAL_DEMANDREFERRED_OUTADJUSTED_DIRECT_DEMANDREFERRED_IN
Antelope Valley1002Sepulveda1118575.7118575.70 
Antelope Valley104Sepulveda1233.7255233.72550 
Bakersfield1002Sepulveda1133481.7133481.70 
Bakersfield104Sepulveda1190.7225190.72250 
East Los Angeles1002Los Angeles134798.7634798.760 
East Los Angeles104West Los Angeles173.4766473.476640 
Lake Isabella1002Sepulveda15988.8465988.8460 
Lake Isabella104Sepulveda18.5570338.5570330 
Los Angeles1002West Los Angeles0.514961474807.0274807.02 
Los Angeles104West Los Angeles1315.906315.9060 
Ridgecrest1002Sepulveda120467.520467.50 
Ridgecrest104Sepulveda129.2445429.244540 
San Gabriel1002Sepulveda1190845.9190845.90 
San Gabriel104Sepulveda1402.966402.9660 
San Luis Obispo1002Sepulveda151814.8351814.830 
San Luis Obispo104Sepulveda170.7902670.790260 
Santa Maria1002Sepulveda184130.0684130.060 
Santa Maria104Sepulveda1129.6575129.65750 
Sepulveda1002Sepulveda0232407.90232407.9 
Sepulveda104West Los Angeles0.542480.2137260.2758219.9379 
South Los Angeles1002West Los Angeles1130810.3130810.30 
South Los Angeles104West Los Angeles1276.2024276.20240 
Ventura1002Sepulveda1142020.2142020.20 
Ventura104Sepulveda1240.2623240.26230 
West Los Angeles1002West Los Angeles073635.01073635.01 
West Los Angeles104West Los Angeles0155.47830155.4783 


For [PSA_NAME] Los Angeles, I expect [REFERRED_IN] to equal 347,983.76.
For [PSA_NAME] Sepulveda, I expected [REFERRED_IN] to equal 747,324.80.
For [PSA_NAME] West Los Angeles, I expected [REFERRED_IN] to equal 205,617.33.
For all other PSAs I expect [REFERRED_IN] to return zero because no Referral Location matches that PSA. 

I need to move on from this calculation to create a 4th measure named 2ndADJUSTED_DIRECT_DEMAND by adding the  REFERRED_IN_DEMAND to the ADJUSTED_DIRECT_DEMAND. 
I say 2nd here because there could be a 3rd or 4th adjustment needed, each time check to see if the demand continues to be referred. For example, in the data I provided, East Los Angels sends 100% of its workload on to Los Angeles, but then Los Angeles sends 50% its workload on to West Los Angeles. On the first calculation, what ends up at West Los Angeles is just 50% of Los Angles, but it should really be (the total demand of Los Angles + the 100% of the total demand from East Los Angles) * 50%. 

If anyone can think of a way to do that more efficiently, without multiple measures and without running into circular logic, you win the grand prize. 😄

For now though, I'll be happy to know why my REFERRED_IN measures isn't returning the expected result. 

Thank you!

4 REPLIES 4
lbendlin
Super User
Super User

 

a way to do that more efficiently, without multiple measures and without running into circular logic

 

- not really

- no

- heck no.

 

This is a closed loop control system that will require multiple runs to calm down and level in.  There is no chance at all that a deterministic single loop system like Power BI (which doesn't even have the concept of global variables) can handle this very complex process.

 

Can you add a timing component? Datetime stamps on when the referrals happened? So you can build a sequence?

 

I guess you could build a solution out of hydraulics, or with mycelium.

@lbendlin, I made some progress yesterday. 

I didn't know DAX had functions to measure parent/child relationships. I can leverage these to detect exactly how many PSAs away a given PSA may be on the chain of referral. PSAs at the beginning of the chain with no children will have the largest path length. PSAs at the end of the chain with the most children and/or are only referring to themselves have the shortest path length. 
So the larger the path length, the earlier its demand should be calculated. The PSA(s) with the smallest path length are calculated last. 

Next, I need to write a looping statement... which I guess isn't possible in DAX but thankfully @Greg_Deckler provided us with a write-up on how to work around that. I don't have the code written out yet but I think I have a map for how to get there. 

PATH only works if each element appears in only one place of the hierarchy.  From your original description it sounded more like you have circular references between PSAs?

 

The FOR and WHILE constructs you reference will only work for independent computations in each of the loops. As soon as one loop iteration depends on the result of another iteration this will break down.  There is a Power Query function List.Accumulate that is a bit more powerful but even that cannot mask the fact that Power BI does not have a concept of global variables.

 

I think your best chance is still when you provide additional sequencing information (ideally timestamps, or index columns).

 

I don't believe it is as hopeless a situation as you describe. 

I was able to make some progress - though I admit I still don't have the full solution. 
After calculating the demand referred out, and calculating the demand remaining at the PSA, I have a measure for the demand referred in for each PSA as follows:

REFERRED_IN =
VAR __currentPSA = MAX(SDP_OUTPUT[PSA_NAME])
VAR __currentSTOP = MAX(SDP_OUTPUT[STOP_CODE])
RETURN
CALCULATE(
SUMX(SDP_OUTPUT, [REFERRED_OUT]),
REMOVEFILTERS(SDP_OUTPUT), SDP_OUTPUT[REFERRAL_LOCATION] = __currentPSA && SDP_OUTPUT[STOP_CODE] = __currentSTOP
)


This resolves the first issue I originally posted about. I then took it a step further, by simply writing another measure that adds the Referred In demand and the "Adjusted Demand" - which probably isn't the best name for the field but it represents the demand originated at the PSA minus the portion that was referred out. 

I think of this as the first cycle of referral. For any PSA that receives no Referred In demand, this is the end of the calculation but the problem is that some PSAs continue to refer their new adjusted demand onto another location. And the important part to remember is that it may not always be sending on 100% of it's demand. 
To calculate this, I wrote a measure that subtracts the demand that the PSA already referred out from its Adjusted Demand, and then multiplie by the referral percentage. I subtract the referred out demand from the first cycle first so that I don't double dip on the referral percentage. Also, some PSAs will have already referred out 100% of their demand, so the subtraction will result in a negative number. I handle that with a conditional statement that just returns zero. 
The measure looks like this:

2nd Reff =
VAR __remainder = ([1st_ADJUSTED_DEMAND] - [REFERRED_OUT]) * SUM(SDP_OUTPUT[REFERRAL_PERC])
VAR __solution = IF(__remainder > 0, __remainder, 0)
RETURN
__solution

This is the new demand that each PSA has to send on to another PSA for the second cycle. 

To get to the demand at the end of the second cycle of referral, I have a measure like this:
2nd_ADJUSTED_DEMAND =
VAR __currentPSA = MAX(SDP_OUTPUT[PSA_NAME])
VAR __currentSTOP = MAX(SDP_OUTPUT[STOP_CODE])
VAR __currentRefL = MAX(SDP_OUTPUT[REFERRAL_LOCATION])
VAR __solution =
CALCULATE(
SUMX(SDP_OUTPUT, ([2nd Reff])),
REMOVEFILTERS(SDP_OUTPUT), SDP_OUTPUT[REFERRAL_LOCATION] = __currentPSA && SDP_OUTPUT[STOP_CODE] = __currentSTOP
)
RETURN
IF(__currentPSA = __currentRefL,
[1st_ADJUSTED_DEMAND] + __solution,
[1st_ADJUSTED_DEMAND] - [2nd Reff])

I get the sum of all of the demand I calculated for the second cycle, and add it to the demand calculated in at the end of the first cycle, but only for each PSA where it is programmed not to refer demand on to any location other than its-self (this is the default setting for PSA referral locations if they are not referring demand on). If the PSA dose have a referral location, then I subtract this second cycle of referred out demand from the first cycle of demand. 

This solution with a series of measures works. However, it is predicated on the fact that at most there are 2 cycles of referral. Either the demand goes from A to B, or from A to B to C. 
I am certain there has to be a way to combine all of these measures into a single calculation - someone better than me in writing DAX could figure out... Then, if I could just have the measure understand the referral pattern first... If I could calculate the number of referrals/relationships... that would be the final solution.


Helpful resources

Announcements
Las Vegas 2025

Join us at the Microsoft Fabric Community Conference

March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount!

Dec Fabric Community Survey

We want your feedback!

Your insights matter. That’s why we created a quick survey to learn about your experience finding answers to technical questions.

ArunFabCon

Microsoft Fabric Community Conference 2025

Arun Ulag shares exciting details about the Microsoft Fabric Conference 2025, which will be held in Las Vegas, NV.

December 2024

A Year in Review - December 2024

Find out what content was popular in the Fabric community during 2024.