Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Earn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.

Reply
Anonymous
Not applicable

Find Cumulative sum if a value is matched in colmn

Hi All,

I am working on pateint data to plot new courses. I imported PatientID and DateDiffs column using sql.

Then created custom column called courses. 

As I do not know how to calculate cumulative sum for group if value is matched in column I used following python function in power query to calculate New_Course column - dataset['New_Course']=(dataset.Courses.eq('New')).cumsum()

Is there any power query function for this python code?

 

Thanks in advance

 

PranjaliK_0-1654195876710.png

 

1 ACCEPTED SOLUTION
Vijay_A_Verma
Super User
Super User

See the working here - Open a blank query - Home - Advanced Editor - Remove everything from there and paste the below code to test

let
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45WMjKxUNJRMtQzBJJ+qeVKsTowMSNjDCEDIHatyCwuycxLhwpbYtENFsOhdFR4VJiewmSkylgA", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [PatientID = _t, DateDiffs = _t, Courses = _t]),
    ListOfCourses = List.Buffer(Source[Courses]),
    ListCount = List.Count(ListOfCourses),
    GenListOfRunningTotal = List.Generate(()=>[x=1,i=1], each [i]<=ListCount, each [i=[i]+1, x=if ListOfCourses{[i]}="New" then [x]+1 else [x]], each [x]),
    Result = Table.FromColumns(Table.ToColumns(Source) & {GenListOfRunningTotal},Table.ColumnNames(Source)&{"New_course"})
in
    Result

 

View solution in original post

6 REPLIES 6
Vijay_A_Verma
Super User
Super User

See the working here - Open a blank query - Home - Advanced Editor - Remove everything from there and paste the below code to test

let
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45WMjKxUNJRMtQzBJJ+qeVKsTowMSNjDCEDIHatyCwuycxLhwpbYtENFsOhdFR4VJiewmSkylgA", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [PatientID = _t, DateDiffs = _t, Courses = _t]),
    ListOfCourses = List.Buffer(Source[Courses]),
    ListCount = List.Count(ListOfCourses),
    GenListOfRunningTotal = List.Generate(()=>[x=1,i=1], each [i]<=ListCount, each [i=[i]+1, x=if ListOfCourses{[i]}="New" then [x]+1 else [x]], each [x]),
    Result = Table.FromColumns(Table.ToColumns(Source) & {GenListOfRunningTotal},Table.ColumnNames(Source)&{"New_course"})
in
    Result

 

Anonymous
Not applicable

@Vijay_A_Verma 

can you please explain the logic in the step of GenListOfRunningTotal?

Anonymous
Not applicable

@Vijay_A_Verma this works perfectly and does not even cause delays.

Thank You!!!

edhans
Super User
Super User

You need to do this in DAX. Power Query is horrible at running totals. For a few hundred records it works, and for maybe 1-2 thousand, it will work. But if you have over 10K records it will take forever, and over 100K it won't finish.


You can look at this article for running totals in DAX, or post data we can use and we can help.

 

How to get good help fast. Help us help you.

How To Ask A Technical Question If you Really Want An Answer

How to Get Your Question Answered Quickly - Give us a good and concise explanation
How to provide sample data in the Power BI Forum - Provide data in a table format per the link, or share an Excel/CSV file via OneDrive, Dropbox, etc.. Provide expected output using a screenshot of Excel or other image. Do not provide a screenshot of the source data. I cannot paste an image into Power BI tables.



Did I answer your question? Mark my post as a solution!
Did my answers help arrive at a solution? Give it a kudos by clicking the Thumbs Up!

DAX is for Analysis. Power Query is for Data Modeling


Proud to be a Super User!

MCSA: BI Reporting

@edhans If you write the same calculation logic as in DAX, then M is definitely worse. However, with the recursive functions List.Accumulate and List.Generate, it's possible to write running totals in O(N) rather than O(N^2) since M can handle recursion whereas DAX cannot.

Helpful resources

Announcements
LearnSurvey

Fabric certifications survey

Certification feedback opportunity for the community.

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.

Top Solution Authors
Top Kudoed Authors