Join us at FabCon Atlanta from March 16 - 20, 2026, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.
Register now!The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now! Learn more
Hi all,
I need some help understanding if I have the right idea about why I was having problems with really slow power query experience.
It's kind of a long story but in a nutshell, I was reading in a 250MB CSV file of invoices from a legacy system and aggregating the sales by month and SKU, then generating a list of unique month-product combinations and filling in all the missing combinations so that I could find the true average and standard deviations (0 sales months obviously not being in the invoice report). When working on the data everything slowed and I saw that power query was reading in over 1.7GB of data - which over a VPN was painfully slow.
I have fixed my problem and have a working theory what the engine was doing behind the scenes, but I need someone a bit more knowlegable about what the engine is doing behind the scenes to correct my thinking if I have the wrong end of the stick about what was happening. I did a walk through of a simplfied version in order to check my thinking which looked like this:
The approach in a nutshell as you can see from the Queries tab show:
My thought is that by using references and merges, I was making multiple queries instead of just making multiple tables and lists in one query where I could user Table.Buffer() to stop repeated reads of the csv. So I went back and did the entire thing in a single block of code, creating the tables and lists all in one query. This fixed my problem, but it wouldn't have been possible to write the query from the GUI as far as I can tell as it involved a cross join and a left join to tables which only appear within the query.
In case this is unclear, I've written full details of what I think is happening with sample code here:
https://wordpress.com/post/supplychaindatascientistcom.wordpress.com/195
While my problem is fixed, I'd like to get a better idea what was happening inside power query here in terms of amount of data read in when using one query of multiple tables vs. multiple queries of single tables.
Thanks for any help.
Nick
Solved! Go to Solution.
This article and video explain it well.
Regards,
Pat
To learn more about Power BI, follow me on Twitter or subscribe on YouTube.
This article and video explain it well.
Regards,
Pat
To learn more about Power BI, follow me on Twitter or subscribe on YouTube.
Thanks. It will take me some time to go through that, but it looks very promising.
Greatly appreciate you taking the time.
Check out the November 2025 Power BI update to learn about new features.
Advance your Data & AI career with 50 days of live learning, contests, hands-on challenges, study groups & certifications and more!
| User | Count |
|---|---|
| 57 | |
| 44 | |
| 42 | |
| 21 | |
| 17 |
| User | Count |
|---|---|
| 183 | |
| 114 | |
| 93 | |
| 62 | |
| 45 |