Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Join us at FabCon Atlanta from March 16 - 20, 2026, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM. Register now.

Reply
Anonymous
Not applicable

Does having too many columns in the query editor slow down a report / dataset or have other impacts?

I am having a debate with our sysadmin.   

 

He likes to put every single column from our systems (ERP, CRM, etc) into the dataflows that we use to create our datasets.    He feels that I can delete them in the query editor when creating a report and then when the report gets published the response time and refresh times won't be affected because the columns are not in the actual report.  

 

My feeling is that the report will be affected from a user experience perspective, but also from other issues like opening an App that contains this reports (I've noticed it can take up to 45 seconds to load an App for the first time).    Or that the load on the system for background refreshes will also be affected by having all of those columns in the dataset (and the dataflows).   

 

Can anyone shed some light on this?   Thanks!

1 ACCEPTED SOLUTION

@Anonymous Well, yes, not having query folding slows down data load in general. However, not all data sources support query folding. So, if your data source doesn't support query folding, then fine but in that case, then absolutely if you do not need the columns they should never show up to Power Query because the operation of removing those columns is causing processing to happen on the gateway (even if they are cloud only resources it uses Microsoft's cloud gateway). In short, it is causing extra processing to happen when loading the data. First, the source system has to render that data and ship it over the network. Second, your data refresh has to ingest that data and then get rid of it. Those are all extra network packets and cycles that would not happen if the "view" from the data source didn't include those extra columns to begin with.



Follow on LinkedIn
@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
DAX For Humans

DAX is easy, CALCULATE makes DAX hard...

View solution in original post

4 REPLIES 4
Greg_Deckler
Community Champion
Community Champion

@Anonymous If the extraneous columns actually make it to the data model then it definitely impacts dataset size and then potentially user experience. If you remove the extraneous columns in Power Query Editor then it comes down to whether or not query folding is happening. If you can right click on the last step in the query and you have the option to "View Native Query" then query folding is happening and thus the removal steps of the query are not impacting data load. If that is not the case, then there are steps happening locally on the gateway that are definitely impacting data refresh speeds.



Follow on LinkedIn
@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
DAX For Humans

DAX is easy, CALCULATE makes DAX hard...
Anonymous
Not applicable

@Greg_Deckler   Thanks Greg.  I've just checked a number of queries that all come from dataflows (which themselves are coming from our ERP system).   View Native Query is greyed out on all of them.    So if I understand you correctly, this is not a good thing?    We don't use a gateway as far as I know.   

@Anonymous Well, yes, not having query folding slows down data load in general. However, not all data sources support query folding. So, if your data source doesn't support query folding, then fine but in that case, then absolutely if you do not need the columns they should never show up to Power Query because the operation of removing those columns is causing processing to happen on the gateway (even if they are cloud only resources it uses Microsoft's cloud gateway). In short, it is causing extra processing to happen when loading the data. First, the source system has to render that data and ship it over the network. Second, your data refresh has to ingest that data and then get rid of it. Those are all extra network packets and cycles that would not happen if the "view" from the data source didn't include those extra columns to begin with.



Follow on LinkedIn
@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
DAX For Humans

DAX is easy, CALCULATE makes DAX hard...
Anonymous
Not applicable

@Greg_Deckler   Awesome.  Thank you Greg.   And thanks for all you and the other MVP's do for this community!   

Helpful resources

Announcements
FabCon Global Hackathon Carousel

FabCon Global Hackathon

Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes!

October Power BI Update Carousel

Power BI Monthly Update - October 2025

Check out the October 2025 Power BI update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.