Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes! Register now.

Reply
AndyDDC
Super User
Super User

Power BI Power Query connection to Azure Data Lake Gen2

Hi,

 

I'm loading data from an Azure Data Lake Gen2 folder (10 CSV files totalling around 5GB) using Power Query (started in Desktop and now doing this in Dataflows).  I have a GroupBy transformation which is reducing the data down to around 10MB after aggregation.  The performance is around 15 minutes to load the data in and perform the GroupBy transformation.  Can anyone tell me if Power BI is doing all the work here?   In that Power BI isn't able to fold any transformation queries down to the Data Lake (it's just storage right?) and that the raw data is being loaded into the dataflow engine and then transformed?

 

Thanks

1 ACCEPTED SOLUTION
Jimmy801
Community Champion
Community Champion

Hello @AndyDDC 

 

for sure there is no way to fold back to a CSV-file. Maybe there are methologies where it would be possible to fold, because another engine could load the csv to a database and accepting some native query.


If this post helps or solves your problem, please mark it as solution (to help other users find useful content and to acknowledge the work of users that helped you)
Kudoes are nice too

Have fun

Jimmy

 

View solution in original post

3 REPLIES 3
Jimmy801
Community Champion
Community Champion

Hello @AndyDDC 

 

for sure there is no way to fold back to a CSV-file. Maybe there are methologies where it would be possible to fold, because another engine could load the csv to a database and accepting some native query.


If this post helps or solves your problem, please mark it as solution (to help other users find useful content and to acknowledge the work of users that helped you)
Kudoes are nice too

Have fun

Jimmy

 

Thanks Jimmy, that makes sense about CSV files.

 

How about Parquet files?  I'm wondering if they are able to have some processing folded?

Jimmy801
Community Champion
Community Champion

Hello @AndyDDC 

 

sorry but I never heard about Parquet files. When you are loading from files it's always best to start with reducing data in the first steps, because this has an impact on loading time. So filter-steps, remove columns-steps and then group steps first

 

If this post helps or solves your problem, please mark it as solution (to help other users find useful content and to acknowledge the work of users that helped you)
Kudoes are nice too

Have fun

Jimmy

Helpful resources

Announcements
September Power BI Update Carousel

Power BI Monthly Update - September 2025

Check out the September 2025 Power BI update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.