AndyDDC
Super User

Power BI Power Query connection to Azure Data Lake Gen2

Hi,

 

I'm loading data from an Azure Data Lake Gen2 folder (10 CSV files totalling around 5 GB) using Power Query (I started in Desktop and am now doing this in Dataflows). A GroupBy transformation reduces the data to around 10 MB after aggregation. Loading the data and performing the GroupBy takes around 15 minutes. Can anyone tell me whether Power BI is doing all the work here? That is, Power BI can't fold any transformation queries down to the Data Lake (it's just storage, right?), so the raw data is loaded into the dataflow engine and then transformed?

 

Thanks

1 ACCEPTED SOLUTION
Jimmy801
Community Champion

Hello @AndyDDC 

 

For sure, there is no way to fold back to a CSV file. There may be approaches where folding would be possible, for example if another engine loaded the CSV into a database that accepts a native query.
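
As a rough illustration of the "another engine" idea, here is a minimal Python sketch (not Power Query). The CSV is copied into a database once, and the aggregation is then written as a native SQL query that the database runs itself. The in-memory SQLite database, the `sales` table, and the `region` and `amount` columns are all assumptions made up for this example; they are not taken from the original question.

```python
import csv
import io
import sqlite3

# Hypothetical sample standing in for one CSV file from the data lake.
SAMPLE_CSV = """region,amount
North,10
South,5
North,7
"""


def load_csv_into_db(conn, csv_text):
    """Copy every CSV row into a table. This step still reads the whole file."""
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    reader = csv.DictReader(io.StringIO(csv_text))
    conn.executemany(
        "INSERT INTO sales (region, amount) VALUES (?, ?)",
        ((row["region"], float(row["amount"])) for row in reader),
    )
    conn.commit()


def totals_by_region(conn):
    """The GROUP BY runs inside the database, so only the small result comes back."""
    cursor = conn.execute(
        "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
    )
    return cursor.fetchall()


conn = sqlite3.connect(":memory:")
load_csv_into_db(conn, SAMPLE_CSV)
result = totals_by_region(conn)
print(result)
```

The one-off load still has to read every row of the file. The benefit only appears afterwards, when a client can send the aggregation to the database instead of pulling the raw rows.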


If this post helps or solves your problem, please mark it as the solution (to help other users find useful content and to acknowledge the work of users who helped you).
Kudos are nice too.

Have fun

Jimmy

 


3 REPLIES

Thanks Jimmy, that makes sense about CSV files.

 

How about Parquet files? I'm wondering whether some of the processing can be folded for those.

Jimmy801
Community Champion

Hello @AndyDDC 

 

Sorry, but I have never heard of Parquet files. When you load from files, it is always best to reduce the data in the earliest steps, because this affects loading time. So put the filter steps and remove-column steps first, followed by the group steps.
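
The ordering described here (filter, then drop columns, then group) can be sketched in Python as a single streaming pass over a CSV. This is not Power Query code, and the `region`, `year`, `amount`, and `comment` columns and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample standing in for one CSV file.
SAMPLE_CSV = """region,year,amount,comment
North,2023,10,a
South,2024,5,b
North,2024,7,c
North,2024,3,d
"""


def aggregate(csv_text, year):
    """Stream rows once: filter, keep only the needed columns, then group."""
    totals = defaultdict(float)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # 1. Filter as early as possible so unwanted rows are discarded at once.
        if row["year"] != year:
            continue
        # 2. Keep only the columns the aggregation needs ("comment" is dropped).
        region = row["region"]
        amount = float(row["amount"])
        # 3. Group: accumulate a running total per key.
        totals[region] += amount
    return dict(totals)


totals = aggregate(SAMPLE_CSV, "2024")
print(totals)
```

Every row of the file is still read, since a CSV cannot be queried in place. Reducing early only shrinks what the engine has to carry through the later steps.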

 


Have fun

Jimmy
