Find everything you need to get certified on Fabric—skills challenges, live sessions, exam prep, role guidance, and more. Get started
I am working on a dashboard which has a bunch of .csv files placed in the data path folder and the dataset is quite huge. The date range of the data is from 2018 upto the latest month. We receive monthly updates and when we add some new files in the source path and refresh, it takes a lot of time(almost a day or two). I was wondering if there is a way to remove 2018-2019 files by adding some automation steps in PowerQuery? Even if this cannot be achieved, is there any workaround for reducing the refresh time?
Also, is it possible to append the new data files without having to refresh the entire tables?
Solved! Go to Solution.
Hi @Anonymous231 ,
You can refer the following links to reduce the refresh time:
1. Find the reason for the long refresh time
Why is my Power BI refresh so SLOW?!? 3 Bottlenecks for refresh performance
My dataset is taking too long to refresh!
There are 3 places where the refresh might be slow:
- Data source
- Power Query engine
- Analysis Services engine
2. Connect the file with the latest data only just as suggested by @amitchandak
Power BI: How to Connect Most Recent File from a Folder
Power BI - Connect to Most Recent File in a Folder
3. Use incremental refresh
How you can incrementally refresh any Power BI data source (This example is a CSV File)
Incremental data load from CSV
You can use the same M code and create a Dataflow. Dataflows DO enable incremental refresh on sources like .csv flat files.When the dataflow refreshes, it will only pick up the newest data, perform the transformations, and load that newly shaped data into the Azure Data Lake.
If you do all of your transformations in the dataflow, the Power BI just has to read from the Data Lake to pull data into the model. That read is usually much faster than reading from .csv files in SharePoint and performing transformations.
I'm pretty sure you can also enable incremental refresh for pulling data from the dataflow.
Best Regards
Hi @Anonymous231 ,
You can refer the following links to reduce the refresh time:
1. Find the reason for the long refresh time
Why is my Power BI refresh so SLOW?!? 3 Bottlenecks for refresh performance
My dataset is taking too long to refresh!
There are 3 places where the refresh might be slow:
- Data source
- Power Query engine
- Analysis Services engine
2. Connect the file with the latest data only just as suggested by @amitchandak
Power BI: How to Connect Most Recent File from a Folder
Power BI - Connect to Most Recent File in a Folder
3. Use incremental refresh
How you can incrementally refresh any Power BI data source (This example is a CSV File)
Incremental data load from CSV
You can use the same M code and create a Dataflow. Dataflows DO enable incremental refresh on sources like .csv flat files.When the dataflow refreshes, it will only pick up the newest data, perform the transformations, and load that newly shaped data into the Azure Data Lake.
If you do all of your transformations in the dataflow, the Power BI just has to read from the Data Lake to pull data into the model. That read is usually much faster than reading from .csv files in SharePoint and performing transformations.
I'm pretty sure you can also enable incremental refresh for pulling data from the dataflow.
Best Regards
@Anonymous231 , Can getting lastest file from folder can help ?
Check out the September 2024 Power BI update to learn about new features.
Learn from experts, get hands-on experience, and win awesome prizes.