Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now! Learn more

Reply
molegris
Advocate IV
Advocate IV

Best practice for getting data from csv file

Hi,

 

I just heard about Power BI ability to connect to csv files located on SharePoint online then automatically create a dataset.  And I'm wondering if there would be some benifits to update my current ETL strategy.  I'm not concern about refreshing the data because we get updated data only once a month but I'd like to be sure I'm using Microsoft product for what they were intended for and with the optimal architecure.

 

Currently, I use 3 dataflows to import data into Power BI Services. 

All sourced from csv files located in SharePoint online.

With a few transformations (rename headers, a few calculated columns and some value replacement)

 

  1. Fact table #1 : ~500k rows
  2. Fact table #2 : ~60k rows
  3. 10 dimension tables, 1 dimension has ~20k rows but the others are all under 1k.

Then I use 2 datasets.  In each dataset I created a star model with 1 fact table and its related dimensions.

Most transformations and data prep steps are done in the datasets.

Those datasets are used for many official reports AND and are available to analyts across the organisation.

 

Should I drop the dataflows and rethink the dataset to connect directly to the csv files located in Sharepoint online or it is actually more robust the way I designed it ??

 

Thank you

--mo

 

1 ACCEPTED SOLUTION
v-easonf-msft
Community Support
Community Support

Hi, @molegris 

Although, a dataset can directly get data from a data source, however, it is a best practice that a shared dataset gets the data from dataflows, this is to have a multi-developer implementation  of Power BI.

For more details,please refer to this document.

dataflow-vs-dataset 

 

Best Regards,
Community Support Team _ Eason

View solution in original post

2 REPLIES 2
v-easonf-msft
Community Support
Community Support

Hi, @molegris 

Although, a dataset can directly get data from a data source, however, it is a best practice that a shared dataset gets the data from dataflows, this is to have a multi-developer implementation  of Power BI.

For more details,please refer to this document.

dataflow-vs-dataset 

 

Best Regards,
Community Support Team _ Eason

Thank you, this was very useful.

I found that Microsoft documentation is very good with the "How To" use their products but it's very weak for explaining "Why would we use this product"  "What was it made to do"  "What is the product place in a BI ecosystem", etc.  

 

Fortunately, there some good bloggers and a great community!  🙂

Helpful resources

Announcements
Power BI DataViz World Championships

Power BI Dataviz World Championships

The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now!

December 2025 Power BI Update Carousel

Power BI Monthly Update - December 2025

Check out the December 2025 Power BI Holiday Recap!

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.