Find everything you need to get certified on Fabric—skills challenges, live sessions, exam prep, role guidance, and a 50 percent discount on exams.
Get startedEarn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.
Hi, very basic question here.
I am very new to the whole PowerBi World, I depend a lot on it for our daily reporting, but I reckon I am NOT using it in the best way.
Let's start with a simple question: how to import the data, this is how I am going about it and these are the issues I have.
Solved! Go to Solution.
Unless you are accessing SSAS tabular data, you should always use Power Query to import your data. It does the transformations. You cannot do any transformations in Power Pivot's data model. You can add columns, but you cannot filter, remove data, or otherwise transform. Any filtering is visual only, and does not limit what is loaded into the model.
When building your data, you shouldn't have anything actually in the Excel spreadsheets. If you are loading data, load only to the data model as below. The Table section should be "Connection Only"
Even if you do have data in a table you load to Power Query, do some transformations, then load directly into the model, that will show your original table plus the query. Just use the tables/queries that have the little yellow database icon next to them. Once you get going, you just use the part of the Pivot Fields window that shows Active connections in this Pivot, vs All.
You should not add Calculated Columns as a rule to your model. If you can do the math to add a calculated column, do it in Power Query. Calculated columns will not compress, take up more room, and can slow the model down depending on the number of rows, especially on a refresh. There are some limited reasons to do it in Power BI when you need it for a slicer and cannot create it in Power Query, or there is some more complex calculation that isn't feasable or possible in Power Query (this is pretty rare though).
You shouldn't do any work in Excel's spreadsheets either. No vlookups, text concatenations, etc. Do it ALL in Power Query, then load those only into the data model. I have some very Power Query models with over 100 queries that eventually boil down to 10-12 that get loaded to the Data Model, and then I start working on the Pivot Table. There is not a single cell of data in Excel's spreadsheet side until I get to that point. 100% is imported directly into PQ.
As for speed, it depends on where your data is. If you are using Sharepoint, you'll usually want to change the initial connection string from SharePoint.Files to SharePoint.Contents, then go to the folder you want. SharePoint.Files loads every file name and other metadata before it will move to the actual file(s) you want. If OneDrive, it just depends on your connection. If it is a lot of data (hundreds of thousands of records) consider DataLake. We throw tons of text files into DataLake and use PQ to transform the data before pulling into Power Pivot or Power BI's data model.
DAX is for Analysis. Power Query is for Data Modeling
Proud to be a Super User!
MCSA: BI ReportingUnless you are accessing SSAS tabular data, you should always use Power Query to import your data. It does the transformations. You cannot do any transformations in Power Pivot's data model. You can add columns, but you cannot filter, remove data, or otherwise transform. Any filtering is visual only, and does not limit what is loaded into the model.
When building your data, you shouldn't have anything actually in the Excel spreadsheets. If you are loading data, load only to the data model as below. The Table section should be "Connection Only"
Even if you do have data in a table you load to Power Query, do some transformations, then load directly into the model, that will show your original table plus the query. Just use the tables/queries that have the little yellow database icon next to them. Once you get going, you just use the part of the Pivot Fields window that shows Active connections in this Pivot, vs All.
You should not add Calculated Columns as a rule to your model. If you can do the math to add a calculated column, do it in Power Query. Calculated columns will not compress, take up more room, and can slow the model down depending on the number of rows, especially on a refresh. There are some limited reasons to do it in Power BI when you need it for a slicer and cannot create it in Power Query, or there is some more complex calculation that isn't feasable or possible in Power Query (this is pretty rare though).
You shouldn't do any work in Excel's spreadsheets either. No vlookups, text concatenations, etc. Do it ALL in Power Query, then load those only into the data model. I have some very Power Query models with over 100 queries that eventually boil down to 10-12 that get loaded to the Data Model, and then I start working on the Pivot Table. There is not a single cell of data in Excel's spreadsheet side until I get to that point. 100% is imported directly into PQ.
As for speed, it depends on where your data is. If you are using Sharepoint, you'll usually want to change the initial connection string from SharePoint.Files to SharePoint.Contents, then go to the folder you want. SharePoint.Files loads every file name and other metadata before it will move to the actual file(s) you want. If OneDrive, it just depends on your connection. If it is a lot of data (hundreds of thousands of records) consider DataLake. We throw tons of text files into DataLake and use PQ to transform the data before pulling into Power Pivot or Power BI's data model.
DAX is for Analysis. Power Query is for Data Modeling
Proud to be a Super User!
MCSA: BI Reporting