The ultimate Microsoft Fabric, Power BI, Azure AI, and SQL learning event: Join us in Stockholm, September 24-27, 2024.
Save €200 with code MSCUST on top of early bird pricing!
Find everything you need to get certified on Fabric—skills challenges, live sessions, exam prep, role guidance, and more. Get started
There have been similar forums I know but I'm looking for specific advice on improving efficiency.
Currently I have a report with many separate tables all loading from different sharepoint files - .xlsx files to be precise. On each of these files I choose a 'Master' table and load that into the report + append to one Master table. The files are all from the same sharepoint source so the idea has come to me about creating a basic 'Source' query and reference all the tables off that same source query. But before I invest time doing that can anyone actually explain if and why that would actually speed up my refreshes?
To provide context, a refresh often takes over 10mins at the moment which is obviously not ideal. Keen to hear any other ideas.
Solved! Go to Solution.
Hi
The main idead behind what you describe is that you only need to get the data from sharepoint once if you only have one source and all the tables are being referenced. On the other hand, if all the tables have a conection with sharepoint (whitout a reference table) you are retrieving all the same data from sharepoint as many times as the number of tables you have. You can do something like this:
Create a files table whit a sharepoint connection whitout any transformation (I pass the sharepoint URL as a parameter)
Then click on the files table an unmark the "Enable Load" option, so that you are not actualy loading this table in the model (This will be your reference table)
Than just reference this table in all the new tables you want to get and do your transformations there.
Kind regards,
José
Please mark this answer as the solution if it resolves your issue.
Appreciate your kudos! 🙂
Thanks @jcalheir! I'll give it a try.
On the same topic, I often get schedule refresh errors that seem like timeouts. They look like this:
with a bit more detail on the browser site:
I have a feeling this happens because I have so many requests going through, have you seen it before?
Cheers!
Never had it, as i only use one connection, and reference my tables from it.
You should try the method i described and see if the errors still come up
Hi
The main idead behind what you describe is that you only need to get the data from sharepoint once if you only have one source and all the tables are being referenced. On the other hand, if all the tables have a conection with sharepoint (whitout a reference table) you are retrieving all the same data from sharepoint as many times as the number of tables you have. You can do something like this:
Create a files table whit a sharepoint connection whitout any transformation (I pass the sharepoint URL as a parameter)
Then click on the files table an unmark the "Enable Load" option, so that you are not actualy loading this table in the model (This will be your reference table)
Than just reference this table in all the new tables you want to get and do your transformations there.
Kind regards,
José
Please mark this answer as the solution if it resolves your issue.
Appreciate your kudos! 🙂
Join the community in Stockholm for expert Microsoft Fabric learning including a very exciting keynote from Arun Ulag, Corporate Vice President, Azure Data.
Check out the August 2024 Power BI update to learn about new features.
User | Count |
---|---|
110 | |
80 | |
66 | |
53 | |
52 |
User | Count |
---|---|
121 | |
117 | |
77 | |
64 | |
63 |