The ultimate Fabric, Power BI, SQL, and AI community-led learning event. Save €200 with code FABCOMM.
Get registeredCompete to become Power BI Data Viz World Champion! First round ends August 18th. Get started.
I cannot post any screenshots or visuals due to client policies restricting data disclosure.
I am currently working on a project where the original architecture involved SQL Server as the data source. Power BI reports were developed on top of SQL Server tables and views, with data refreshed using a Gateway. The scheduled refresh never exceeded 3 minutes. There were a few transformations applied in Power Query. These reports were published in a PP3 workspace.
Now, the architecture has shifted to Azure Databricks, and I updated the data source for the existing reports using the Azure Databricks connector in Power BI. The connection is configured with ODBC, running native queries, and using Azure Active Directory for credentials. The transformations in Power Query from the original report were preserved. The reports are also published in a PP3 workspace, but the refresh time has increased to 9–12 minutes.
Some additional details:
My questions are focused on understanding what might be causing this issue, rather than solely fixing it:
Any insights or experiences with similar setups would be greatly appreciated! Any official doumentation would also be of a great help! Thanks in Advance 🙂
Solved! Go to Solution.
Did I answer your question? Mark my post as a solution, this will help others!
If my response(s) assisted you in any way, don't forget to drop me a "Kudos" 🙂
Kind Regards,
Poojara
Data Analyst | MSBI Developer | Power BI Consultant
Consider Subscribing my YouTube for Beginners/Advance Concepts: https://youtube.com/@biconcepts?si=04iw9SYI2HN80HKS
@PrabhakaranM, As we haven’t heard back from you, we wanted to kindly follow up to check if the solution provided by our super users for your issue worked? or let us know if you need any further assistance here?
Thanks,
Prashanth Are
MS Fabric community support
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly and give Kudos if helped you resolve your query
@PrabhakaranM As we haven’t heard back from you, we wanted to kindly follow up to check if the solution provided by our super users for your issue worked? or let us know if you need any further assistance here?
Thanks,
Prashanth Are
MS Fabric community support
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly and give Kudos if helped you resolve your query
@PrabhakaranM, Hope your doing well.
@Poojara_D12, @TomMartens thanks for your promt repsonse.
@PrabhakaranM, As we haven’t heard back from you, we wanted to kindly follow up to check if the solution provided by our super users for your issue worked? or let us know if you need any further assistance here?
Thanks,
Prashanth Are
MS Fabric community support
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly and give Kudos if helped you resolve your query
Did I answer your question? Mark my post as a solution, this will help others!
If my response(s) assisted you in any way, don't forget to drop me a "Kudos" 🙂
Kind Regards,
Poojara
Data Analyst | MSBI Developer | Power BI Consultant
Consider Subscribing my YouTube for Beginners/Advance Concepts: https://youtube.com/@biconcepts?si=04iw9SYI2HN80HKS
Hey @PrabhakaranM ,
from my understanding you do not use Direct Query?
Nevertheless, do not use ODBC, use the Azure Databricks connectivity that comes with Power BI Desktop / Power BI Service.
Basically, SQL Server is better if there are multiple connections at the same time, but this is only important if you are using Direct Query connection modes.
If you are using Azure Databricks, then I recommend that all data transformations are done in Power Query should be moved to Databricks, this is what Databricks is good at. Of course this requires "communication" between Power BI users and the Databricks team.
Regarding load times, not sure if you are using Power BI Premium or not and if you are using a on-premises data gateway.
Depending on the location of the SQL Server, you also have to remember that the data travels from SQL Server to Power BI Service vs. from Databricks to Power BI Service. If an on-premises data gateway is involved for the latter, then data travel will add a lot to the data load / data refresh duration.
Please check if Azure Databricks and the Power BI Service are in the same region. If they are not, you may have to reach out to your Azure Infrastructure architects and ask how these different data centers are connected.
Hopefully, this helps to tackle your challenge.
Regards,
Tom
Hey Tom... Thank you for your time and response.
I want to give more clarifications as you asked few questions.
1. I am using Azure Databricks connector found in Power Query.
2. All the workspaces and servers are in same region.
3. The workspaces is Premium Per User.
4. The connectivity mode is Import and client doesn't wants to change it.
4. The tranformation is power bi are complex and stops at the first applied step. Hence without the query folding the refresh times are quicker when the datasource is SQL Server. Whereas Azure Databricks is take 4x time to complete a scheduled refresh. Hope this gives more insight on the situation.
As per my knowledge there is no bottleneck from databrick side. What could be the most reasonable issue?
User | Count |
---|---|
43 | |
15 | |
13 | |
13 | |
9 |
User | Count |
---|---|
50 | |
43 | |
24 | |
22 | |
18 |