Hi, guys! I need help.
I've made 2 dataflows using the Power BI service:
One connects to Google BigQuery, and the other connects to an Excel file in OneDrive.
They both work perfectly.
Then I created a report in Power BI Desktop, using the 'dataflow' connector, and published it.
So, here are the problems I'm having:
I separated the data transformation into 2 dataflows because one of them, the Google BigQuery one, is really big and will be refreshed only once a day. The other one, the OneDrive Excel file, is supposed to be refreshed on demand by the user.
Refreshing the dataflows takes the expected time: about 15 minutes for the Google BigQuery one and about 5 seconds for the OneDrive one.
But when I refresh the dataset in the service, it takes more than 10 minutes. It makes no sense to me. The refresh in Power BI Desktop, connected to the same dataflows, takes about 5 seconds too.
My guess is that somehow the online dataset is not just reading the last 'view' from the dataflows. But then again, if I do not refresh the dataflows, the dataset remains the same. I'm lost...
This might be relevant:
When I go to the lineage view in the workspace where I published the report and created the dataflows, the report's dataset does not show as connected to the dataflows. Also, the dataflows do not show as connected to any data sources.
The lineage view looks like this:
MyDataSet ----> MyReport
MyDataFlowGoogle (literally no connections)
MyDataFlowOneDrive (literally no connections)
Can anyone help me?
Thank you a lot!
Thanks for the response @GilbertQ , and sorry for taking too long to reply.
I really thought that the dataset would only retrieve the 'last state' data from the dataflows, and not refresh them.
I'm doing some tests now and will be back with more info.
Thanks a lot!
Happy to help!
Hi @PedroNaves
The reason it takes longer is that the dataset refresh has to get all the data from each dataflow and then import that data into your dataset.
If the 2 dataflows are merged or appended in the dataset, this takes additional time, because it has to load each dataflow first and then still perform the merge or append.
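If it helps to see where the time actually goes, one option is to pull the dataset's refresh history and look at the duration of each run. Below is a rough Python sketch of that idea, not a definitive implementation. It assumes the Power BI REST API "refresh history" endpoint for a dataset in a workspace, an access token you have already obtained some other way, and UTC timestamps ending in `Z`. The function names, `group_id`, `dataset_id` and `token` are placeholders made up for illustration.

```python
import json
import urllib.request
from datetime import datetime, timezone


def refresh_history_url(group_id, dataset_id):
    # Assumed endpoint: dataset refresh history in a workspace (group).
    return (
        "https://api.powerbi.com/v1.0/myorg/groups/"
        f"{group_id}/datasets/{dataset_id}/refreshes"
    )


def parse_ts(ts):
    # Assumption: timestamps are UTC and look like 2024-01-01T10:00:00.000Z.
    # Fractional seconds are dropped to keep the parsing simple.
    base = ts.rstrip("Z").split(".")[0]
    return datetime.strptime(base, "%Y-%m-%dT%H:%M:%S").replace(tzinfo=timezone.utc)


def refresh_durations(entries):
    # Return (status, seconds) for every refresh that has both a start and an end.
    result = []
    for entry in entries:
        start, end = entry.get("startTime"), entry.get("endTime")
        if start and end:
            seconds = (parse_ts(end) - parse_ts(start)).total_seconds()
            result.append((entry.get("status"), seconds))
    return result


def fetch_refresh_history(group_id, dataset_id, token):
    # Performs the HTTP call; needs a valid bearer token (not obtained here).
    request = urllib.request.Request(
        refresh_history_url(group_id, dataset_id),
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response).get("value", [])
```

Something like `refresh_durations(fetch_refresh_history(group_id, dataset_id, token))` would then list how long each dataset refresh took, which you can compare against the refresh times of the two dataflows.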