Hi community,
More silly questions still trying to understand fundamentals here, but just not quite getting it.
I have a Workspace for 'Ingestion' entities, pulling data from an on-premise datasource (SQL Server).
This ingestion Dataflow is running on Incremental Refresh - it took around 3 hours to do the initial load, but now consistently refreshes on schedule in around 4 - 6 minutes. Great so far.
I then have a 2nd Workspace for staging ie. linked entities, with Calculated entities referencing them. (I'm still unsure if its better to do this all in the 'Ingestion' workspace, or to do staging / ETL in a seperate workspace like this.. that's another issue but any suggestions welcomed 😀 😞
The 'Staged' Dataflow is however taking excessively longer to refresh than the 'Ingestion' dataflow, around 1 hour an 45 minutes:
So I'm thinking - 'this time should be at least 'comparable' to the previous incremental refresh time.. Do I need to set up Incremental Refresh on my Calculated Entities in this Dataflow?'
However, this Microsoft article seems to indicated that Computed entities behave the same way as Linked entities, which don't require incremental refresh (as Linked entities are simply a pointer), and that Calculated entities are simply performing queries over the existing stored data, not 'storing' the data again within themselves.. so putting incremental refresh on Calculated entities doesn't seem to be the correct method:
So.. my 3 big questions are:
Thanks for sticking with me... any help greatly appreciated!
Solved! Go to Solution.
HI @uberdube ,
#1. I think you only need to configure normally refresh on link entities.
#2. As the document said, it seems like incremental refresh will work on original dataflow, so if you already setting incremental refresh on original dataflow, you not need to config this on link entities.
#3. I think this refresh means to sync the last data from original dataflow and execute in query calculation steps on new records.
According to your description, I think refresh time is spent query operations in that computed entity if any advanced or complex query formulas existed in it. (merge or combine query/reference other steps)
In addition, you can also submit a support ticket to get further support form power bi team.
Regards,
Xiaoxin Sheng
HI @uberdube ,
#1. I think you only need to configure normally refresh on link entities.
#2. As the document said, it seems like incremental refresh will work on original dataflow, so if you already setting incremental refresh on original dataflow, you not need to config this on link entities.
#3. I think this refresh means to sync the last data from original dataflow and execute in query calculation steps on new records.
According to your description, I think refresh time is spent query operations in that computed entity if any advanced or complex query formulas existed in it. (merge or combine query/reference other steps)
In addition, you can also submit a support ticket to get further support form power bi team.
Regards,
Xiaoxin Sheng