Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Find everything you need to get certified on Fabric—skills challenges, live sessions, exam prep, role guidance, and more. Get started

Reply
Cymbolz
Helper III
Helper III

Dataflow vs Dataset refresh

Haven't found any documentation on how data refresh works with respect to a dataflow and then a dataset sourced from that dataflow.

 

So looking for feedback, based on what I've discovered:

 

  • Both a dataflow and dataset need data to be refreshed
  • So I assume the dataflow is much like a data storage component on it's own that manages the updating from the data source, wherever that may be
  • And the dataset will refresh data from the dataflow 'storage'
  • Thus a logical refresh sequence (such as setting a scheduled refresh) would see the dataflow update first then the dataset aftewards (maybe 30 mins later as I suspect doing both at the same time may not yield the right results)

I've come to this conclusion after seeing the behaviour of having one or the other set for scheduled refresh.

 

I'm also seeing inconsistency in the workspace contents view where it shows last and next refresh times.  

 

For this dataflow, I've toggled off the scheduled refresh but it still shows a Next Refresh time (I would expect not to see any time stamp):

Capture 1.PNG

 

For these datasets, they have both had a refresh more recently than indicated here

 

 Capture 2.PNG

 

Here's the first one:

Capture 3.PNG

And the second:

Capture 4.PNG 

 

A bug?

 

2 ACCEPTED SOLUTIONS
otravers
Community Champion
Community Champion

>So I assume the dataflow is much like a data storage component on its own that manages the updating from the data source, wherever that may be

 

That's correct, technically PBI's Dataflow uses Azure Data Lake Gen2 for storage.

 

One use case I plan to use this dual refresh structure for, is to handle sources (e.g. static files) that don't need to be refreshed in Dataflows where they'll be imported but not under scheduled refresh. I've found PBI's scheduled refreshes to fail easily, so cutting down the service's scheduled refreshes to sources that actually need to be refreshed should lower incidents (e.g. web API timeouts, credential issues etc.).

------------------------------------------------
1. How to get your question answered quickly - good questions get good answers!
2. Learning how to fish > being spoon-fed without active thinking.
3. Please accept as a solution posts that resolve your questions.
------------------------------------------------
BI Blog: Datamarts | RLS/OLS | Dev Tools | Languages | Aggregations | XMLA/APIs | Field Parameters | Custom Visuals

View solution in original post

Hi,

 

After internal checking, it seems that the next refresh time update (without browser refresh) was fixed and is should be available in the following updates.

 

I will keep monitor it.

 

Thanks,

Assaf

View solution in original post

28 REPLIES 28

The above link not going anywhere.

I think I've found the session here:

https://www.youtube.com/watch?v=m_oLq3uS238 

Hi,

NotifyOption in Power Automate custom connector for either dataflow or dataset refresh is not working for me.

Anyone recived mail on completion?

Assaf
Employee
Employee

Hi!

 

Regarding the inconsistency in the next refresh time of the dataflow, after a browser refresh, do you still see the next refresh time?

 

Thanks,

Assaf


@Assaf wrote:

 

Regarding the inconsistency in the next refresh time of the dataflow, after a browser refresh, do you still see the next refresh time?

 


I do see the refresh time after a browser refresh.  Also with the page remaining on screen, the refresh was scheduled to run and the date/time stamps updated without me having to refresh.

 

The issue I was experiencing included navigating away from that pgae, then returning to the page (so not refreshing the browser), so I'd have thought that would result in the updated time stamps too...I'll keep an eye on it.

Hi,

 

After internal checking, it seems that the next refresh time update (without browser refresh) was fixed and is should be available in the following updates.

 

I will keep monitor it.

 

Thanks,

Assaf

I've noticed the date & time stamps are now updating, even without any refresh or navigating away.

Good to hear. Thanks!

otravers
Community Champion
Community Champion

>So I assume the dataflow is much like a data storage component on its own that manages the updating from the data source, wherever that may be

 

That's correct, technically PBI's Dataflow uses Azure Data Lake Gen2 for storage.

 

One use case I plan to use this dual refresh structure for, is to handle sources (e.g. static files) that don't need to be refreshed in Dataflows where they'll be imported but not under scheduled refresh. I've found PBI's scheduled refreshes to fail easily, so cutting down the service's scheduled refreshes to sources that actually need to be refreshed should lower incidents (e.g. web API timeouts, credential issues etc.).

------------------------------------------------
1. How to get your question answered quickly - good questions get good answers!
2. Learning how to fish > being spoon-fed without active thinking.
3. Please accept as a solution posts that resolve your questions.
------------------------------------------------
BI Blog: Datamarts | RLS/OLS | Dev Tools | Languages | Aggregations | XMLA/APIs | Field Parameters | Custom Visuals

Helpful resources

Announcements
Fabcon_Europe_Social_Bogo

Europe’s largest Microsoft Fabric Community Conference

Join the community in Stockholm for expert Microsoft Fabric learning including a very exciting keynote from Arun Ulag, Corporate Vice President, Azure Data.

Power BI Carousel June 2024

Power BI Monthly Update - June 2024

Check out the June 2024 Power BI update to learn about new features.

RTI Forums Carousel3

New forum boards available in Real-Time Intelligence.

Ask questions in Eventhouse and KQL, Eventstream, and Reflex.

Top Solution Authors