Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Calling all Data Engineers! Fabric Data Engineer (Exam DP-700) live sessions are back! Starting October 16th. Sign up.

Reply
jgarcia-alvarez
Frequent Visitor

Refresh Lakehouse after load data with copy activity in Data Factory Pipeline

Hi all,

 

I have created a Data Factory pipeline in Fabric to move data among the different layers (Lakehouses) using Notebooks and Copy activities and once data is in Gold Lakehouse I need to replicate it in a Warehouse. For that I am using a stored procedure, but it is not able to read the recent data until I refresh the Lakehouse manually. I added a Wait (60 seconds) activity before the Stored Procedure but it not refreshing automatically.

 

Could you let me know how to create an activity / set of activities to refresh the Lakehouse? As we don't have a specific activity for that is there any best practice?

 

Thanks in advance for your help.

1 ACCEPTED SOLUTION
Anonymous
Not applicable

Hi @jgarcia-alvarez 

 

It looks like you want to make sure the data is up to date in Lakehouse before running the stored procedure.


You might consider using a custom script activity or notebook in the pipeline. This script can programmatically trigger a refresh of Lakehouse. For example:

 

# Example script to refresh Lakehouse
import requests

# Define your Lakehouse refresh endpoint and authentication details
refresh_url = "https://your-lakehouse-endpoint/refresh"
headers = {
    "Authorization": "Bearer your_access_token",
    "Content-Type": "application/json"
}

# Send the refresh request
response = requests.post(refresh_url, headers=headers)

if response.status_code == 200:
    print("Lakehouse refresh initiated successfully.")
else:
    print(f"Failed to refresh Lakehouse: {response.status_code}")

 

Ensure that the custom script activity is executed after data replication is complete.

 

vnuocmsft_0-1732672494736.png

 

I hope this helps you with your thoughts.

 

Regards,

Nono Chen

If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

View solution in original post

1 REPLY 1
Anonymous
Not applicable

Hi @jgarcia-alvarez 

 

It looks like you want to make sure the data is up to date in Lakehouse before running the stored procedure.


You might consider using a custom script activity or notebook in the pipeline. This script can programmatically trigger a refresh of Lakehouse. For example:

 

# Example script to refresh Lakehouse
import requests

# Define your Lakehouse refresh endpoint and authentication details
refresh_url = "https://your-lakehouse-endpoint/refresh"
headers = {
    "Authorization": "Bearer your_access_token",
    "Content-Type": "application/json"
}

# Send the refresh request
response = requests.post(refresh_url, headers=headers)

if response.status_code == 200:
    print("Lakehouse refresh initiated successfully.")
else:
    print(f"Failed to refresh Lakehouse: {response.status_code}")

 

Ensure that the custom script activity is executed after data replication is complete.

 

vnuocmsft_0-1732672494736.png

 

I hope this helps you with your thoughts.

 

Regards,

Nono Chen

If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

Helpful resources

Announcements
FabCon Global Hackathon Carousel

FabCon Global Hackathon

Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes!

September Fabric Update Carousel

Fabric Monthly Update - September 2025

Check out the September 2025 Fabric update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.