This time we’re going bigger than ever. Fabric, Power BI, SQL, AI and more. We're covering it all. You won't want to miss it.
Learn moreDid you hear? There's a new SQL AI Developer certification (DP-800). Start preparing now and be one of the first to get certified. Register now
Incremental loading moves only new or changed records instead of reprocessing entire datasets. Benefits include:
The watermark pattern uses a timestamp column to identify new records since the last pipeline execution.
Create a tracking table in your Warehouse:
CREATE TABLE tblWaterMark ( Watermark DATETIME2(6) ); INSERT INTO tblWaterMark VALUES ('2024-01-01 00:00:00');
Note: Set the initial watermark to a date before your earliest data or when the last full load occurred.
Your Warehouse now contains:
Create a stored procedure to update the watermark after each successful load:
CREATE OR ALTER PROCEDURE updateWatermark @WaterMark DATETIME2(6) AS UPDATE tblWaterMark SET Watermark = @WaterMarkThis procedure accepts a datetime parameter and persists the new high-water mark.
Configure four activities in your Fabric pipeline:
Select * from FactImport WHERE OrderDate > '@{activity('Watermark table lookup').output.firstRow.Watermark}' and OrderDate <= '@{activity('Maximum Watermark').output.firstRow.LatestWatermark}'
Each pipeline execution:
Subsequent runs load only new records added since the previous execution, making data movement efficient and scalable.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.