<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: CDC ( incremental load ) in fabric pipline - help in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5137395#M15530</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/1561317"&gt;@kaouter&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;SPAN data-teams="true"&gt;Hope you are doing well. Kindly let us know if the issue has been resolved or if further assistance is needed. Your input could be helpful to others in the community.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 23 Mar 2026 09:01:12 GMT</pubDate>
    <dc:creator>v-hashadapu</dc:creator>
    <dc:date>2026-03-23T09:01:12Z</dc:date>
    <item>
      <title>CDC ( incremental load ) in fabric pipline - help</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5132554#M15465</link>
      <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;I am learning Microsoft Fabric during my data engineering training at Simplon.&lt;BR /&gt;I am building a pipeline that extracts data from an API every day using Fabric Data Factory.&lt;/P&gt;&lt;P&gt;I want to implement CDC to process only new inserts updates and deletes instead of loading the full dataset each day.&lt;/P&gt;&lt;P&gt;What is the best approach in Fabric to implement CDC&lt;BR /&gt;Should I load data into a staging table first and then detect changes or is there a native CDC approach&lt;/P&gt;&lt;P&gt;Thank you for your help !&amp;nbsp;&lt;BR /&gt;&lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 14 Mar 2026 20:42:51 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5132554#M15465</guid>
      <dc:creator>kaouter</dc:creator>
      <dc:date>2026-03-14T20:42:51Z</dc:date>
    </item>
    <item>
      <title>Re: CDC ( incremental load ) in fabric pipline - help</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5133609#M15479</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/1561317"&gt;@kaouter&lt;/a&gt;,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This all depends on the capabilities of your API.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Does the API return a watermark column like a last updated datetime?&amp;nbsp;&lt;BR /&gt;Does the API allow you to filter results on that column?&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;If yes, then you can keep track of the last time you ran your pipeline to pull the data in a metadata table somewhere, and then only pull data that has changed since the last time your pipeline started. From there you can upsert or merge into your tables in Fabric.&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 16 Mar 2026 13:35:13 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5133609#M15479</guid>
      <dc:creator>tayloramy</dc:creator>
      <dc:date>2026-03-16T13:35:13Z</dc:date>
    </item>
    <item>
      <title>Re: CDC ( incremental load ) in fabric pipline - help</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5133655#M15481</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/1561317"&gt;@kaouter&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;P&gt;If your source is a REST API, then true CDC is not available in Microsoft Fabric, because CDC relies on database transaction logs. However, you can implement an industry-standard incremental ingestion pattern that achieves the same outcome.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Ingest incrementally from the API (source-side filtering)&lt;/P&gt;&lt;P&gt;If the API supports it, use:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;A lastModified, updatedAt, or similar timestamp&lt;/LI&gt;&lt;LI&gt;Or a monotonically increasing ID&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;In Fabric, this can be implemented using:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Copy Data activity with a REST connector, or&lt;/LI&gt;&lt;LI&gt;Notebook-based ingestion (for complex pagination or auth)&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;You store and reuse a watermark value (last successful timestamp or ID) between runs to fetch only new or changed records.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. Land data in a Bronze (staging) area in the Lakehouse&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Store the raw API responses as JSON or Delta&lt;/LI&gt;&lt;LI&gt;This provides:&lt;UL&gt;&lt;LI&gt;Replayability&lt;/LI&gt;&lt;LI&gt;Schema evolution handling&lt;/LI&gt;&lt;LI&gt;Auditability (industry best practice)&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;This staging step is strongly recommended in modern lakehouse architectures.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;3. Apply changes using MERGE (CDC-style processing)&lt;/P&gt;&lt;P&gt;Once data is in the Lakehouse:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Use Spark / SQL MERGE INTO on Delta tables to:&lt;UL&gt;&lt;LI&gt;Insert new records&lt;/LI&gt;&lt;LI&gt;Update changed records&lt;/LI&gt;&lt;LI&gt;Handle deletes (if the API provides delete indicators)&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Delta Lake’s MERGE operation is the standard mechanism for CDC-style processing in Fabric Lakehouses.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;4. Handling deletes (often missed)&lt;/P&gt;&lt;P&gt;Industry best practice for APIs:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;If the API provides:&lt;UL&gt;&lt;LI&gt;A deleted flag &amp;gt; soft delete&lt;/LI&gt;&lt;LI&gt;Or delete events &amp;gt; propagate deletes via MERGE&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;If not:&lt;UL&gt;&lt;LI&gt;Periodic reconciliation or snapshot comparison may be required&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Fabric does not automatically detect deletes for APIs—this must be handled explicitly in your logic.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 16 Mar 2026 14:42:50 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5133655#M15481</guid>
      <dc:creator>deborshi_nag</dc:creator>
      <dc:date>2026-03-16T14:42:50Z</dc:date>
    </item>
    <item>
      <title>Re: CDC ( incremental load ) in fabric pipline - help</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5136230#M15509</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Hi &lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/1561317"&gt;@kaouter&lt;/a&gt;&amp;nbsp;, Thank you for reaching out to the Microsoft Community Forum.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;We find the answer shared by&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/1445078"&gt;@deborshi_nag&lt;/a&gt;&amp;nbsp; is appropriate. Can you please confirm if the solution worked for you. It will help others with similar issues find the answer easily.&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Thank you&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/1445078"&gt;@deborshi_nag&lt;/a&gt;&amp;nbsp; for your valuable response.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;LI-WRAPPER&gt;&lt;/LI-WRAPPER&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 20 Mar 2026 08:41:36 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5136230#M15509</guid>
      <dc:creator>v-hashadapu</dc:creator>
      <dc:date>2026-03-20T08:41:36Z</dc:date>
    </item>
    <item>
      <title>Re: CDC ( incremental load ) in fabric pipline - help</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5137395#M15530</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/1561317"&gt;@kaouter&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;SPAN data-teams="true"&gt;Hope you are doing well. Kindly let us know if the issue has been resolved or if further assistance is needed. Your input could be helpful to others in the community.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 23 Mar 2026 09:01:12 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5137395#M15530</guid>
      <dc:creator>v-hashadapu</dc:creator>
      <dc:date>2026-03-23T09:01:12Z</dc:date>
    </item>
    <item>
      <title>Re: CDC ( incremental load ) in fabric pipline - help</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5141327#M15640</link>
      <description>&lt;P&gt;The best approach really depends on what capabilities your API provides.&lt;/P&gt;&lt;P&gt;First, you need to check whether the API returns a watermark field, such as a lastUpdatedDateTime, modifiedAt, or something similar. Also verify whether the API allows you to filter results based on that field, for example by passing a query parameter like ?updated_since=.&lt;/P&gt;&lt;P&gt;If the API supports both of these, then you can implement CDC very efficiently. You simply store the timestamp of the last successful pipeline run in a metadata table, and on each new run, you call the API using that timestamp to retrieve only the records that were inserted, updated, or deleted since the last run. From there, you can load the incremental results into a staging table and perform an upsert/merge operation into your Fabric tables.&lt;/P&gt;&lt;P&gt;This avoids reloading the full dataset every day and lets you process only the changed data.&lt;/P&gt;</description>
      <pubDate>Mon, 30 Mar 2026 16:39:30 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/CDC-incremental-load-in-fabric-pipline-help/m-p/5141327#M15640</guid>
      <dc:creator>NaveenUpadhye</dc:creator>
      <dc:date>2026-03-30T16:39:30Z</dc:date>
    </item>
  </channel>
</rss>

