<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Handling Pipeline Failures and Audit Layer for Data Ingestion in Microsoft Fabric Lakehouse in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4400443#M7054</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/917648"&gt;@Hussain_Abedi&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;We haven’t heard back from you since our last response and wanted to check whether your query has been resolved. If this post was helpful, please consider marking &lt;SPAN&gt;Accept as solution&lt;/SPAN&gt; to assist other members in finding it more easily.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you continue to face issues, feel free to reach out to us for further assistance!&lt;/P&gt;
&lt;P&gt;Thanks for using Microsoft Fabric Community.&lt;/P&gt;</description>
    <pubDate>Fri, 07 Feb 2025 17:03:16 GMT</pubDate>
    <dc:creator>v-aatheeque</dc:creator>
    <dc:date>2025-02-07T17:03:16Z</dc:date>
    <item>
      <title>Handling Pipeline Failures and Audit Layer for Data Ingestion in Microsoft Fabric Lakehouse</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4377389#M6415</link>
      <description>&lt;P&gt;I am ingesting data from PostgreSQL and MySQL into the Fabric Lakehouse Bronze layer. In the Silver layer, the two tables are joined, and the combined data is stored in a delta table, which I have named the &lt;STRONG&gt;temp table&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;From the &lt;STRONG&gt;temp table&lt;/STRONG&gt;, the data is ingested into a warehouse. After ingestion, an &lt;STRONG&gt;inner join&lt;/STRONG&gt; is performed between the &lt;STRONG&gt;temp table&lt;/STRONG&gt; and the warehouse to identify the records that match in both places. These matched records are marked as &lt;STRONG&gt;loaded&lt;/STRONG&gt; and saved into the Silver layer of the Lakehouse.&lt;/P&gt;&lt;H3&gt;Problem:&lt;/H3&gt;&lt;P&gt;If the pipeline breaks after the data is ingested into the warehouse but before it is marked as loaded and saved into the Silver layer, the incoming new data will overwrite the &lt;STRONG&gt;temp table&lt;/STRONG&gt;, causing a loss of unprocessed records from the previous run. This happens because the Silver delta tables in Fabric are immutable, which led me to adopt this approach.&lt;/P&gt;&lt;H3&gt;Questions:&lt;/H3&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;What should I use as an audit layer?&lt;/STRONG&gt;&lt;BR /&gt;How can I ensure that the records loaded into the warehouse are properly tracked, especially in the event of a pipeline failure?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;How can I maintain which records have been loaded into the warehouse and which have not?&lt;/STRONG&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Thu, 23 Jan 2025 08:52:40 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4377389#M6415</guid>
      <dc:creator>Hussain_Abedi</dc:creator>
      <dc:date>2025-01-23T08:52:40Z</dc:date>
    </item>
    <item>
      <title>Re: Handling Pipeline Failures and Audit Layer for Data Ingestion in Microsoft Fabric Lakehouse</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4381956#M6521</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/917648"&gt;@Hussain_Abedi&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;Thanks for reaching out to the Microsoft Fabric Community forum.&lt;BR /&gt;To address your data ingestion challenges and prevent data loss during pipeline failures, consider the following strategies:&lt;BR /&gt;&lt;BR /&gt;1.&amp;nbsp;Create a dedicated audit table that logs the status of each record processed during the pipeline. This table can include the following columns: record_id, ingestion_timestamp, status, error_message.&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;2.&amp;nbsp;Instead of directly overwriting the temp table, consider using a staging area where you can store the new incoming data. This allows you to keep the previous run's data until you confirm that the new data has been successfully processed:&lt;/P&gt;
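&lt;P&gt;As a rough illustration of such an audit table, here is a minimal Python sketch. It assumes an in-memory SQLite table as a stand-in for a Delta table in the Lakehouse. The table name ingestion_audit, the helper functions, and the status values "pending", "loaded", and "failed" are hypothetical; only the four column names come from the suggestion above.&lt;/P&gt;

```python
import sqlite3
from datetime import datetime, timezone

# Assumption: SQLite stands in for a Lakehouse Delta table so the sketch runs anywhere.
# The table name "ingestion_audit" is hypothetical; the columns follow the list above.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE ingestion_audit (
        record_id TEXT PRIMARY KEY,
        ingestion_timestamp TEXT NOT NULL,
        status TEXT NOT NULL,
        error_message TEXT
    )
    """
)


def log_status(record_id, status, error_message=None):
    """Insert or overwrite the audit row for one record."""
    now = datetime.now(timezone.utc).isoformat()
    conn.execute(
        "INSERT OR REPLACE INTO ingestion_audit VALUES (?, ?, ?, ?)",
        (record_id, now, status, error_message),
    )


def records_with_status(status):
    """Return the ids of all records currently in the given status."""
    rows = conn.execute(
        "SELECT record_id FROM ingestion_audit WHERE status = ? ORDER BY record_id",
        (status,),
    )
    return [row[0] for row in rows]


# Hypothetical sample run: r1 is loaded, r2 is still pending, r3 failed.
log_status("r1", "pending")
log_status("r2", "pending")
log_status("r1", "loaded")
log_status("r3", "failed", "warehouse write timed out")
```

&lt;P&gt;After a failed run, calling records_with_status("pending") or records_with_status("failed") would list the records that still need to be reprocessed.&lt;/P&gt;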
&lt;UL class="lia-list-style-type-disc"&gt;
&lt;LI&gt;Ingest new data into the staging area.&lt;/LI&gt;
&lt;LI&gt;Perform an inner join between the temp table and the staging area to identify matching records.&lt;/LI&gt;
&lt;LI&gt;Once records are successfully ingested into the warehouse, update the audit table to mark them as "loaded."&lt;/LI&gt;
&lt;LI&gt;Overwrite the temp table with the new data only after all records are processed and marked.&lt;/LI&gt;
&lt;LI&gt;If a pipeline failure occurs, query the audit table to identify which records have not been marked as "loaded" and need to be reprocessed.&lt;/LI&gt;
&lt;/UL&gt;
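&lt;P&gt;The steps above could be sketched roughly as follows. This is an illustration under stated assumptions, not a Fabric implementation: plain Python dicts keyed by record id stand in for the staging area, temp table, warehouse, and audit table, and the function names and the fail_after_load flag are hypothetical. The sketch also assumes the matching step compares staged records against the warehouse, as described in the original question.&lt;/P&gt;

```python
def run_pipeline(staging, temp, warehouse, audit, fail_after_load=False):
    """One pipeline run over dict stand-ins keyed by record id (all hypothetical)."""
    # 1. Register every staged record as pending before touching the warehouse.
    for record_id in staging:
        audit.setdefault(record_id, "pending")

    # 2. Load the staged rows into the warehouse. Keying by record id makes
    #    a rerun overwrite the same rows instead of duplicating them.
    warehouse.update(staging)

    if fail_after_load:
        # Simulate a crash after the load but before records are marked.
        raise RuntimeError("pipeline failed after warehouse load")

    # 3. Records present in both staging and the warehouse (the inner join)
    #    are marked as loaded in the audit table.
    for record_id in set(staging).intersection(warehouse):
        audit[record_id] = "loaded"

    # 4. Only now overwrite the temp table with the new data and clear staging.
    temp.clear()
    temp.update(staging)
    staging.clear()


def unloaded_records(audit):
    """Ids of records that have not been marked as loaded."""
    return sorted(rid for rid, status in audit.items() if status != "loaded")


# Hypothetical sample data.
staging = {"r1": {"amount": 10}, "r2": {"amount": 20}}
temp, warehouse, audit = {}, {}, {}

try:
    run_pipeline(staging, temp, warehouse, audit, fail_after_load=True)
except RuntimeError:
    pass

# The failed run left staging intact, and the audit table shows what is pending.
pending_after_failure = unloaded_records(audit)

# Rerunning picks up the same staged rows and completes the marking.
run_pipeline(staging, temp, warehouse, audit)
pending_after_retry = unloaded_records(audit)
```

&lt;P&gt;Because the temp table is only overwritten in the last step, a failure between the warehouse load and the marking step leaves the staged rows available for the next run.&lt;/P&gt;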
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If this post was helpful, please consider marking &lt;STRONG&gt;Accept as solution&lt;/STRONG&gt; to assist other members in finding it more easily.&lt;/P&gt;
&lt;P&gt;If you continue to face issues, feel free to reach out to us for further assistance!&lt;/P&gt;</description>
      <pubDate>Mon, 27 Jan 2025 08:24:41 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4381956#M6521</guid>
      <dc:creator>v-aatheeque</dc:creator>
      <dc:date>2025-01-27T08:24:41Z</dc:date>
    </item>
    <item>
      <title>Re: Handling Pipeline Failures and Audit Layer for Data Ingestion in Microsoft Fabric Lakehouse</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4389634#M6754</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/917648"&gt;@Hussain_Abedi&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;We haven’t heard back from you since our last response and wanted to check whether your query has been resolved. If this post was helpful, please consider marking &lt;SPAN&gt;Accept as solution&lt;/SPAN&gt; to assist other members in finding it more easily.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you continue to face issues, feel free to reach out to us for further assistance!&lt;/P&gt;
&lt;P&gt;Thanks for using Microsoft Fabric Community.&lt;/P&gt;</description>
      <pubDate>Fri, 31 Jan 2025 09:48:41 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4389634#M6754</guid>
      <dc:creator>v-aatheeque</dc:creator>
      <dc:date>2025-01-31T09:48:41Z</dc:date>
    </item>
    <item>
      <title>Re: Handling Pipeline Failures and Audit Layer for Data Ingestion in Microsoft Fabric Lakehouse</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4393851#M6908</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;A href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/917648" target="_blank"&gt;@Hussain_Abedi&lt;/A&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;We haven’t heard back from you since our last response and wanted to check whether your query has been resolved. If this post was helpful, please consider marking &lt;SPAN&gt;Accept as solution&lt;/SPAN&gt; to assist other members in finding it more easily.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you continue to face issues, feel free to reach out to us for further assistance!&lt;/P&gt;
&lt;P&gt;Thanks for using Microsoft Fabric Community.&lt;/P&gt;</description>
      <pubDate>Tue, 04 Feb 2025 08:41:45 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4393851#M6908</guid>
      <dc:creator>v-aatheeque</dc:creator>
      <dc:date>2025-02-04T08:41:45Z</dc:date>
    </item>
    <item>
      <title>Re: Handling Pipeline Failures and Audit Layer for Data Ingestion in Microsoft Fabric Lakehouse</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4400443#M7054</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/917648"&gt;@Hussain_Abedi&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;We haven’t heard back from you since our last response and wanted to check whether your query has been resolved. If this post was helpful, please consider marking &lt;SPAN&gt;Accept as solution&lt;/SPAN&gt; to assist other members in finding it more easily.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you continue to face issues, feel free to reach out to us for further assistance!&lt;/P&gt;
&lt;P&gt;Thanks for using Microsoft Fabric Community.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Feb 2025 17:03:16 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Handling-Pipeline-Failures-and-Audit-Layer-for-Data-Ingestion-in/m-p/4400443#M7054</guid>
      <dc:creator>v-aatheeque</dc:creator>
      <dc:date>2025-02-07T17:03:16Z</dc:date>
    </item>
  </channel>
</rss>

