<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Pipeline. Delta Load JSON Files. Filtering out processed files not working in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Pipeline-Delta-Load-JSON-Files-Filtering-out-processed-files-not/m-p/4209050#M4376</link>
    <description>&lt;P&gt;All sorted. The condition needed a slight change&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/213851"&gt;@IF&lt;/a&gt;(empty(activity('LookupProcessedFiles').output.value), true, not(contains(string(activity('LookupProcessedFiles').output.value), item().name)))&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now wrapped in String&lt;/P&gt;</description>
    <pubDate>Tue, 01 Oct 2024 14:58:50 GMT</pubDate>
    <dc:creator>DebbieE</dc:creator>
    <dc:date>2024-10-01T14:58:50Z</dc:date>
    <item>
      <title>Pipeline. Delta Load JSON Files. Filtering out processed files not working</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Pipeline-Delta-Load-JSON-Files-Filtering-out-processed-files-not/m-p/4208953#M4371</link>
      <description>&lt;P&gt;I am trying to load JSON files from a Data Lake into SQL DW using Pipelines&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a table&amp;nbsp;LOG_PROCESSED_FILE and a stored Procedure. at the end of the process the SP records the processed File&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;And the current state is that I have processed 2 files and a new file is waiting to be loaded&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. GetJSONFiles Get Meta data activity File List 'Child Items'&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here is the output&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;{ "childItems": [ { "name": "A.json", "type": "File" }, { "name": "B.json", "type": "File" }, { "name": "C.json", "type": "File" } ], "effectiveIntegrationRuntime": "AutoResolveIntegrationRuntime (UK South)", "executionDuration": 1, "durationInQueue": { "integrationRuntimeQueue": 10 }, "billingReference": { "activityType": "PipelineActivity", "billableDuration": [ { "meterType": "AzureIR", "duration": 0.016666666666666666, "unit": "Hours" } ] } } &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. Lookup LookupProcessedFiles&amp;nbsp;SELECT FileName FROM [framework].[PROCESSED_FILE_LOG] connected to my data warehouse&lt;/P&gt;&lt;P&gt;Here is the output&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;{ "count": 2, "value": [ { "FileName": "A.json" }, { "FileName": "B.json" } ], "effectiveIntegrationRuntime": "AutoResolveIntegrationRuntime (UK South)", "billingReference": { "activityType": "PipelineActivity", "billableDuration": [ { "meterType": "AzureIR", "duration": 0.016666666666666666, "unit": "Hours" } ], "totalBillableDuration": [ { "meterType": "AzureIR", "duration": 0.016666666666666666, "unit": "Hours" } ] }, "durationInQueue": { "integrationRuntimeQueue": 0 } } &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;3. A Filter. FilterProcessedFiles&lt;/P&gt;&lt;P&gt;Items:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;activity(&lt;/SPAN&gt;&lt;SPAN&gt;'GetJsonFiles'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;childItems&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Condition :&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;if(empty(activity(&lt;/SPAN&gt;&lt;SPAN&gt;'LookupProcessedFiles'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;value&lt;/SPAN&gt;&lt;SPAN&gt;), &lt;/SPAN&gt;&lt;SPAN&gt;true&lt;/SPAN&gt;&lt;SPAN&gt;, not(contains(activity(&lt;/SPAN&gt;&lt;SPAN&gt;'LookupProcessedFiles'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;value&lt;/SPAN&gt;&lt;SPAN&gt;, item().&lt;/SPAN&gt;&lt;SPAN&gt;name&lt;/SPAN&gt;&lt;SPAN&gt;)))&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;This is where its going wrong. On the condition. The output is&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;{ "ItemsCount": 3, "FilteredItemsCount": 3, "Value": [ { "name": "A.json", "type": "File" }, { "name": "Bjson", "type": "File" }, { "name": "C.json", "type": "File" } ] }&amp;nbsp;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;and it should be just C&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;After this I have a foreach activity containing the copy activity and the stored Procedure activity. These seem to be working fine.&amp;nbsp;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;I have researched this condition all day and I havent got any further. Could anyone help me understand what the condition needs to be to get this working. Of if I need to change what I have?&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;</description>
      <pubDate>Tue, 01 Oct 2024 14:33:59 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Pipeline-Delta-Load-JSON-Files-Filtering-out-processed-files-not/m-p/4208953#M4371</guid>
      <dc:creator>DebbieE</dc:creator>
      <dc:date>2024-10-01T14:33:59Z</dc:date>
    </item>
    <item>
      <title>Re: Pipeline. Delta Load JSON Files. Filtering out processed files not working</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Pipeline-Delta-Load-JSON-Files-Filtering-out-processed-files-not/m-p/4209050#M4376</link>
      <description>&lt;P&gt;All sorted. The condition needed a slight change&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/213851"&gt;@IF&lt;/a&gt;(empty(activity('LookupProcessedFiles').output.value), true, not(contains(string(activity('LookupProcessedFiles').output.value), item().name)))&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now wrapped in String&lt;/P&gt;</description>
      <pubDate>Tue, 01 Oct 2024 14:58:50 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Pipeline-Delta-Load-JSON-Files-Filtering-out-processed-files-not/m-p/4209050#M4376</guid>
      <dc:creator>DebbieE</dc:creator>
      <dc:date>2024-10-01T14:58:50Z</dc:date>
    </item>
    <item>
      <title>Re: Pipeline. Delta Load JSON Files. Filtering out processed files not working</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Pipeline-Delta-Load-JSON-Files-Filtering-out-processed-files-not/m-p/4210754#M4380</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/76077"&gt;@DebbieE&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Glad you found your problem, and you can mark your approach as a solution to help more people with similar problems find answers. Thanks for your understand.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P style="text-align: justify; text-justify: inter-ideograph;"&gt;&lt;SPAN&gt;Best Regards,&lt;BR /&gt;Yang&lt;/SPAN&gt;&lt;SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;Community Support Team&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 02 Oct 2024 02:19:23 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Pipeline-Delta-Load-JSON-Files-Filtering-out-processed-files-not/m-p/4210754#M4380</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-10-02T02:19:23Z</dc:date>
    </item>
  </channel>
</rss>

