<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Fabric Pipeline. Data flow. Removing items from a Filter with GetMetaData and Lookup in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061644#M3276</link>
    <description>&lt;P&gt;Perhaps you can do something similar to this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a Bronze Lakehouse and a Silver Lakehouse.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The files in my Bronze Lakehouse are as follows:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_0-1721915655333.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138939i71C66089077D1961/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_0-1721915655333.png" alt="frithjof_v_0-1721915655333.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The files in my Silver Lakehouse are as follows:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_1-1721915907699.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138941iB8B88A3A5A64E67C/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_1-1721915907699.png" alt="frithjof_v_1-1721915907699.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I made a pipeline like this:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_2-1721915943800.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138942i36E5350920F2215A/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_2-1721915943800.png" alt="frithjof_v_2-1721915943800.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The two Get Metadata activities get the Child items metadata from the File folder in the Bronze lakehouse and the Silver lakehouse, respectively.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" 
image-alt="frithjof_v_3-1721916050920.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138944i85B9C9E54DB0F105/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_3-1721916050920.png" alt="frithjof_v_3-1721916050920.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The Filter activity removes the lookup.csv file from the output of the Get Metadata activity for the Bronze lakehouse:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_4-1721916166056.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138946i88E4D05CD5C6CC46/image-size/medium?v=v2&amp;amp;px=400" role="button" title="frithjof_v_4-1721916166056.png" alt="frithjof_v_4-1721916166056.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Items:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;activity(&lt;/SPAN&gt;&lt;SPAN&gt;'Get Metadata Bronze'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;childItems&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Condition:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;not(equals(item().&lt;/SPAN&gt;&lt;SPAN&gt;name&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;'lookup.csv'&lt;/SPAN&gt;&lt;SPAN&gt;))&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The Items setting in the ForEach activity is the output from the Filter activity:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_5-1721916215603.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138948iC95B40A7267BF097/image-size/medium?v=v2&amp;amp;px=400" role="button" title="frithjof_v_5-1721916215603.png" alt="frithjof_v_5-1721916215603.png" 
/&gt;&lt;/span&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Items: @&lt;/SPAN&gt;&lt;SPAN&gt;activity(&lt;/SPAN&gt;&lt;SPAN&gt;'Filter Away Lookup file'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;Value&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The If Condition inside the ForEach activity:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_6-1721916273266.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138950i448341D5A7E14649/image-size/medium?v=v2&amp;amp;px=400" role="button" title="frithjof_v_6-1721916273266.png" alt="frithjof_v_6-1721916273266.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Expression:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;contains(activity(&lt;/SPAN&gt;&lt;SPAN&gt;'Get Metadata Silver'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;childItems&lt;/SPAN&gt;&lt;SPAN&gt;, item())&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The Copy activity if the If Condition is False:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_7-1721916333614.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138952i3FB8392306D39653/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_7-1721916333614.png" alt="frithjof_v_7-1721916333614.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_8-1721916358371.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138953i04FDE449078CD174/image-size/large?v=v2&amp;amp;px=999" role="button" 
title="frithjof_v_8-1721916358371.png" alt="frithjof_v_8-1721916358371.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;After I run the pipeline, the Proje.csv file has been copied to Silver:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_9-1721916448145.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138956i6B18C7EC81924EB5/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_9-1721916448145.png" alt="frithjof_v_9-1721916448145.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don't know whether Fabric Data Pipeline has any limits (such as output size, number of items in a collection, or number of items in a ForEach activity) that need to be taken into consideration. If it does, the pipeline could fail or give unexpected results once the number of files in any of the folders grows beyond those limits.&lt;/P&gt;</description>
    <pubDate>Thu, 25 Jul 2024 14:07:50 GMT</pubDate>
    <dc:creator>frithjof_v</dc:creator>
    <dc:date>2024-07-25T14:07:50Z</dc:date>
    <item>
      <title>Fabric Pipeline. Data flow. Removing items from a Filter with GetMetaData and Lookup</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4027330#M2868</link>
      <description>&lt;P&gt;OK, so I have a Bronze data lake and a Silver data lake.&lt;/P&gt;&lt;P&gt;In Silver I have a Parquet file of processed file names, e.g.&lt;/P&gt;&lt;P&gt;Proja.csv&lt;/P&gt;&lt;P&gt;Projb.csv&lt;/P&gt;&lt;P&gt;Projc.csv&lt;/P&gt;&lt;P&gt;Projd.csv&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;And in the dataflow I have a Get Metadata activity connected to the child items in my Bronze data lake, so it's finding these files:&lt;/P&gt;&lt;P&gt;lookup.csv&lt;/P&gt;&lt;P&gt;Proja.csv&lt;/P&gt;&lt;P&gt;Projb.csv&lt;/P&gt;&lt;P&gt;Projc.csv&lt;/P&gt;&lt;P&gt;Projd.csv&lt;/P&gt;&lt;P&gt;Proje.csv (which is the new file)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I then have a filter to remove the lookup.csv file:&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;and(equals(item().&lt;/SPAN&gt;&lt;SPAN&gt;type&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;SPAN&gt;'File'&lt;/SPAN&gt;&lt;SPAN&gt;),startswith(item().&lt;/SPAN&gt;&lt;SPAN&gt;name&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;SPAN&gt;'Proj'&lt;/SPAN&gt;&lt;SPAN&gt;))&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;And now I want to get a list of everything in Get Metadata that doesn't exist in the lookup,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;which would leave me with&amp;nbsp;Proje.csv.&amp;nbsp;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;The hope is that I can use this to run a notebook so it only uses these files (not sure how to do that yet, but I'm concentrating on the first bit).&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;I thought I could add another Lookup and 
connect it to the Filter (source files) and the Lookup (processed files), but I'm really stuck at this point.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Should I be using a Lookup, and what code should I use to&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;get all items from the Filter that are not in the Lookup?&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Jul 2024 09:22:53 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4027330#M2868</guid>
      <dc:creator>DebbieE</dc:creator>
      <dc:date>2024-07-10T09:22:53Z</dc:date>
    </item>
    <item>
      <title>Re: Data flow. Removing items from a Filter with GetMetaData and Lookup</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4029242#M2880</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/76077"&gt;@DebbieE&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;I think you need a reference list or query result to compare the current items against. Without one, there is no way to tell which items are missing and to filter on them.&lt;/P&gt;
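&lt;P&gt;To illustrate that kind of comparison, here is a minimal sketch in Python (plain Python, not pipeline expression syntax). The function name unprocessed_files and the two sample lists are hypothetical and simply mirror the file names from the question: it keeps only the 'Proj' files that are missing from the reference list.&lt;/P&gt;
&lt;PRE&gt;
```python
# Hypothetical sample data mirroring the file names in the question.
# bronze_items imitates the shape of a Get Metadata childItems list.
bronze_items = [
    {"name": "lookup.csv", "type": "File"},
    {"name": "Proja.csv", "type": "File"},
    {"name": "Projb.csv", "type": "File"},
    {"name": "Projc.csv", "type": "File"},
    {"name": "Projd.csv", "type": "File"},
    {"name": "Proje.csv", "type": "File"},
]

# Reference list: names of files that were already processed.
processed_names = ["Proja.csv", "Projb.csv", "Projc.csv", "Projd.csv"]


def unprocessed_files(current_items, reference_names):
    """Return names of 'Proj*' files that are not in the reference list."""
    reference = set(reference_names)  # a set gives fast membership checks
    return [
        item["name"]
        for item in current_items
        if item["type"] == "File"
        and item["name"].startswith("Proj")
        and item["name"] not in reference
    ]


print(unprocessed_files(bronze_items, processed_names))
```
&lt;/PRE&gt;
&lt;P&gt;With the sample data above, this prints ['Proje.csv'], the only file that is not yet in the reference list.&lt;/P&gt;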
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Xiaoxin Sheng&lt;/P&gt;</description>
      <pubDate>Mon, 08 Jul 2024 03:23:46 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4029242#M2880</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-07-08T03:23:46Z</dc:date>
    </item>
    <item>
      <title>Re: Data flow. Removing items from a Filter with GetMetaData and Lookup</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4034246#M2937</link>
      <description>&lt;P&gt;I would need more specific information on how to go about that. This is all in a Fabric pipeline.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Jul 2024 09:23:33 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4034246#M2937</guid>
      <dc:creator>DebbieE</dc:creator>
      <dc:date>2024-07-10T09:23:33Z</dc:date>
    </item>
    <item>
      <title>Re: Data flow. Removing items from a Filter with GetMetaData and Lookup</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4058383#M3240</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/76077"&gt;@DebbieE&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Here is the documentation on using a dataflow in a data pipeline. You can use the M query editor to work with the records in the query table:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/fabric/data-factory/tutorial-dataflows-gen2-pipeline-activity" target="_blank"&gt;Use a dataflow in a pipeline - Microsoft Fabric | Microsoft Learn&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Xiaoxin Sheng&lt;/P&gt;</description>
      <pubDate>Wed, 24 Jul 2024 08:22:02 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4058383#M3240</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-07-24T08:22:02Z</dc:date>
    </item>
    <item>
      <title>Re: Fabric Pipeline. Data flow. Removing items from a Filter with GetMetaData and Lookup</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061644#M3276</link>
      <description>&lt;P&gt;Perhaps you can do something similar to this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a Bronze Lakehouse and a Silver Lakehouse.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The files in my Bronze Lakehouse are as follows:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_0-1721915655333.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138939i71C66089077D1961/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_0-1721915655333.png" alt="frithjof_v_0-1721915655333.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The files in my Silver Lakehouse are as follows:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_1-1721915907699.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138941iB8B88A3A5A64E67C/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_1-1721915907699.png" alt="frithjof_v_1-1721915907699.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I made a pipeline like this:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_2-1721915943800.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138942i36E5350920F2215A/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_2-1721915943800.png" alt="frithjof_v_2-1721915943800.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The two Get Metadata activities get the Child items metadata from the File folder in the Bronze lakehouse and the Silver lakehouse, respectively.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" 
image-alt="frithjof_v_3-1721916050920.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138944i85B9C9E54DB0F105/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_3-1721916050920.png" alt="frithjof_v_3-1721916050920.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The Filter activity removes the lookup.csv file from the output of the Get Metadata activity for the Bronze lakehouse:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_4-1721916166056.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138946i88E4D05CD5C6CC46/image-size/medium?v=v2&amp;amp;px=400" role="button" title="frithjof_v_4-1721916166056.png" alt="frithjof_v_4-1721916166056.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Items:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;activity(&lt;/SPAN&gt;&lt;SPAN&gt;'Get Metadata Bronze'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;childItems&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Condition:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;not(equals(item().&lt;/SPAN&gt;&lt;SPAN&gt;name&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;'lookup.csv'&lt;/SPAN&gt;&lt;SPAN&gt;))&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The Items setting in the ForEach activity is the output from the Filter activity:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_5-1721916215603.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138948iC95B40A7267BF097/image-size/medium?v=v2&amp;amp;px=400" role="button" title="frithjof_v_5-1721916215603.png" alt="frithjof_v_5-1721916215603.png" 
/&gt;&lt;/span&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Items: @&lt;/SPAN&gt;&lt;SPAN&gt;activity(&lt;/SPAN&gt;&lt;SPAN&gt;'Filter Away Lookup file'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;Value&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The If Condition inside the ForEach activity:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_6-1721916273266.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138950i448341D5A7E14649/image-size/medium?v=v2&amp;amp;px=400" role="button" title="frithjof_v_6-1721916273266.png" alt="frithjof_v_6-1721916273266.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Expression:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;contains(activity(&lt;/SPAN&gt;&lt;SPAN&gt;'Get Metadata Silver'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;childItems&lt;/SPAN&gt;&lt;SPAN&gt;, item())&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The Copy activity if the If Condition is False:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_7-1721916333614.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138952i3FB8392306D39653/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_7-1721916333614.png" alt="frithjof_v_7-1721916333614.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_8-1721916358371.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138953i04FDE449078CD174/image-size/large?v=v2&amp;amp;px=999" role="button" 
title="frithjof_v_8-1721916358371.png" alt="frithjof_v_8-1721916358371.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;After I run the pipeline, the Proje.csv file has been copied to Silver:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_9-1721916448145.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138956i6B18C7EC81924EB5/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_9-1721916448145.png" alt="frithjof_v_9-1721916448145.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don't know whether Fabric Data Pipeline has any limits (such as output size, number of items in a collection, or number of items in a ForEach activity) that need to be taken into consideration. If it does, the pipeline could fail or give unexpected results once the number of files in any of the folders grows beyond those limits.&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 14:07:50 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061644#M3276</guid>
      <dc:creator>frithjof_v</dc:creator>
      <dc:date>2024-07-25T14:07:50Z</dc:date>
    </item>
    <item>
      <title>Re: Fabric Pipeline. Data flow. Removing items from a Filter with GetMetaData and Lookup</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061668#M3278</link>
      <description>&lt;P&gt;If there is a more efficient way to compare the two collections of child items from Get Metadata Silver and Get Metadata Bronze and return only the items that exist in Get Metadata Bronze alone, then I would like to know.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;(I am wondering whether some kind of anti-join functionality exists, or something similar.&lt;BR /&gt;Perhaps some way to subtract one array from another, keeping only the items that appear in the first array but not in the second?)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In my solution, I am using the ForEach activity with an If Condition inside to achieve a similar effect.&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 15:33:37 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061668#M3278</guid>
      <dc:creator>frithjof_v</dc:creator>
      <dc:date>2024-07-25T15:33:37Z</dc:date>
    </item>
    <item>
      <title>Re: Fabric Pipeline. Data flow. Removing items from a Filter with GetMetaData and Lookup</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061802#M3279</link>
      <description>&lt;P&gt;If you want to use the lookup.csv file to lookup which files don't need to be processed again (instead of using the file names in the Silver lakehouse directory for this purpose):&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_1-1721920576261.png" style="width: 999px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1138998i58AEE63E307323CD/image-size/large?v=v2&amp;amp;px=999" role="button" title="frithjof_v_1-1721920576261.png" alt="frithjof_v_1-1721920576261.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In my case, the lookup.csv file has the following content:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_2-1721920667584.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1139000i93EC40F59DF439E1/image-size/medium?v=v2&amp;amp;px=400" role="button" title="frithjof_v_2-1721920667584.png" alt="frithjof_v_2-1721920667584.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The 'ForEach LookupFileRow' activity:&lt;/P&gt;&lt;P&gt;Items:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;activity(&lt;/SPAN&gt;&lt;SPAN&gt;'Get Lookup File Content'&lt;/SPAN&gt;&lt;SPAN&gt;).&lt;/SPAN&gt;&lt;SPAN&gt;output&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;value&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The&amp;nbsp;'Append varLookupFileNames' activity inside the 'ForEach LookupFileRow' activity:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="frithjof_v_3-1721920804950.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1139003iEFCE53EEE7A769A1/image-size/medium?v=v2&amp;amp;px=400" role="button" 
title="frithjof_v_3-1721920804950.png" alt="frithjof_v_3-1721920804950.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The 'IF Condition' inside the 'ForEach' activity:&lt;/P&gt;&lt;P&gt;Expression:&amp;nbsp;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;contains(variables(&lt;/SPAN&gt;&lt;SPAN&gt;'varLookupFileNames'&lt;/SPAN&gt;&lt;SPAN&gt;), item().&lt;/SPAN&gt;&lt;SPAN&gt;name&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Otherwise it is similar to the previous example pipeline.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I don't know whether Fabric Data Pipeline has any size limits (such as output size, number of items in a collection, number of items in a ForEach activity, or result size in a Lookup activity) that need to be taken into consideration. If it does, the pipeline could fail or give unexpected results once the number of files in any of the folders grows beyond those limits.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 15:22:21 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061802#M3279</guid>
      <dc:creator>frithjof_v</dc:creator>
      <dc:date>2024-07-25T15:22:21Z</dc:date>
    </item>
    <item>
      <title>Re: Fabric Pipeline. Data flow. Removing items from a Filter with GetMetaData and Lookup</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061819#M3280</link>
      <description>&lt;P&gt;For example, the Lookup activity has some limitations:&lt;/P&gt;&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/fabric/data-factory/lookup-activity#supported-capabilities" target="_blank"&gt;Lookup activity - Microsoft Fabric | Microsoft Learn&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 15:29:51 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Fabric-Pipeline-Data-flow-Removing-items-from-a-Filter-with/m-p/4061819#M3280</guid>
      <dc:creator>frithjof_v</dc:creator>
      <dc:date>2024-07-25T15:29:51Z</dc:date>
    </item>
  </channel>
</rss>

