<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to get a list of the blob with full path stored in Azure Blob Storage Container in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3971132#M2411</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/320559"&gt;@rashidanwar&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;We haven’t heard from you on the last response and was just checking back to see if your query was answered.&lt;BR /&gt;Otherwise, will respond back with the more details and we will try to help .&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Tue, 04 Jun 2024 11:33:50 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-06-04T11:33:50Z</dc:date>
    <item>
      <title>How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3966396#M2378</link>
      <description>&lt;DIV&gt;&lt;SPAN&gt;Hi Everyone&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Any help with the following would be highly appreciated.&lt;BR /&gt;&lt;BR /&gt;I am using the following code to get the Content of the Parquet files stored in an Azure Blob Storage Container.&amp;nbsp; The below code gets the data of all the files successfully. I am using a Fabric PySpark Notebook for this.&lt;BR /&gt;&lt;BR /&gt;Now I have two issues that need to be resolved.&lt;BR /&gt;&lt;BR /&gt;1. I am only able to get the data when the access level of the container is set to "anonymus". I want to get the code worked when the access level of the container is set to "private".&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;2.&amp;nbsp;I want to get a list of all the blobs stored in the container and the list should contain the complete path to the blob. Like if I have 2 files stored in the container and their complete path should be as follows.&lt;BR /&gt;Entity/App/20240521/00001.parquet&lt;BR /&gt;Entity/App/20240525/00001.parquet&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;Code:&lt;/STRONG&gt;&lt;BR /&gt;&lt;EM&gt;from&lt;/EM&gt;&lt;/SPAN&gt;&lt;EM&gt;&amp;nbsp;pyspark.sql&amp;nbsp;import&amp;nbsp;SparkSession&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;from&amp;nbsp;pyspark&amp;nbsp;import&amp;nbsp;SparkContext&lt;/EM&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;EM&gt;blob_account_name =&amp;nbsp;"parquetfiles1"&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;blob_container_name =&amp;nbsp;"container1"&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;blob_sas_token =&amp;nbsp;"sp=rl...."&lt;/EM&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;EM&gt;# Initialize Spark Session&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;spark = SparkSession.builder.appName("azure").getOrCreate()&lt;/EM&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;EM&gt;# Set the configuration&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;spark.conf.set(&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;f'spark.hadoop.fs.azure.sas.{blob_container_name}.{blob_account_name}.blob.core.windows.net',&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;blob_sas_token)&lt;/EM&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;EM&gt;# Build the base path for the container (without specific file path)&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;base_path =&amp;nbsp;f'wasbs://{blob_container_name}@{blob_account_name}.blob.core.windows.net'&lt;/EM&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;EM&gt;df = (&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; &amp;nbsp; spark.read.format("parquet")&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; &amp;nbsp; .option("recursiveFileLookup",&amp;nbsp;"true")&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;&amp;nbsp; &amp;nbsp; .load(base_path)&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;)&lt;/EM&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;EM&gt;display(df)&lt;BR /&gt;&lt;BR /&gt;&lt;/EM&gt;Regards&lt;BR /&gt;Rashid Anwar&lt;/DIV&gt;</description>
      <pubDate>Sun, 02 Jun 2024 08:10:27 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3966396#M2378</guid>
      <dc:creator>rashidanwar</dc:creator>
      <dc:date>2024-06-02T08:10:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3968643#M2396</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/320559"&gt;@rashidanwar&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;Thanks for using Fabric Community.&lt;BR /&gt;Regarding query 1, can you please refer -&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/answers/questions/1289365/azure-storage-container-access-is-set-to-private-h" target="_blank" rel="noopener"&gt;click here&lt;/A&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Additional Docs -&lt;BR /&gt;&lt;A href="https://www.youtube.com/watch?v=zqIprwv9euk" target="_blank"&gt;Loading data from ADLS behind firewalls to Fabric Lakehouse (youtube.com)&lt;/A&gt;&lt;BR /&gt;&lt;A href="https://learn.microsoft.com/en-us/fabric/security/security-managed-private-endpoints-overview" target="_blank"&gt;Overview of managed private endpoints for Microsoft Fabric - Microsoft Fabric | Microsoft Learn&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Regarding query 2, can you please check below docs -&lt;BR /&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/microsoft-spark-utilities?pivots=programming-language-python#list-files" target="_blank"&gt;Introduction to Microsoft Spark utilities - Azure Synapse Analytics | Microsoft Learn&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Hope this is helpful. Please do let me know incase of further queries.&lt;/P&gt;</description>
      <pubDate>Mon, 03 Jun 2024 12:02:14 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3968643#M2396</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-06-03T12:02:14Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3971132#M2411</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/320559"&gt;@rashidanwar&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;We haven’t heard from you on the last response and was just checking back to see if your query was answered.&lt;BR /&gt;Otherwise, will respond back with the more details and we will try to help .&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Tue, 04 Jun 2024 11:33:50 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3971132#M2411</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-06-04T11:33:50Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3971436#M2412</link>
      <description>&lt;P&gt;Thank you&amp;nbsp;@Anonymous!&lt;BR /&gt;Let me try your solution and I'll get back to you.&lt;/P&gt;</description>
      <pubDate>Tue, 04 Jun 2024 12:53:43 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3971436#M2412</guid>
      <dc:creator>rashidanwar</dc:creator>
      <dc:date>2024-06-04T12:53:43Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3972715#M2424</link>
      <description>&lt;P&gt;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/320559"&gt;@rashidanwar&lt;/a&gt;&amp;nbsp;- Have you tried using the _metadata properties as part of your df, if you want it alongside the data&lt;/P&gt;&lt;P&gt;- df = spark.read.format("parquet").select("*","_metadata.file_path","_metadata.file_name",")&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If you just want to list out the files have you tried this; I do not fully understand your use case so this may not be what you are trying to achieve.&lt;/P&gt;&lt;P&gt;-&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/fabric/data-engineering/microsoft-spark-utilities#list-files" target="_blank"&gt;https://learn.microsoft.com/en-us/fabric/data-engineering/microsoft-spark-utilities#list-files&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 05 Jun 2024 02:15:50 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3972715#M2424</guid>
      <dc:creator>jwinchell40</dc:creator>
      <dc:date>2024-06-05T02:15:50Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3982646#M2470</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/320559"&gt;@rashidanwar&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;We haven’t heard from you on the last response and was just checking back to see if you have a resolution yet .&lt;BR /&gt;In case if you have any resolution please do share that same with the community as it can be helpful to others . &lt;BR /&gt;Otherwise, will respond back with the more details and we will try to help . &lt;BR /&gt;Thanks&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Jun 2024 05:59:10 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3982646#M2470</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-06-10T05:59:10Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3983523#M2484</link>
      <description>&lt;P&gt;@Anonymous,&lt;BR /&gt;regarding query 1: Creating managed private endpoints in Fabric is only available for workspaces assigned to Fabric capacities with SKUs F64 or larger. Unfortunately, this does not help in my cas&lt;/P&gt;&lt;P&gt;I have tried every suggestion but without success. I've had multiple meetings with the Microsoft team, and they haven't been able to resolve the issue yet. I'm at a loss for what to do next.&lt;/P&gt;&lt;P&gt;I have Parquet files stored in Azure Blob Storage, some of which have nested data structures. Initially, I tried to directly consume the files using Power BI, but the Parquet.Document() function in Power Query cannot read the nested data structure and throws an error. Interestingly, I have been able to access my Azure Blob Storage data in Power BI with the container access level set to "Private". I then decided to use a Fabric PySpark Notebook to retrieve the data, but I haven't had any success with that either. In fabric I am to get the data unanimously but can't get when access level is set to Private.&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Jun 2024 12:16:59 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3983523#M2484</guid>
      <dc:creator>rashidanwar</dc:creator>
      <dc:date>2024-06-10T12:16:59Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985322#M2508</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/320559"&gt;@rashidanwar&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;At present inorder to access the data with access level set to 'Private' is by using Manage Private Endpoint.&lt;/P&gt;
&lt;P&gt;In your case, the best course of action is to open a support ticket and have our support team take a closer look at it.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please reach out to our support team so they can do a more thorough investigation and can guide you better:&amp;nbsp;&lt;A href="https://support.fabric.microsoft.com/en-IN/support/" target="_blank" rel="noopener"&gt;Link&lt;/A&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;After creating a Support ticket please provide the ticket number as it would help us to track for more information.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope this helps. Please let us know if you have any other queries.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 07:46:27 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985322#M2508</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-06-11T07:46:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985474#M2509</link>
      <description>&lt;P&gt;Thank you&amp;nbsp;@Anonymous&amp;nbsp;for the infomration.&lt;BR /&gt;Below is the support ticket/tracking id&amp;nbsp;&lt;BR /&gt;&lt;SPAN&gt;TrackingID#2405230050000012&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 08:53:51 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985474#M2509</guid>
      <dc:creator>rashidanwar</dc:creator>
      <dc:date>2024-06-11T08:53:51Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985499#M2510</link>
      <description>&lt;P&gt;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/599628"&gt;@jwinchell40&lt;/a&gt;,&amp;nbsp;thank you for your message. I used the following code and I am able to access the list of the parquet files in my blob storage container.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;from&lt;/SPAN&gt;&lt;SPAN&gt; pyspark.sql.functions &lt;/SPAN&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; *&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;from&lt;/SPAN&gt;&lt;SPAN&gt; pyspark &lt;/SPAN&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; SparkContext&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;from&lt;/SPAN&gt;&lt;SPAN&gt; pyspark.sql &lt;/SPAN&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; SparkSession&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;from&lt;/SPAN&gt;&lt;SPAN&gt; notebookutils.mssparkutils.fs &lt;/SPAN&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; ls&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;blob_account_name = &lt;/SPAN&gt;&lt;SPAN&gt;"account_name"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;blob_container_name = &lt;/SPAN&gt;&lt;SPAN&gt;"container"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;blob_sas_token = &lt;/SPAN&gt;&lt;SPAN&gt;"sp=rl&amp;amp;.....&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;# Initialize Spark Session&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;spark = SparkSession.builder.appName(&lt;/SPAN&gt;&lt;SPAN&gt;"azure"&lt;/SPAN&gt;&lt;SPAN&gt;).getOrCreate()&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;# Set the configuration&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;spark.conf.set(&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;'spark.hadoop.fs.azure.sas.&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;blob_container_name&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;blob_account_name&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;.blob.core.windows.net'&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;blob_sas_token)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;# Build the base path for the container (without specific file path)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;base_path = &lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;'wasbs://&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;blob_container_name&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;@&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;blob_account_name&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;.blob.core.windows.net'&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;# Function to list all files in a directory recursively&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;def&lt;/SPAN&gt; &lt;SPAN&gt;list_all_files&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;directory&lt;/SPAN&gt;&lt;SPAN&gt;&lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; all_files = []&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; items = ls(directory)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;for&lt;/SPAN&gt;&lt;SPAN&gt; item &lt;/SPAN&gt;&lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; items:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;if&lt;/SPAN&gt;&lt;SPAN&gt; item.isDir:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; all_files.extend(list_all_files(item.path))&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;else&lt;/SPAN&gt;&lt;SPAN&gt;:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; all_files.append(item.path)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;return&lt;/SPAN&gt;&lt;SPAN&gt; all_files&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;main_directories = mssparkutils.fs.ls(base_path)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;# Initialize a list to hold all Parquet file paths&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;parquet_files = []&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;# Iterate through each main directory and list all files&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;for&lt;/SPAN&gt;&lt;SPAN&gt; main_dir &lt;/SPAN&gt;&lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; main_directories:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;if&lt;/SPAN&gt;&lt;SPAN&gt; main_dir.isDir:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; all_files = list_all_files(main_dir.path)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;# Filter Parquet files&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; parquet_files.extend([&lt;/SPAN&gt;&lt;SPAN&gt;file&lt;/SPAN&gt; &lt;SPAN&gt;for&lt;/SPAN&gt; &lt;SPAN&gt;file&lt;/SPAN&gt; &lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; all_files &lt;/SPAN&gt;&lt;SPAN&gt;if&lt;/SPAN&gt; &lt;SPAN&gt;file&lt;/SPAN&gt;&lt;SPAN&gt;.endswith(&lt;/SPAN&gt;&lt;SPAN&gt;'.parquet'&lt;/SPAN&gt;&lt;SPAN&gt;)])&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;# Print all Parquet file paths&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;for&lt;/SPAN&gt;&lt;SPAN&gt; parquet_file &lt;/SPAN&gt;&lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; parquet_files:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;&lt;SPAN&gt;(parquet_file)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Tue, 11 Jun 2024 09:03:00 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985499#M2510</guid>
      <dc:creator>rashidanwar</dc:creator>
      <dc:date>2024-06-11T09:03:00Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985699#M2513</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/320559"&gt;@rashidanwar&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;Glad to know your issue got resolved. Please continue using Fabric Community on your further queries.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 09:50:11 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985699#M2513</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-06-11T09:50:11Z</dc:date>
    </item>
    <item>
      <title>Re: How to get a list of the blob with full path stored in Azure Blob Storage Container</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985702#M2514</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/320559"&gt;@rashidanwar&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;Thanks for sharing the support ticket.&lt;/P&gt;
&lt;P&gt;Please allow some time, so team can check and provide a resolution.&lt;/P&gt;
&lt;P&gt;In case if you got a resolution please do share with the community as it can be helpful to others .&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jun 2024 09:50:45 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/How-to-get-a-list-of-the-blob-with-full-path-stored-in-Azure/m-p/3985702#M2514</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-06-11T09:50:45Z</dc:date>
    </item>
  </channel>
</rss>

