<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Adding custom .zip or .egg reference files to PySpark jobs in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3566095#M978</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/641130"&gt;@gmangiante&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;Thanks for using Fabric Community.&lt;BR /&gt;&lt;BR /&gt;We are reaching out to the internal team for more information about your query and will get back to you as soon as we have an update.&lt;/P&gt;</description>
    <pubDate>Mon, 04 Dec 2023 12:33:52 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2023-12-04T12:33:52Z</dc:date>
    <item>
      <title>Adding custom .zip or .egg reference files to PySpark jobs</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3565229#M977</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I'm working on a fairly complex Spark ETL job definition that requires a number of additional custom Python files alongside the main file in order to run. I'm currently using the "reference files" feature of Spark jobs in Fabric, but I'm finding it difficult to maintain the job definitions when I have to add 10 ABFSS URLs to the "reference files" area, each one pointing to an individual Python file.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;With regular Spark, I would use the --py-files flag on spark-submit to add a .zip or .egg package containing the files, but it seems I can only add URLs to .py files within the Fabric job definition UI. I know I could create a custom .whl and include it in my Spark environment, but during iterative development and CI/CD, it's much easier to automate creating a .zip file and replacing it via ABFSS than to figure out how to attach a new .whl to the environment and republish (I'm not even sure REST APIs exist for those operations).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Am I missing an obvious solution here? Any guidance would be appreciated. If I'm not missing anything, I guess this is my vote for allowing .zip/.egg files in the "reference files" area of Fabric Spark job definitions. Thanks so much!&lt;/P&gt;</description>
      <pubDate>Mon, 04 Dec 2023 02:16:36 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3565229#M977</guid>
      <dc:creator>gmangiante</dc:creator>
      <dc:date>2023-12-04T02:16:36Z</dc:date>
    </item>
    <item>
      <title>Re: Adding custom .zip or .egg reference files to PySpark jobs</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3566095#M978</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/641130"&gt;@gmangiante&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;Thanks for using Fabric Community.&lt;BR /&gt;&lt;BR /&gt;We are reaching out to the internal team for more information about your query and will get back to you as soon as we have an update.&lt;/P&gt;</description>
      <pubDate>Mon, 04 Dec 2023 12:33:52 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3566095#M978</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-12-04T12:33:52Z</dc:date>
    </item>
    <item>
      <title>Re: Adding custom .zip or .egg reference files to PySpark jobs</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3568064#M979</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/641130"&gt;@gmangiante&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;Currently this is not supported. The team has taken this as feedback.&lt;BR /&gt;&lt;BR /&gt;We would appreciate it if you could also share it on our feedback channel, which is open for the user community to upvote and comment on. This allows our product teams to prioritize your request effectively against the existing feature backlog and gives insight into the potential impact of implementing the suggested feature.&lt;BR /&gt;&lt;BR /&gt;Hope this helps. Please let me know if you have any further queries.&lt;/P&gt;</description>
      <pubDate>Tue, 05 Dec 2023 09:32:47 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3568064#M979</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-12-05T09:32:47Z</dc:date>
    </item>
    <item>
      <title>Re: Adding custom .zip or .egg reference files to PySpark jobs</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3569094#M980</link>
      <description>&lt;P&gt;Appreciate you all taking a look, and will do! Thanks.&lt;/P&gt;</description>
      <pubDate>Tue, 05 Dec 2023 18:20:24 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Adding-custom-zip-or-egg-reference-files-to-PySpark-jobs/m-p/3569094#M980</guid>
      <dc:creator>gmangiante</dc:creator>
      <dc:date>2023-12-05T18:20:24Z</dc:date>
    </item>
  </channel>
</rss>

