<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Spark SQL query data in one datalake and store transformed data in another data lake in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421957#M7428</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/248029"&gt;@SergioTorrinha&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In your notebook, you need access to the Bronze table. If both lakehouses share the same workspace, you can reference the table directly (for example, using its fully qualified name). Otherwise, create a shortcut in your Silver lakehouse that points to the Bronze table&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;-- Using fully qualified name (catalog.schema.table)&lt;BR /&gt;CREATE OR REPLACE TABLE silver_lh.tableA_silver AS&lt;BR /&gt;SELECT&lt;BR /&gt;Cola&lt;BR /&gt;FROM bronze_lh.tableA&lt;/P&gt;</description>
    <pubDate>Fri, 21 Feb 2025 13:02:30 GMT</pubDate>
    <dc:creator>nilendraFabric</dc:creator>
    <dc:date>2025-02-21T13:02:30Z</dc:date>
    <item>
      <title>Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421816#M7424</link>
      <description>&lt;P&gt;Hi everyone!&lt;BR /&gt;&lt;BR /&gt;Hope the question I bring today has a simple solution, as i'm trying to figure out the best way of realizing data transformations accross medallion architecture layers.&lt;BR /&gt;&lt;BR /&gt;So, supose you have 2 lakehouses, resembling medallion architecture:&lt;BR /&gt;&lt;BR /&gt;bronze_lh&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; tableA&lt;BR /&gt;&lt;BR /&gt;silver_lh&lt;BR /&gt;&lt;BR /&gt;Question here is:&lt;BR /&gt;Using a notebook with &lt;STRONG&gt;Spark SQL&lt;/STRONG&gt;, how would you write a spark sql query that would fetch data from tableA, in bronze_lh, transform it and then store the resulting data in a tableA_silver in the silver_lh ?&lt;BR /&gt;I know this is possible with PySpark, it's what i've been using so far. But how is it correctly done using SparkSQL?&lt;BR /&gt;&lt;BR /&gt;The reason I ask is because I'm far more confortable using SQL than Python, especially for all data transformations not involving pivoting the data, but I feel Spark SQL lacks the capabilities of doing operations like the ones I'm describing here. Please prove i'm wrong &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;Furthermore, using wharehouse and pure sql notebooks is not an option, I beleive, as the data is composed by large millions of records with a fair amount of columns/data fields, and I'm unsure how well would SQL perform in this scenario.&lt;BR /&gt;&lt;BR /&gt;If the question is somewhat ambiguous, please feel free to ask for specific details. I'm glad to provide them as far they don't go off to what's reasonably possible to share.&lt;BR /&gt;Apretiate in advance your feedback.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Feb 2025 11:10:32 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421816#M7424</guid>
      <dc:creator>SergioTorrinha</dc:creator>
      <dc:date>2025-02-21T11:10:32Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421912#M7426</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/248029"&gt;@SergioTorrinha&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Assume you’ve created a shortcut named&amp;nbsp;&lt;/SPAN&gt;bronze_tableA&lt;SPAN&gt;&amp;nbsp;in your Silver workspace that points to the original&amp;nbsp;&lt;/SPAN&gt;tableA&lt;SPAN&gt;&amp;nbsp;in Bronze. Then your notebook cell could look like this&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;%%sql&lt;BR /&gt;CREATE OR REPLACE TABLE tableA_silver AS&lt;BR /&gt;SELECT&lt;BR /&gt;col1,&lt;BR /&gt;col2,&lt;BR /&gt;col4,&lt;BR /&gt;col5&lt;BR /&gt;FROM bronze_tableA&lt;/P&gt;</description>
      <pubDate>Fri, 21 Feb 2025 12:29:03 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421912#M7426</guid>
      <dc:creator>nilendraFabric</dc:creator>
      <dc:date>2025-02-21T12:29:03Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421943#M7427</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/913195"&gt;@nilendraFabric&lt;/a&gt; !&lt;BR /&gt;&lt;BR /&gt;Thanks for you'r input.&lt;BR /&gt;Do I really need to create shortcuts, for this kind of task, even if these lakehouses are within the same tenant/domain/workspace?&lt;BR /&gt;Imagine I have multiple tables in bronze_lh, for which I would like to do the same sort of operations. Would I need to create those shortcuts in the silver_lh? How is one automating such task?&lt;BR /&gt;&lt;BR /&gt;Thank you.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Feb 2025 13:02:39 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421943#M7427</guid>
      <dc:creator>SergioTorrinha</dc:creator>
      <dc:date>2025-02-21T13:02:39Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421957#M7428</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/248029"&gt;@SergioTorrinha&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In your notebook, you need access to the Bronze table. If both lakehouses share the same workspace, you can reference the table directly (for example, using its fully qualified name). Otherwise, create a shortcut in your Silver lakehouse that points to the Bronze table&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;-- Using fully qualified name (catalog.schema.table)&lt;BR /&gt;CREATE OR REPLACE TABLE silver_lh.tableA_silver AS&lt;BR /&gt;SELECT&lt;BR /&gt;Cola&lt;BR /&gt;FROM bronze_lh.tableA&lt;/P&gt;</description>
      <pubDate>Fri, 21 Feb 2025 13:02:30 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421957#M7428</guid>
      <dc:creator>nilendraFabric</dc:creator>
      <dc:date>2025-02-21T13:02:30Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421960#M7429</link>
      <description>&lt;P&gt;Ok, now this is making more sense, I'll have to try this one out.&lt;BR /&gt;&lt;BR /&gt;Thank you for your help!&lt;/P&gt;</description>
      <pubDate>Fri, 21 Feb 2025 13:05:08 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4421960#M7429</guid>
      <dc:creator>SergioTorrinha</dc:creator>
      <dc:date>2025-02-21T13:05:08Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4587531#M7553</link>
      <description>&lt;P&gt;Hi again&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/913195"&gt;@nilendraFabric&lt;/a&gt;&amp;nbsp;!&lt;BR /&gt;&lt;BR /&gt;Sorry to pull this topic again, but after testing I didn't quite got the results I was expecting after your input. I somewhat feel this has a really simple solution, but aparently I cant get there alone.&lt;BR /&gt;&lt;BR /&gt;Below code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;%%sql

CREATE OR REPLACE TABLE lh_silver.dbo.test_table_pls_drop_me AS
SELECT
    *
FROM dbo.bronze_table
LIMIT 10&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Throws the following error message:&lt;BR /&gt;&lt;SPAN&gt;[REQUIRES_SINGLE_PART_NAMESPACE] spark_catalog requires a single-part namespace, but got `lh_silver`.`dbo`.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;Knowing that:&lt;/P&gt;&lt;P&gt;- the notebook I am running this code is associated to lh_bronze&lt;/P&gt;&lt;P&gt;- both lh_bronze and lh_silver are located in the same workspace&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;what am I doing wrong?&lt;BR /&gt;Apretiate your help in advance.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 27 Feb 2025 09:51:50 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4587531#M7553</guid>
      <dc:creator>SergioTorrinha</dc:creator>
      <dc:date>2025-02-27T09:51:50Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4589790#M7597</link>
      <description>&lt;P&gt;Hi everyone!&lt;BR /&gt;&lt;BR /&gt;can someone from the support team, help with this one, please?&lt;BR /&gt;Much apretiated.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 28 Feb 2025 10:23:55 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4589790#M7597</guid>
      <dc:creator>SergioTorrinha</dc:creator>
      <dc:date>2025-02-28T10:23:55Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4592551#M7641</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Hi everyone!&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I keep facing this issue, can someone help with this one, please?&lt;/SPAN&gt;&lt;BR /&gt;Thank you.&lt;/P&gt;</description>
      <pubDate>Mon, 03 Mar 2025 09:10:01 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4592551#M7641</guid>
      <dc:creator>SergioTorrinha</dc:creator>
      <dc:date>2025-03-03T09:10:01Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4604024#M7817</link>
      <description>&lt;P&gt;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/248029"&gt;@SergioTorrinha&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;SPAN&gt;As we haven’t heard back from you, we wanted to kindly follow up to check if the solution provided for your issue worked? or let us know if you need any further assistance here?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/913195"&gt;@nilendraFabric&lt;/a&gt;, thanks for your promt repsonse&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Prashanth Are&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;MS Fabric community support&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;If this post helps, then please consider&amp;nbsp;&lt;STRONG&gt;Accept it as the solution&lt;/STRONG&gt;&amp;nbsp;to help the other members find it more quickly and give&amp;nbsp;&lt;STRONG&gt;Kudos&lt;/STRONG&gt;&amp;nbsp;if helped you resolve your query&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Mar 2025 21:39:58 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4604024#M7817</guid>
      <dc:creator>v-prasare</dc:creator>
      <dc:date>2025-03-10T21:39:58Z</dc:date>
    </item>
    <item>
      <title>Re: Spark SQL query data in one datalake and store transformed data in another data lake</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4612464#M7976</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/248029"&gt;@SergioTorrinha&lt;/a&gt;&amp;nbsp;, Hope your doing well.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In this scenario i suggest you to raise a support ticket here. so, that they can assit you in addressing the issue you are facing. please follow below link on how to raise a support ticket:&lt;/P&gt;
&lt;P&gt;&lt;A class="" title="https://learn.microsoft.com/en-us/power-bi/support/create-support-ticket" href="https://learn.microsoft.com/en-us/power-bi/support/create-support-ticket" target="_blank" rel="noreferrer noopener" aria-label="Link How to create a Fabric and Power BI Support ticket - Power BI | Microsoft Learn"&gt;How to create a Fabric and Power BI Support ticket - Power BI | Microsoft Learn&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Prashanth Are&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;MS Fabric community support&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;If this post helps, then please consider&amp;nbsp;&lt;STRONG&gt;Accept it as the solution&amp;nbsp;to help the other members find it more quickly and give&amp;nbsp;&lt;STRONG&gt;Kudos&amp;nbsp;if helped you resolve your query&lt;/STRONG&gt;&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;LI-WRAPPER&gt;&lt;/LI-WRAPPER&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 17 Mar 2025 09:09:50 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Spark-SQL-query-data-in-one-datalake-and-store-transformed-data/m-p/4612464#M7976</guid>
      <dc:creator>v-prasare</dc:creator>
      <dc:date>2025-03-17T09:09:50Z</dc:date>
    </item>
  </channel>
</rss>

