Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Be one of the first to start using Fabric Databases. View on-demand sessions with database experts and the Microsoft product team to learn just how easy it is to get started. Watch now

Reply
kely
Frequent Visitor

cannot write to lakehouse with schema enabled 3 part naming

I am using a pyspark notebook, reading data from one lakehouse (not schema enabled) to another lakehouse (schema enabled) in the same workspace... however:

 

it ONLY works if I use: df.write.format("delta").mode("overwrite").option("mergeSchema", "true").save("abfss://abc123@onelake.dfs.fabric.microsoft.com/def456/Tables/dim/testnewtable")

 

when I should be able to use: df.write.format("delta").mode("overwrite").option("mergeSchema", "true").saveAsTable("testingSchemas.dim.testnewtable").  

 

This seems like a bug to me, as non schema-enabled lakehouses work like my 2nd example just fine.

1 ACCEPTED SOLUTION

Hi @kely 

 

My previous reply is based on my testing results, without any documentation supporting. So I'm not sure whether it's expected behavior, or it's a limitation, or it might be improved in the future. 

 

If your notebook is currently using a 'bronze' lakehouse as the default lakehouse, using saveAsTable with 'goldlakehouse.schema.table' is probably not valid. Usually switching the default lakehouse of a running notebook will need to terminate the session first. If I understand it correctly, one session can have only one default lakehouse. So at least one lakehouse needs to use the ABFS path in the notebook. 

 

As of now, I recommend using ABFS paths and save function to write the dataframe in your case. This is more reliable. 

 

Best Regards,
Jing

 

View solution in original post

5 REPLIES 5
v-jingzhan-msft
Community Support
Community Support

Hi @kely 

 

My testing result is somewhat different. When I attach a schema-enabled lakehouse to the notebook and use saveAsTable function to save a dataframe to the default lakehouse, it works fine with both 3 part naming and 2 part naming. vjingzhanmsft_0-1731482467528.png

However when the destination schema-enabled lakehouse is not attached as the default lakehouse of the notebook, it failed with an error "AnalysisException: Couldn't find a catalog to handle the identifier LHschema.raw.newTableOne."

 

It seems when using saveAsTable, it always tries to look for the schema and table in the default lakehouse. With 3 part naming, saveAsTable treats the first part as a schema within the default lakehouse. However actually it should be the name of another lakehouse. This leads to the failure. 

 

Best Regards,
Jing
Community Support Team

thx @v-jingzhan-msft !

 

So if I am in a notebook connected to a 'bronze' lakehouse, and I am writing to a 'gold' lakehouse that is schema enabled... is this not a valid use case? 

i.e.  When I write my gold table i would use goldlakehouse.schema.table?  

Hi @kely 

 

My previous reply is based on my testing results, without any documentation supporting. So I'm not sure whether it's expected behavior, or it's a limitation, or it might be improved in the future. 

 

If your notebook is currently using a 'bronze' lakehouse as the default lakehouse, using saveAsTable with 'goldlakehouse.schema.table' is probably not valid. Usually switching the default lakehouse of a running notebook will need to terminate the session first. If I understand it correctly, one session can have only one default lakehouse. So at least one lakehouse needs to use the ABFS path in the notebook. 

 

As of now, I recommend using ABFS paths and save function to write the dataframe in your case. This is more reliable. 

 

Best Regards,
Jing

 

frithjof_v
Community Champion
Community Champion

@kely thanks for sharing your workaround.

 

I guess in the future we will all be using schema enabled Lakehouses, once they turn GA.

 

But we will have many existing Lakehouses that are not schema enabled.

 

So we will need to be able to work with both of them at the same time, like you're wanting to do.

 

I hope this will be supported out-of-the box across schema enabled Lakehouses and non-schema enabled Lakehouses in the future.

AndyDDC
Most Valuable Professional
Most Valuable Professional

This could be a public preview limitation. There's a few items listed in the limitations section talking about part naming conventions, it may be that this particular issue isn't documented.

 

https://learn.microsoft.com/en-us/fabric/data-engineering/lakehouse-schemas#public-preview-limitatio...

 

It may be useful to create an Idea to make sure this functionality is being looked at

Helpful resources

Announcements
ArunFabCon

Microsoft Fabric Community Conference 2025

Arun Ulag shares exciting details about the Microsoft Fabric Conference 2025, which will be held in Las Vegas, NV.

December 2024

A Year in Review - December 2024

Find out what content was popular in the Fabric community during 2024.