Hello,
I'm writing a Fabric notebook to load a Parquet file into a Fabric Warehouse.
I want this in PySpark only, as the notebook is written entirely in PySpark. Please advise on a fast and efficient way to load the data from the lakehouse into the Warehouse.
My requirement is: if the table exists, I want to upsert; if not, I want to create the table and insert the data.
Please kindly share code.
Hi @westf,
Perhaps you can take a look at the following link on using a notebook to load data into a data warehouse:
Load data to MS Fabric Warehouse from notebook - Stack Overflow
Regards,
Xiaoxin Sheng
Note: you can use Spark SQL for upsert
What I meant is that I am not sure myself whether an upsert is doable via PySpark.
Below is the sample code that we use:

# Load the Parquet file into a Spark DataFrame
df = spark.read.parquet("path/to/your/parquet/file")

# Write the DataFrame to the Fabric Data Warehouse
df.write.mode("overwrite").saveAsTable("your_table_name")
Based on what I know, you can either append to or overwrite a table directly.
I am not sure about upsert; I need to validate that myself.
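As a starting point for the upsert part, here is a minimal sketch of how the Spark SQL `MERGE INTO` statement (supported for Delta tables in the lakehouse) could be generated and applied from a notebook. The helper function is pure Python; the table name `your_table_name`, the view name `staging`, and the columns `id` and `value` are illustrative placeholders, not names from this thread, and the notebook usage at the bottom assumes an available `spark` session.

```python
def build_merge_sql(target, source_view, key_cols, update_cols):
    """Build a MERGE INTO statement that updates matching rows in the
    target table and inserts rows that do not match. All table, view,
    and column names are caller-supplied placeholders."""
    # Join condition on the key columns, e.g. "t.id = s.id"
    on_clause = " AND ".join(f"t.{c} = s.{c}" for c in key_cols)
    # SET list for rows that already exist in the target
    set_clause = ", ".join(f"t.{c} = s.{c}" for c in update_cols)
    # Column and value lists for rows that are new to the target
    all_cols = list(key_cols) + list(update_cols)
    insert_cols = ", ".join(all_cols)
    insert_vals = ", ".join(f"s.{c}" for c in all_cols)
    return (
        f"MERGE INTO {target} t USING {source_view} s ON {on_clause} "
        f"WHEN MATCHED THEN UPDATE SET {set_clause} "
        f"WHEN NOT MATCHED THEN INSERT ({insert_cols}) VALUES ({insert_vals})"
    )

# Hypothetical notebook usage (requires a running Spark session):
# df = spark.read.parquet("path/to/your/parquet/file")
# if spark.catalog.tableExists("your_table_name"):
#     df.createOrReplaceTempView("staging")
#     spark.sql(build_merge_sql("your_table_name", "staging",
#                               ["id"], ["value"]))
# else:
#     df.write.saveAsTable("your_table_name")
```

Generating the statement as a string keeps the "create or upsert" branching in plain PySpark, matching the original requirement; whether `MERGE` is the right tool still depends on the target being a Delta table rather than the Warehouse SQL endpoint.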