Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Enhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.

Reply
Ostrzak
Helper II
Helper II

Overwriting a csv using Spark notebook creates artifacts

Hi everyone,

 

It might be a bit stupid of a question, but is there a way to overwrite csv files in a lakehouse (using Pyspark notebook) without creation of additional folders/artifacts?

 

Right now when I use:

df.write.mode("overwrite").csv("file_path")
 
A new subfolder is created (named as original file), within it SUCCESS artifact and a .csv file with a hashed name. 

Ostrzak_0-1698687647912.png

I can live with it, but it would be nice if it could just overwrite a file and leave it in the same destination. 

Thank you in advance for any feedback.

1 ACCEPTED SOLUTION
AndyDDC
Super User
Super User

Hi @Ostrzak what is the value in "file_path"?  If I specify a Files folder to save the CSV to, it replaces the current CSV with a new version - changing the CSV filename in the process.  But the old one has been removed.

 

AndyDDC_0-1698694337789.png

 

 

AndyDDC_1-1698694354260.png

 

View solution in original post

4 REPLIES 4
AndyDDC
Super User
Super User

Hi @Ostrzak what is the value in "file_path"?  If I specify a Files folder to save the CSV to, it replaces the current CSV with a new version - changing the CSV filename in the process.  But the old one has been removed.

 

AndyDDC_0-1698694337789.png

 

 

AndyDDC_1-1698694354260.png

 

Hi @AndyDDC 

 

Thank you for answering.

I had it saved directly to the lakehouse Files, without any subfolder. When I overwrite it, it lands in a subfolder that is named as the file before, while inside there are  two entities:

- csv file with hashed name

- SUCCESS artifact

I see from your example that it works fine after it creates the aforementioned structure. That is useful knowledge. I guess I have to get accustomed to this structure, at the end of a day it is still human-readable.

Anonymous
Not applicable

Hi @Ostrzak ,

We haven’t heard from you on the last response and was just checking back to see if you have a resolution yet. Otherwise, will respond back with the more details and we will try to help .

Yes it's advisable to have sub-folders when writing, as there could be overwrite issues.

If my reply has been helpful please consider marking it as the solution.

Glad it's sorted now

Helpful resources

Announcements
Join our Fabric User Panel

Join our Fabric User Panel

This is your chance to engage directly with the engineering team behind Fabric and Power BI. Share your experiences and shape the future.

June FBC25 Carousel

Fabric Monthly Update - June 2025

Check out the June 2025 Fabric update to learn about new features.

June 2025 community update carousel

Fabric Community Update - June 2025

Find out what's new and trending in the Fabric community.

Top Solution Authors