Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Enhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.

Reply
sawilliams
Frequent Visitor

Does Fabric duplicate data when creating a table from a Parquet file?

Hi Community,

I'm working on a POC for a simple data ingestion pipeline in Microsoft Fabric. My process is as follows:

  1. Read a CSV from a shortcut to ADLS2
  2. Convert it to Delta Parquet
  3. Save it in the FILES path of the Lakehouse
  4. Create a table from the stored Parquet in the Fabric portal

However, after creating the table, I noticed that the Parquet files also appear in the TABLES path, even though I initially saved them in FILES. Is Fabric duplicating the Parquet files when creating the table?

 

Would it be better to save the Parquet files directly in the TABLES path instead of FILES to avoid redundancy?

1 ACCEPTED SOLUTION
v-shamiliv
Community Support
Community Support

Hi @sawilliams 
Thank you for reaching out microsoft fabric community forum.

Yes, Microsoft Fabric does duplicate the data when creating a table from a Parquet file. This happens because the data is initially stored in the FILES path and then copied to the TABLES path when creating the table.

To avoid redundancy, it would indeed be better to save the Parquet files directly in the TABLES path from the start. This way, you won't have duplicate copies of your data

If you need any further assistance or have any questions, please feel free to reach  us.

If this solution helps, please consider giving us Kudos and accepting it as the solution so that it may assist other members in the community.

Thank you.

 

View solution in original post

1 REPLY 1
v-shamiliv
Community Support
Community Support

Hi @sawilliams 
Thank you for reaching out microsoft fabric community forum.

Yes, Microsoft Fabric does duplicate the data when creating a table from a Parquet file. This happens because the data is initially stored in the FILES path and then copied to the TABLES path when creating the table.

To avoid redundancy, it would indeed be better to save the Parquet files directly in the TABLES path from the start. This way, you won't have duplicate copies of your data

If you need any further assistance or have any questions, please feel free to reach  us.

If this solution helps, please consider giving us Kudos and accepting it as the solution so that it may assist other members in the community.

Thank you.

 

Helpful resources

Announcements
July 2025 community update carousel

Fabric Community Update - July 2025

Find out what's new and trending in the Fabric community.

June FBC25 Carousel

Fabric Monthly Update - June 2025

Check out the June 2025 Fabric update to learn about new features.