March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early bird discount ends December 31.
Register NowBe one of the first to start using Fabric Databases. View on-demand sessions with database experts and the Microsoft product team to learn just how easy it is to get started. Watch now
Hi, i'm using Fabric.
And i got a simple question.
What is the difference between V-Order and Z-Order?
I know V-Order is a write time optimization to the parquet file format that enables lightning-fast reads under the Microsoft Fabric compute engines, such as Power BI, SQL, Spark, and others.(as MS Docs said)
But i can't find any Docs of Z-Order, and even i can't understood well after reading V-Order & Z-Order disc.
Please help this newbie, i will really appricate all your comments.
Thanks a lot.
Park.
Solved! Go to Solution.
Hi @sjpark
Thanks for using Fabric Community.
V-Order:
V-Order is a write-time optimization specifically designed for the Parquet file format within the Microsoft Fabric ecosystem. Its primary goal is to enhance read performance under various compute engines, including Power BI, SQL, and Spark.
Z-Order:
Z-Order is another optimization technique, but it’s not specific to Fabric; it’s widely used in data lakes and analytics platforms. Z-Order aims to improve query performance by co-locating related information in the same set of files.Key Differences:
For more information please refer to these links:
https://www.linkedin.com/posts/lucazanna_data-microsoft-fabric-activity-7068093014677540864-hNm0/?or...
https://docs.delta.io/latest/optimizations-oss.html#language-sql
https://learn.microsoft.com/en-us/fabric/data-engineering/delta-optimization-and-v-order?tabs=sparks...
https://www.dremio.com/blog/how-z-ordering-in-apache-iceberg-helps-improve-performance/
Data skipping for Delta Lake - Azure Databricks | Microsoft Learn
Hope this helps. Please let me know if you have any further questions. Glad to help.
Hi @sjpark
Thanks for using Fabric Community.
V-Order:
V-Order is a write-time optimization specifically designed for the Parquet file format within the Microsoft Fabric ecosystem. Its primary goal is to enhance read performance under various compute engines, including Power BI, SQL, and Spark.
Z-Order:
Z-Order is another optimization technique, but it’s not specific to Fabric; it’s widely used in data lakes and analytics platforms. Z-Order aims to improve query performance by co-locating related information in the same set of files.Key Differences:
For more information please refer to these links:
https://www.linkedin.com/posts/lucazanna_data-microsoft-fabric-activity-7068093014677540864-hNm0/?or...
https://docs.delta.io/latest/optimizations-oss.html#language-sql
https://learn.microsoft.com/en-us/fabric/data-engineering/delta-optimization-and-v-order?tabs=sparks...
https://www.dremio.com/blog/how-z-ordering-in-apache-iceberg-helps-improve-performance/
Data skipping for Delta Lake - Azure Databricks | Microsoft Learn
Hope this helps. Please let me know if you have any further questions. Glad to help.
Hello @v-nikhilan-msft
Thanks a lot for your quick and detail reply.
It helps a lot to me.
But i got a 1 more question.
Can i know how V-Order works? like special sorting techniques of V-Order or else.
If it's a confidential of MS, i won't ask any further.
Thanks.
Park.
Hi @sjpark
I have shared all the publicly available information about V-Order's functionality.
Here's a quick recap: V-Order sorts data within Parquet files based on specific columns, strategically distributes rows across groups, and leverages dictionary encoding for compression. These techniques combined optimize Parquet files for faster reads and storage efficiency.
Hope this helps. Please let me know if you have any further questions.
Hi @sjpark
Glad that your query got resolved. Please continue using Fabric Community for any help regarding your queries.
All the best for you too! Have a great day.
User | Count |
---|---|
8 | |
6 | |
5 | |
2 | |
1 |
User | Count |
---|---|
15 | |
10 | |
5 | |
4 | |
4 |