This time we’re going bigger than ever. Fabric, Power BI, SQL, AI and more. We're covering it all. You won't want to miss it.
Learn moreDid you hear? There's a new SQL AI Developer certification (DP-800). Start preparing now and be one of the first to get certified. Register now
OpenLineage lets you trace a large number of ETL jobs that mutates tables, letting you trace changes to tables significantly easier.
Databricks Unity Catalog has support for Column Level Lineage and it's highly useful:
View data lineage using Unity Catalog - Azure Databricks | Microsoft Learn
Databricks Demo
AWS also supports OpenLineage:
Amazon DataZone introduces OpenLineage-compatible data lineage visualization in preview | AWS Big Da...
It's actually quite easy to get OpenLineage working in Fabric Spark since you already have the JAR pre-installed:
All we need is a way to store and query historical state (e.g. in Delta Lake), and a UI component that hooks into the Lakehouse catalog and offers an API on top of OpenLineage schema.
The OneLake Catalog UI could look like this:
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.