Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Fabric Data Days Monthly is back. Join us on March 26th for two expert-led sessions on 1) Getting Started with Fabric IQ and 2) Mapping & Spacial Analytics in Fabric. Register now

stronger metadata & lineage support

When working with data that needs to be shared, it is of utmost importance to understand what the semantic definition of it is (i.e., the proper usage of it) and also where it came from (both sources and transformations). This would be published as a data dictionary with lineage information. There are many other aspects to meta- and master-data, and this is one of the biggest barriers to PowerBI becoming adopted as an enterprise-level tool.
Status: Completed
Comments
jeff_gentry
New Member
Bump. Ability to extract metadata for a tool like Adaptive, MetaCenter, or Rochade is a showstopper in some environments. I'm also interested in the plans in this area. It's supported by other Microsoft platforms, like SSRS.
fbcideas_migusr
New Member
This would be a great step, but besides that. What about your business rules / transformations that you apply to your data? When creating an new document there is no way to re-use those transformations. Ideal would be if you could store the result of a query/transformation and use that in multiple PIBX files. (Also more efficient to your source db-system)
fbcideas_migusr
New Member
i believe this is being solved by Data Catalog... see this PBI Idea: https://ideas.powerbi.com/forums/265200-power-bi-ideas/suggestions/6864081-data-catalog-integration-in-powerbi-designer
Elaine_Brown1
New Member
Fundamental to users experiencing the data in the best way and helping them know and trust what they are using.
chris_ward2
New Member
Powerbi API needs much stronger lineage capability. Currently it is only possible to find, for a report, the server that the dataset is using. It's not a preety process to get.... So that means, we can get server level lineage. Not database, not table and certainly not column lineage. This is very poor. This means that anytime we do an impact analysis, we have to open all the datasets to see if the table/column etc is used. At minimum, the SQL executed should be exposed (still means we'd need to parse the sql but its a step closer). What would be idea is knowing the sql fields and the mapping to other queries measures etc. (ie column level lineage to the dataset). If we could then also get what measures are used in the report/dashboard etc we'd have relatively full tracability. Been able for 3rd party tools such as Informatica EDC / Alation to be able to be able to download this metadata is vital. Impact analysis is just 1 use case. Without column lineage, how do you know which reports you need to know have been view/printed/download in the case of GDR/CCP/breach etc (if you know the columns with identifiable info, you should be able to trace to where its used.)
daragh
New Member
thanks @billanton
Anonymous
Not applicable

There is the lineage view in which the lineage information is avaialbe - Introducing Power BI lineage view | Microsoft Power BI Blog | Microsoft Power BI

The metadata is exposed through the Scanner API or in Purview - Announcing Scanner API (Admin REST APIs) enhancements to include dataset tables, columns, measures, ..., , Use Power BI with Azure Purview to achieve better data governance and discovery | Microsoft Power BI...

If there is a need to expose more granualr information in one or all of these capabilities please do open a new idea/s with the specific asks so we address them

fbcideas_migusr
New Member
Status changed to: Completed