Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Score big with last-minute savings on the final tickets to FabCon Vienna. Secure your discount

Reply
Anonymous
Not applicable

Visualize data lineage

Hi there!

 

I am trying to visualize the data lineage of certain attributes. The goal of the assignment is to determine what part of the data lineage still needs to be done. This is the case:

 

I have four different tables all related to each other that represent the entire data lineage of the attributes in table 1. For example:

 

Table 1 gives an overview of all attributes that I need the data lineage from. In this example I picked one of the attributes (Value of Product).

Table 2 gives me all attributes from which the value in table 1 retrieves its data. For example: (table1[Value of Product]) gets the data from the combination of (table2[attribute100]), (table2[attribute101]), (table2[attribute102]), etc.

Table 3 does the same as table 2 but for the attributes of table 2. For example: (table2[attribute100]) gets the data from (table3[attribute_x]), (table3[attribute_y]). However, it is also possible that there are blanks. This means that there is no known data lineage for the attribute in table 2 (! this means there is no data, there are no blanks in the original table).

Table 4 (source system) does the same as table 2 and 3 but it refers to the system level. For example: (table3[attribute_x]) retrieves its data from both (table4[System_1]) and (table4[System_2]).

 

All with all, the data looks like this (below):

 

Now I want to know how to visualize certain things:

 

  1. How to automatically visualize this data into the tree I now made myself.
  2. The count of the attributes in table2 that do not have a value in table3 (in this example the count should be 6)
  3. The count of the attributes in table2 that do have a value in table 3 (in this example the count should be 6)
  4. The count of the distinct systems in table4 (in this example the count should be 2), note: when I use the distinctcount option it gives me the count of all the systems in table4 without taking the filter (table1=value of product) into consideration.

I hope someone can help 🙂

 

Table 1 (attribute level)

Table 2 (attribute level)

Table 3 (attribute level)

Table 4 (source system level)

Value of product

Attribute100

Attribute_x

System_1

Value of product

Attribute100

Attribute_x

System_2

Value of product

Attribute100

Attribute_y

System_2

Value of product

Attribute101

 

 

Value of product

Attribute102

Attribute_z

System_1

Value of product

Attribute103

 

 

Value of product

Attribute104

 

 

Value of product

Attribute105

Attribute_x

System_1

Value of product

Attribute105

Attribute_x

System_2

Value of product

Attribute105

Attribute_y

System_2

Value of product

Attribute106

 

 

Value of product

Attribute107

 

 

Value of product

Attribute108

Attribute_x

System_1

Value of product

Attribute108

Attribute_x

System_2

Value of product

Attribute108

Attribute_y

System_2

Value of product

Attribute109

Attribute_z

System_1

Value of product

Attribute110

Attribute_x

System_1

Value of product

Attribute110

Attribute_x

System_2

Value of product

Attribute111

 
 

 

D64A099B-1941-4898-A3C6-BDF2649E0429.png

1 ACCEPTED SOLUTION
v-frfei-msft
Community Support
Community Support

Hi @Anonymous ,

 

Based on my research, There is no such kind of visual in power bi currently. I suggest you to use decomposition tree to work around.

The decomposition tree visual in Power BI lets you visualize data across multiple dimensions. It automatically aggregates data and enables drilling down into your dimensions in any order.

 

2.PNG

 

Community Support Team _ Frank
If this post helps, then please consider Accept it as the solution to help the others find it more quickly.

View solution in original post

2 REPLIES 2
Anonymous
Not applicable

You can try Sankey diagram to show visual data linage, I think it is the closest match for your requirement.

To use sankey you have to transform your data in source and destination columns(or nodes) like shown below, and add a dummy Weigh column.

To transform your data simply follow this logic : All the nodes which are acting as a source for other nodes will be listed in source column and all the nodes which are acting as a destination for other nodes will be listed in Destination column.

(e.g. in your data Colum Table 1 (attribute level) is source for column Table 2(attribute level) and column Table 2 is source for Table 3 so on and so forth. So Table1 (attribute level)  and Table2 (attribute level) will appear in source column list and Table2 (attribute level)  and Table3 (attribute level) will appear in destination column list)

Source

DestinationWeigh
Value of productAttribute1001
Value of productAttribute1001
Value of productAttribute1001
Value of productAttribute1011
Value of productAttribute1021
Value of productAttribute1031
Value of productAttribute1041
Value of productAttribute1051
Value of productAttribute1051
Value of productAttribute1051
Value of productAttribute1061
Value of productAttribute1071
Value of productAttribute1081
Value of productAttribute1081
Value of productAttribute1081
Value of productAttribute1091
Value of productAttribute1101
Value of productAttribute1101
Value of productAttribute1111
Attribute100Attribute_x1
Attribute100Attribute_x1
Attribute100Attribute_y1
Attribute102Attribute_z1
Attribute105Attribute_x1
Attribute105Attribute_x1
Attribute105Attribute_y1
Attribute108Attribute_x1
Attribute108Attribute_x1
Attribute108Attribute_y1
Attribute109Attribute_z1
Attribute110Attribute_x1
Attribute110Attribute_x1
Attribute_xSystem_11
Attribute_xSystem_21
Attribute_ySystem_21
Attribute_zSystem_11
Attribute_xSystem_11
Attribute_xSystem_21
Attribute_ySystem_21
Attribute_xSystem_11
Attribute_xSystem_21
Attribute_ySystem_21
Attribute_zSystem_11
Attribute_xSystem_11
Attribute_xSystem_21

 

The output of Sankey diagram is shown below:

Sankey.JPG

 

 

v-frfei-msft
Community Support
Community Support

Hi @Anonymous ,

 

Based on my research, There is no such kind of visual in power bi currently. I suggest you to use decomposition tree to work around.

The decomposition tree visual in Power BI lets you visualize data across multiple dimensions. It automatically aggregates data and enables drilling down into your dimensions in any order.

 

2.PNG

 

Community Support Team _ Frank
If this post helps, then please consider Accept it as the solution to help the others find it more quickly.

Helpful resources

Announcements
August Power BI Update Carousel

Power BI Monthly Update - August 2025

Check out the August 2025 Power BI update to learn about new features.

August 2025 community update carousel

Fabric Community Update - August 2025

Find out what's new and trending in the Fabric community.