Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Calling all Data Engineers! Fabric Data Engineer (Exam DP-700) live sessions are back! Starting October 16th. Sign up.

Reply
Jack2of3
Helper II
Helper II

Full outer join two tables with 100's of millions of rows each

Greetings. Absolutely new to Power BI, DAX, etc. Decades of experience in ETL, data modeling, OBIEE. 

What I need to do as a POC is pull all values from one table and populate a third value from a second table.

I will then do trend and comparative analysis on the output with filters etc. I've tried merging the tables but with the amount of data it is too slow. What I've read so far suggests using a DAX formula is the way to go, aside from using a dedicated ETL tool, to form the data. 

In the snapshots below TAP_DATA has values TOTALS for LINES_ROUTE when DATE_HOUR has a reading.

TAP_LKP has every LINE_ROUTE and every DATE_HOUR.

I need to select all rows from TAP_LKP and the TOTAL row from TAP_DATA and populate 'null' or 0 in the TOTAL row when TAP_DATA is missing a join on LINE_ROUTE and DATE_HOUR

 

TAP_DATATAP_DATA

 

TAP_LKPTAP_LKP

 

2 ACCEPTED SOLUTIONS
parry2k
Super User
Super User

@Jack2of3  it is correct that using DAX will be the way to go, to make it work first we need to make a key to set relationship between these two table and based on your input:

 

- the relationship between these two table is 1 to 1, correct?

- there might be missing records in TAP_DATA table, means not necessarily each line_route + date hour will have a record, correct?

- there will be always one record for each route and date_Hour in tap_data, correct?

 

add column in each table to set relationship:

 

UniqueKey = TAP_DATA[Line_Route] && TAP_DATA[Date_Hour]


UniqueKey = TAP_LKP[Line_Route] && TAP_LKP[Date_Hour]

in relatioship, set relationship betwee these two table on this newly added "uniquekey" column, add following column in TAP_LKP to get Total from TAP_DATA

 

Total = RELATED( TAP_DATA[Total] )

I made some assumption based on your question, we can always tweak above approach if something I missed.



Subscribe to the @PowerBIHowTo YT channel for an upcoming video on List and Record functions in Power Query!!

Learn Power BI and Fabric - subscribe to our YT channel - Click here: @PowerBIHowTo

If my solution proved useful, I'd be delighted to receive Kudos. When you put effort into asking a question, it's equally thoughtful to acknowledge and give Kudos to the individual who helped you solve the problem. It's a small gesture that shows appreciation and encouragement! ❤


Did I answer your question? Mark my post as a solution. Proud to be a Super User! Appreciate your Kudos 🙂
Feel free to email me with any of your BI needs.

View solution in original post

Thanks @parry2k  You have the description of the data exactly right. The solution worked however, with such a large amount of data and processing it on the Desk Top it is very slow.

I think I'll do it the right way and process the data via an ETL application and denormalize the input data so all Power BI needs to do is slice and dice the data.

View solution in original post

2 REPLIES 2
parry2k
Super User
Super User

@Jack2of3  it is correct that using DAX will be the way to go, to make it work first we need to make a key to set relationship between these two table and based on your input:

 

- the relationship between these two table is 1 to 1, correct?

- there might be missing records in TAP_DATA table, means not necessarily each line_route + date hour will have a record, correct?

- there will be always one record for each route and date_Hour in tap_data, correct?

 

add column in each table to set relationship:

 

UniqueKey = TAP_DATA[Line_Route] && TAP_DATA[Date_Hour]


UniqueKey = TAP_LKP[Line_Route] && TAP_LKP[Date_Hour]

in relatioship, set relationship betwee these two table on this newly added "uniquekey" column, add following column in TAP_LKP to get Total from TAP_DATA

 

Total = RELATED( TAP_DATA[Total] )

I made some assumption based on your question, we can always tweak above approach if something I missed.



Subscribe to the @PowerBIHowTo YT channel for an upcoming video on List and Record functions in Power Query!!

Learn Power BI and Fabric - subscribe to our YT channel - Click here: @PowerBIHowTo

If my solution proved useful, I'd be delighted to receive Kudos. When you put effort into asking a question, it's equally thoughtful to acknowledge and give Kudos to the individual who helped you solve the problem. It's a small gesture that shows appreciation and encouragement! ❤


Did I answer your question? Mark my post as a solution. Proud to be a Super User! Appreciate your Kudos 🙂
Feel free to email me with any of your BI needs.

Thanks @parry2k  You have the description of the data exactly right. The solution worked however, with such a large amount of data and processing it on the Desk Top it is very slow.

I think I'll do it the right way and process the data via an ETL application and denormalize the input data so all Power BI needs to do is slice and dice the data.

Helpful resources

Announcements
FabCon Global Hackathon Carousel

FabCon Global Hackathon

Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes!

September Power BI Update Carousel

Power BI Monthly Update - September 2025

Check out the September 2025 Power BI update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.

Top Solution Authors
Top Kudoed Authors