Starting December 3, join live sessions with database experts and the Microsoft product team to learn just how easy it is to get started
Learn moreGet certified in Microsoft Fabric—for free! For a limited time, get a free DP-600 exam voucher to use by the end of 2024. Register now
Hi Folks,
I am stumped.
I am trying to merge two tables into a new table using a LEFT JOIN. Both tables have unique records on each row. When I join the tables, BI creates duplicate rows on some records for no apparent reason. The duplicates are identical in every way.
Can anybody explain this? Is this a bug?
Solved! Go to Solution.
The problem lay with the Data.
The LEFT JOIN functionality of PowerBI works perfectly.
I had same problem. I was joining Table A with Table B on 2 columns, that were numbers but marked as Text type. I changed the type of these columns I was joining on in both tables to Whole number type. Then the Left Join worked and the duplicates went away!
I had the same problem and found my solution so thought I'd share it, for what its worth.
I had a very large table of data and another table of account details that I was merging to my first data set, but was getting duplicate rows.
There was a unique key to link between the 2 tables but hadn't created a relationship on the model window. As it turns out it was a many to many relationship and this is what caused the dulpicates. Since the account table was only there to extract additional info to my first data set, in the account table in the Power Query editor window I simply clicked on my key column and selected remove duplicates to force a many to one relationship and this removed the duplication on merge.
I don't really get why the people commenting after my comment are still looking for an answer!!!!
my comment was litterally a valid solution, will I actually didn't really understand how merge works back then when I looked for this post, people are still missing merge with the regular vlookup function.
vlookup looks only for the first value in a column.
merge brings all the instances of this value no matter how many there are, and it duplicates the value in the first table to match that count, it works this way and there is no problem in this, the logic is different (it simply doesn't know which instance you wish to have so it brings you all of them).
all you have to do is to remove duplicates, you have to do it in the second table you're merging with (before you do the merge), or you have to do it the way I described in my last "solution" comment (you may look here in the previous comments).
please mark my comment as a solution so people don't keep getting lost everytime they enter this thread.
Regards for all of you.
Th steer here spot on - thanks - duplicate records in the right hand linked table caused duplicate rows from left hand linked table. Unlike Vlookup which only returns first match, the Access query returns a record for each match in the right table. Removing duplicates in right hand table solves problem.
Hi All,
I having similar issue, Below are the tables and the output after using full outer join
Table1
Table 2
Merging query screenshot
Merged table with duplicate entries- actually it copies table 2 for each entry of table 1
Any solution
Had the same issue, where I was merging queries.
The issue in my instance was that the base query was expanded which resulted in duplicates in the base query.
Once the expansion was removed in the base query the merged query was fine.
So if data is okay, the issue may be caused by duplicates in the base query.
Can you explain "base query was expanded"?
he meant the other query that you are merging your data with
so if you are in Table A and trying to merge data with Table B, if Table B has duplicates it will result in a duplicates in your Table A after you expand the results, so it's better to have Table B cleaned from duplicates first.
Hi people.
I've been looking for this issue with no luck as well until I founded a valid work around.
the work around goes like this ::
-Merge your sheets as you like (you don't have to merge on more than one column, one is fine).
-Add index column
-Expand your merged column (this will create duplicate values, the main issue we're looking into).
-since the values in the "index column" were purely unique, now Remove duplicates using index column.
-and now your data is back to original as it was inteded to be.
hopefully you'll reply but when I do this ...it does indeed eliminate the duplicates in the 2nd table added
but after removing the dups in the index, now the original table has dups
here is sample data ..trying to merge "ycr" and "integrated"and end up with "goal"
I work with very extensive datas sheets and as soon as I tried this solution (that works pretty well) the uptade started to take hours to be conclude, thats the only issue Im having right now... Any clues about that?
Having the same issue. Performance severely degrades when I use this method. It works in theory, but in some cases will take forever.
hello there,
the slowness is not coming from adding an index column or removing the duplicate, Power BI and Excel Power Queries are always having a problem when using the "Merge" method, this is happening because "merge" includes a "reference" step inside of it, this specific step "reference" is a performance killer, it really slows everything down in a horrible way, the only two solutions I found to this problem is either you find a workaround to this and try to achieve your result without using "merge" or "reference" (a workaround that does the same effect and I have done that sometimes), the other solution is to not use it all together, there is really no way to optimize the "merge" itself.
Regards,
Thanks. This solved my issue.
Thanks! This solution works like a charm
Don't know why Microsoft don want to fix this simple issue....
this approach should be accepted as solution. It perfectly resolved the issue.
@othmanalazzam wrote:Hi people.
I've been looking for this issue with no luck as well until I founded a valid work around.
the work around goes like this ::
-Merge your sheets as you like (you don't have to merge on more than one column, one is fine).
-Add index column
-Expand your merged column (this will create duplicate values, the main issue we're looking into).
-since the values in the "index column" were purely unique, now Remove duplicates using index column.
-and now your data is back to original as it was inteded to be.
My duplicate issue is solved by applying this method. Thank you very much.
I've been working with a sales table and trying to create dimension tables from the table with all of the fields. I've been frustrated, because I could not tell where the duplicate rows of data were coming from; however, could tell they happened after one of the two merge query steps.
I can affirm that having a unique index prior to the merge, and then removing duplicates from this field works.
Does anyone know why this occurs?
I'm facing the same trouble here. Not sure why merging 2 tables create duplicates. Have anyone got a work around this?
Update 9/10 : Problem Solved. My original first table already have got unique rows hence I should have merged it as aggregated values otherwise it will further expand the table, causing duplicates. Also, it helps to use and select more than one field as the key to look up if you need that level of granularity.
I just ran into the same problem and fixed it by joining the two tables on multiple fields. For example, let's say a transaction number is the unique field you are joining on, expand the join to include other fields if possible. So your joins would be on transaction number, time, and amount.
Starting December 3, join live sessions with database experts and the Fabric product team to learn just how easy it is to get started.
March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early Bird pricing ends December 9th.
User | Count |
---|---|
94 | |
92 | |
82 | |
71 | |
49 |
User | Count |
---|---|
143 | |
121 | |
111 | |
59 | |
57 |