Don't miss your chance to take the Fabric Data Engineer (DP-600) exam for FREE! Find out how by attending the DP-600 session on April 23rd (pacific time), live or on-demand.
Learn moreNext up in the FabCon + SQLCon recap series: The roadmap for Microsoft SQL and Maximizing Developer experiences in Fabric. All sessions are available on-demand after the live show. Register now
Is there a way to make the fuzzy merge feature place higher importance on matching rows using the transformation table, then by using it's algorithm? I believe this would help me solve the issue I'm seeing, but am open to trying different things.
I have two queries for which I have created a "bridge" to avoid a many-to-many relationship between rows. The data represents units sold and cases opened to support different account names, and unfortunately these two sources have been set up so that not every account name is guaranteed to be spelled the same in every instance...for example:
| Query 1 Sales Account Names | Query 2 Case Account Names | Desired Visualisation Account Name |
| East Communications | East Communications, Inc. | East Communications |
| East Communications, Inc | East Comm. | East Communications |
| East Communications Inc. | East Communications Inc | East Communications |
| Freeflight | Freeflight (Bennington) | Freeflight |
| Freeflight | Freeflight (IT) | Freeflight |
I have tried fuzzy merging the information in my bridge query with one of the original queries to assign each row the desired account name, but even when using a transformation table and playing around with the threshold, it does not always link the same account name. There are rows where the match has correctly assigned the desired name, but for the same account names there will also be rows that have defaulted to the original, show an incorrect name, or even come up as null.
Hi @akg ,
I have tested the sample data. It works using transformation table. You could download my sample to have a try.
Reference:
Fuzzy Matching in Power BI and Power Query; Match based on Similarity Threshold
If your tables still can't work fine, maybe it is caused by the sample data is too simple. Please share a dummy pbix file that we can test.
Here is a sample of my .pbix in Onedrive:
https://1drv.ms/u/s!As6Gm7qZSazFmSdN0X_5Z7qeNoBp?e=EnJrXJ
I've taken few different account names I was having trouble with along with some other random names to try and represent the dataset and created the bridge along with fuzzy merging. I have removed the duplicates so that the relationships will work, but if you roll it back you can see where the names arent lining up properly.
In this sample every account name has a match in the Fuzzy Match transformation table I loaded, but this is not always the case. My original transformation table only really includes matches for the obvious account names that I want matched. For example, I have ATT West in the "from" column of the fuzzy match transformation table, but this wouldn't normally be included in that table at all.
If you have recently started exploring Fabric, we'd love to hear how it's going. Your feedback can help with product improvements.
A new Power BI DataViz World Championship is coming this June! Don't miss out on submitting your entry.
Share feedback directly with Fabric product managers, participate in targeted research studies and influence the Fabric roadmap.
| User | Count |
|---|---|
| 48 | |
| 43 | |
| 39 | |
| 19 | |
| 17 |
| User | Count |
|---|---|
| 68 | |
| 63 | |
| 31 | |
| 30 | |
| 23 |