Earn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.
Hi
I have been generating some fuzzy matching queries
The souce is a table of about 6000 names
I have self joined that list with 3 fuzzy fields: sector, subsector, company
Everything was done in excel. I am supposing that both power queries (PBI and Excel) should have the same problem.
I have executed it several times.
Each time I get a different result.
I have even filtrered the list to 4 results that I know should match. These 4.
AUTO; CARS; ALFA ROMEO
AUTO; CARS; ALFAROMEO
HOME; HEATING; ARISTON THERMAL
HOME; HEATING; ARISTON
Sometimes they match, sometimes they don't.
The fuzzyness shouldn't make the match non-deterministic
Or is there a random heuristic behaviour beneath the surface?
If so there should be a set of seeds to be set up.
Any idea?
You can set the Similarity thresold of the fuzzy option to 0.5.
Best Regards!
Yolo Zhu
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.
Hi, @Eduardo_Suela
check for similarity threshold
Proud to be a Super User!
Thanks
Obviously I checked that
But my problem is the same query giving different results for the same data in different executions