Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Next up in the FabCon + SQLCon recap series: The roadmap for Microsoft SQL and Maximizing Developer experiences in Fabric. All sessions are available on-demand after the live show. Register now

Reply
Anonymous
Not applicable

Levenshtein String Distance Algorithm In DAX

Hello,

 

So far, this article is the closest I've come to trying to reach a measure that I've been working on for a while...

 

Measure equavalent for string similarity formula

 

I know it can be done in SQL through a scalar value function creation. I am wondering if it can be done in Power BI using some form of calculatetable, selected value, and and an iterator function like sumx and/or rankx. Here is an example of what I am trying to do...

 

I have one static column as seen below. I want to create a measure that finds the top n number of similarity percentages for a [TestColumn] selected value slicer matched with every other value within the same column. 

 

TestColumn

Leaf
Leaves
Trees
Leafly
Lost Lake
Hawaii
Free
Moist
posture
Classical
Classic
Jobe
Job
Freedom
Lost Music
rap
R&B
Rapper
Rapped
Wrap
Wrrap
Wrapper
Boy
Boys
Boston
 

The reason for this question is to help a company do a massive clean up to a 32k vendor list. 

 

Please let me know if this is possible in DAX. 

 

Thanks.

 

 

1 ACCEPTED SOLUTION
v-gizhi-msft
Community Support
Community Support

Hi,

 

Please try to create a seperate slicer table as the same with your original table first.

Create this column:

Column = LEN('Table'[TestColumn])

Then try this measure:

Measure = 
VAR SlicerText =
    SELECTEDVALUE ( 'Slicer Table'[TestColumn] )
VAR TableText =
    SELECTEDVALUE ( 'Table'[TestColumn] )
VAR length =
    MAX ( LEN ( SlicerText ), LEN ( TableText ) )
VAR TestTable =
    ADDCOLUMNS (
        GENERATESERIES ( 1, length, 1 ),
        "InSlicer", MID ( SlicerText, [Value], 1 ),
        "InTable", MID ( TableText, [Value], 1 )
    )
RETURN
    COUNTROWS ( FILTER ( TestTable, [InSlicer] = [InTable] ) )
        / COUNTROWS ( TestTable )

When you select one value in slicer, the result shows:

20.PNG

Here is my test pbix file:

pbix 

Hope this helps.

 

Best Regards,

Giotto Zhi

 

View solution in original post

4 REPLIES 4
v-gizhi-msft
Community Support
Community Support

Hi,

 

Please try to create a seperate slicer table as the same with your original table first.

Create this column:

Column = LEN('Table'[TestColumn])

Then try this measure:

Measure = 
VAR SlicerText =
    SELECTEDVALUE ( 'Slicer Table'[TestColumn] )
VAR TableText =
    SELECTEDVALUE ( 'Table'[TestColumn] )
VAR length =
    MAX ( LEN ( SlicerText ), LEN ( TableText ) )
VAR TestTable =
    ADDCOLUMNS (
        GENERATESERIES ( 1, length, 1 ),
        "InSlicer", MID ( SlicerText, [Value], 1 ),
        "InTable", MID ( TableText, [Value], 1 )
    )
RETURN
    COUNTROWS ( FILTER ( TestTable, [InSlicer] = [InTable] ) )
        / COUNTROWS ( TestTable )

When you select one value in slicer, the result shows:

20.PNG

Here is my test pbix file:

pbix 

Hope this helps.

 

Best Regards,

Giotto Zhi

 

Not Found file .pbix for download

Anonymous
Not applicable

@v-gizhi-msft 

 

Thank you, thank you, thank you! Works great!

Greg_Deckler
Community Champion
Community Champion

So what would be the expected output from the sample data you have provided? Are you basically trying to determine how many characters each value has in common with all of the other values in the column? 



Follow on LinkedIn
@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
DAX For Humans

DAX is easy, CALCULATE makes DAX hard...

Helpful resources

Announcements
New to Fabric survey Carousel

New to Fabric Survey

If you have recently started exploring Fabric, we'd love to hear how it's going. Your feedback can help with product improvements.

Power BI DataViz World Championships carousel

Power BI DataViz World Championships - June 2026

A new Power BI DataViz World Championship is coming this June! Don't miss out on submitting your entry.

Join our Fabric User Panel

Join our Fabric User Panel

Share feedback directly with Fabric product managers, participate in targeted research studies and influence the Fabric roadmap.

March Power BI Update Carousel

Power BI Community Update - March 2026

Check out the March 2026 Power BI update to learn about new features.