jtpiazzamn
Helper I

Remove HTML tags from Column

I have a column that contains HTML tags, for example: <p><strong>NEW IDEA</strong></p>

It would be great if we could just PARSE the text out from the HTML. So the end result of this line would be "NEW IDEA" (without the "" of course). 

 

I have seen lots of examples, but can't find one that works for me. Would I create a new column with a DAX formula?

Thanks in advance. HEY Microsoft - you can parse a date out of a long date field, can't you parse out HTML?

1 ACCEPTED SOLUTION
Shaurya
Memorable Member

Hi @jtpiazzamn,

 

You can do this transformation in Power Query by extracting the text between delimiters and using the advanced options to skip the first occurrence of '>', so that extraction starts after the second '>' and stops at the next '<'.

 


 

I tried this and got "NEW IDEA" (without the "" of course). For your reference:

 

let
    // Sample data: a one-column table (Tags) embedded as compressed, Base64-encoded text
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45Wiik1MDBOLgBTqRBOcUlRfl46RMTPNVzB08XVESKjjywFFYJqVYqNBQA=", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Tags = _t]),
    #"Changed Type" = Table.TransformColumnTypes(Source,{{"Tags", type text}}),
    // Skip the first ">" (index 1 = second occurrence), then read up to the first "<" that follows it
    #"Extracted Text Between Delimiters" = Table.TransformColumns(#"Changed Type", {{"Tags", each Text.BetweenDelimiters(_, ">", "<", 1, 0), type text}})
in
    #"Extracted Text Between Delimiters"
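
The query above returns only the text that follows the second '>', which fits values shaped like <p><strong>NEW IDEA</strong></p>. If the column mixes different tag nesting, a more general variant may help. The following is only a rough sketch, not part of the original solution: StripTags and the Sample table are hypothetical names made up for illustration. It assumes simple markup with no '>' inside attribute values, and it does not decode entities such as &amp;.

```powerquery
let
    // Hypothetical helper: removes everything between "<" and ">" from a text value
    StripTags = (html as nullable text) as nullable text =>
        if html = null then null
        else
            let
                // Splitting on "<" leaves the text before the first tag as the first piece;
                // every later piece starts inside a tag, e.g. "strong>NEW IDEA"
                Parts = Text.Split(html, "<"),
                Head = List.First(Parts),
                // For the later pieces, keep only what follows the closing ">" of the tag
                Tail = List.Transform(List.Skip(Parts, 1), each Text.AfterDelimiter(_, ">")),
                Joined = Text.Combine({Head} & Tail)
            in
                Text.Trim(Joined),
    // Hypothetical sample rows; replace with your own table and column name
    Sample = #table(
        type table [Tags = text],
        {
            {"<p><strong>NEW IDEA</strong></p>"},
            {"<p>First <em>second</em> third</p>"}
        }
    ),
    Result = Table.TransformColumns(Sample, {{"Tags", StripTags, type text}})
in
    Result
```

With these two sample rows, the Tags column should come out as NEW IDEA and First second third. To use it on real data, point Table.TransformColumns at your own table instead of Sample.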

 

Works for you? Mark this post as a solution if it does!


