Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Find everything you need to get certified on Fabric—skills challenges, live sessions, exam prep, role guidance, and more. Get started

Reply
jtpiazzamn
Helper I
Helper I

Remove HTML tags from Column

I have a colmn which contains HTML tags as an example: <p><strong>NEW IDEA</strong></p> 

It would be great if we could just PARSE the text out from the HTML. So the end result of this line would be "NEW IDEA" (without the "" of course). 

 

I have seen lots of examples, but can't find one that would work for me. Would I create a new column which have a DAX equation? 

Thanks in advance. HEY Microsoft - you can parse out date from a long date field, can't you parse out html? 

1 ACCEPTED SOLUTION
Shaurya
Memorable Member
Memorable Member

Hi @jtpiazzamn,

 

You can do this transformation in Power Query by extracting the text between delimiters and using advanced options to skip the first occurence of '>'.

 

Screenshot 2022-10-11 021051.jpg

 

I tried this and got "NEW IDEA" (without the "" of course). For your reference:

 

let
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45Wiik1MDBOLgBTqRBOcUlRfl46RMTPNVzB08XVESKjjywFFYJqVYqNBQA=", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Tags = _t]),
    #"Changed Type" = Table.TransformColumnTypes(Source,{{"Tags", type text}}),
    #"Extracted Text Between Delimiters" = Table.TransformColumns(#"Changed Type", {{"Tags", each Text.BetweenDelimiters(_, ">", "<", 1, 0), type text}})
in
    #"Extracted Text Between Delimiters"

 

Works for you? Mark this post as a solution if it does!

View solution in original post

1 REPLY 1
Shaurya
Memorable Member
Memorable Member

Hi @jtpiazzamn,

 

You can do this transformation in Power Query by extracting the text between delimiters and using advanced options to skip the first occurence of '>'.

 

Screenshot 2022-10-11 021051.jpg

 

I tried this and got "NEW IDEA" (without the "" of course). For your reference:

 

let
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45Wiik1MDBOLgBTqRBOcUlRfl46RMTPNVzB08XVESKjjywFFYJqVYqNBQA=", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Tags = _t]),
    #"Changed Type" = Table.TransformColumnTypes(Source,{{"Tags", type text}}),
    #"Extracted Text Between Delimiters" = Table.TransformColumns(#"Changed Type", {{"Tags", each Text.BetweenDelimiters(_, ">", "<", 1, 0), type text}})
in
    #"Extracted Text Between Delimiters"

 

Works for you? Mark this post as a solution if it does!

Helpful resources

Announcements
Europe Fabric Conference

Europe’s largest Microsoft Fabric Community Conference

Join the community in Stockholm for expert Microsoft Fabric learning including a very exciting keynote from Arun Ulag, Corporate Vice President, Azure Data.

AugPowerBI_Carousel

Power BI Monthly Update - August 2024

Check out the August 2024 Power BI update to learn about new features.

September Hackathon Carousel

Microsoft Fabric & AI Learning Hackathon

Learn from experts, get hands-on experience, and win awesome prizes.

Sept NL Carousel

Fabric Community Update - September 2024

Find out what's new and trending in the Fabric Community.