Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Get certified in Microsoft Fabric—for free! For a limited time, get a free DP-600 exam voucher to use by the end of 2024. Register now

Reply
jtpiazzamn
Helper I
Helper I

Remove HTML tags from Column

I have a colmn which contains HTML tags as an example: <p><strong>NEW IDEA</strong></p> 

It would be great if we could just PARSE the text out from the HTML. So the end result of this line would be "NEW IDEA" (without the "" of course). 

 

I have seen lots of examples, but can't find one that would work for me. Would I create a new column which have a DAX equation? 

Thanks in advance. HEY Microsoft - you can parse out date from a long date field, can't you parse out html? 

1 ACCEPTED SOLUTION
Shaurya
Memorable Member
Memorable Member

Hi @jtpiazzamn,

 

You can do this transformation in Power Query by extracting the text between delimiters and using advanced options to skip the first occurence of '>'.

 

Screenshot 2022-10-11 021051.jpg

 

I tried this and got "NEW IDEA" (without the "" of course). For your reference:

 

let
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45Wiik1MDBOLgBTqRBOcUlRfl46RMTPNVzB08XVESKjjywFFYJqVYqNBQA=", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Tags = _t]),
    #"Changed Type" = Table.TransformColumnTypes(Source,{{"Tags", type text}}),
    #"Extracted Text Between Delimiters" = Table.TransformColumns(#"Changed Type", {{"Tags", each Text.BetweenDelimiters(_, ">", "<", 1, 0), type text}})
in
    #"Extracted Text Between Delimiters"

 

Works for you? Mark this post as a solution if it does!

View solution in original post

1 REPLY 1
Shaurya
Memorable Member
Memorable Member

Hi @jtpiazzamn,

 

You can do this transformation in Power Query by extracting the text between delimiters and using advanced options to skip the first occurence of '>'.

 

Screenshot 2022-10-11 021051.jpg

 

I tried this and got "NEW IDEA" (without the "" of course). For your reference:

 

let
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45Wiik1MDBOLgBTqRBOcUlRfl46RMTPNVzB08XVESKjjywFFYJqVYqNBQA=", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Tags = _t]),
    #"Changed Type" = Table.TransformColumnTypes(Source,{{"Tags", type text}}),
    #"Extracted Text Between Delimiters" = Table.TransformColumns(#"Changed Type", {{"Tags", each Text.BetweenDelimiters(_, ">", "<", 1, 0), type text}})
in
    #"Extracted Text Between Delimiters"

 

Works for you? Mark this post as a solution if it does!

Helpful resources

Announcements
November Carousel

Fabric Community Update - November 2024

Find out what's new and trending in the Fabric Community.

Live Sessions with Fabric DB

Be one of the first to start using Fabric Databases

Starting December 3, join live sessions with database experts and the Fabric product team to learn just how easy it is to get started.

Las Vegas 2025

Join us at the Microsoft Fabric Community Conference

March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early Bird pricing ends December 9th.

Nov PBI Update Carousel

Power BI Monthly Update - November 2024

Check out the November 2024 Power BI update to learn about new features.