Power BI is turning 10, and we’re marking the occasion with a special community challenge. Use your creativity to tell a story, uncover trends, or highlight something unexpected.
Get startedJoin us for an expert-led overview of the tools and concepts you'll need to become a Certified Power BI Data Analyst and pass exam PL-300. Register now.
Hi. Is there a way to parse text to single white spaces only?
That is:
"mary white helloooo"
Becomes:
"mary white helloooo"
Solved! Go to Solution.
try this
let
Source = Excel.Workbook(File.Contents("C:yourpath\Q.14 Data.xlsx"), null, true),
Table1_Table = Source{[Item="Table1",Kind="Table"]}[Data],
#"Changed Type" = Table.TransformColumnTypes(Table1_Table,{{"Single space everything, and trim both ends (don’t make a new column)", type text}}),
tc=Table.TransformColumns( #"Changed Type", {"Single space everything, and trim both ends (don’t make a new column)", each Text.Combine(List.Select(Text.Split(_," "), each _<>""), " ")})
in
tc
Hi @michellepace ,
Could @Anonymous 's method meet your requirements?
If it could, please accept his reply as a solution so that people who may have the same question can get the solution directly.
If not, please let us know.
Best Regards,
Icey
Hi, @michellepace
Try this:
= let str="mary white helloooo" in Text.Combine(Splitter.SplitTextByWhitespace()(str), " ")
If my code solves your problem, mark it as a solution
= Splitter.SplitTextByWhitespace()(" mary white helloooo ")
this function, that of which I now know the existence thanks to @ziying35 , for a string with leading or trailing whites spaces, gives this result
where the first and last element are the empty string "".
Why?
what kind of internal logic is applied in your opinion?
@Anonymous im not sure, but maybe Splitter.SplitTextByWhitespace() considers more spaces in a row (" ") as a single "whitespace"... thats why there is only one empty element at the beginnning and end of the list:
try to write, for example, in pseudo-code (simulating a low-level language) the various steps that produce that result.
Let's see where the different management of leading and trailing spaces intervenes with respect to those inside the text string.
PS
Un empty element "" is different from blank " "
@Anonymous
The function does have the problems you describe, and I'm not sure about its internal logic. The scenarios I've used it in before have been without leading and trailing spaces
= let str=" mary white helloooo " in Text. Trim(Text.Combine(Splitter.SplitTextByWhitespace()(str), " "))
Thank you all for your replies. In truth, I am struggling to use any of the solutions. Looking at my initial post, I did not explain my requirement sufficiently. If you look at the column name below, it is exactly what I need to do. Does anyone have the line of M-Code I can paste into the Advanced Editor to achieve this? I am happy for Format>Trim to be used as the first step if it simplifies the solution.
.pbix and data here: clickme
Many thanks once again.
try this
let
Source = Excel.Workbook(File.Contents("C:yourpath\Q.14 Data.xlsx"), null, true),
Table1_Table = Source{[Item="Table1",Kind="Table"]}[Data],
#"Changed Type" = Table.TransformColumnTypes(Table1_Table,{{"Single space everything, and trim both ends (don’t make a new column)", type text}}),
tc=Table.TransformColumns( #"Changed Type", {"Single space everything, and trim both ends (don’t make a new column)", each Text.Combine(List.Select(Text.Split(_," "), each _<>""), " ")})
in
tc
Text.Combine(List.Select(Text.Split("mary white helloooo"," "), each _<>""), " ")
I do not intend to boast this solution, but just as a side observation, I point out that this cuts away the blanks that precede the first word or follow the last word.
This of course can be an advantage or a disadvantage, it depends on what one is looking for.
Paste this M code in a blank query to see the steps. The recursive function is the relevant one.
let
Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45Wyk0sqlQAgvKMzJJUEEMhIzUnJx8IlGJjAQ==", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Column1 = _t]),
#"Changed Type" = Table.TransformColumnTypes(Source,{{"Column1", type text}}),
fx = (input as text) => let reduce = Text.Replace(input," "," "), res=if Text.Length(reduce)=Text.Length(input) then input else @fx(reduce) in res,
#"Added Custom" = Table.AddColumn(#"Changed Type", "Custom", each fx([Column1]))
in
#"Added Custom"
Please mark the question solved when done and consider giving kudos if posts are helpful.
Contact me privately for support with any larger-scale BI needs, tutoring, etc.
Cheers
User | Count |
---|---|
8 | |
6 | |
6 | |
5 | |
5 |
User | Count |
---|---|
9 | |
8 | |
6 | |
6 | |
6 |