Starting December 3, join live sessions with database experts and the Microsoft product team to learn just how easy it is to get started
Learn moreGet certified in Microsoft Fabric—for free! For a limited time, get a free DP-600 exam voucher to use by the end of 2024. Register now
Hello every one,
Firstly thanks for your coming support.
As an example, i would like to transform in power query the excel text list below
Column | Text |
A | regerge |
B | ere |
C | et |
D | aar |
A | arr |
B | thhtrjh |
C | sf |
D | sgkuy |
E | sdfz |
A | scsff |
C | sretj |
D | ki |
to the following table
A | B | C | D | E | |
record 1 | regerge | ere | et | aar | |
record 2 | arr | thhtrjh | sf | sgkuy | sdfz |
record 3 | scsff | sretj | ki |
It maybe looks easy for an expert but surely not for me...
Regards
Solved! Go to Solution.
Hi @Murat62,
I'm not sure if I get this straight, but based on what you had said, my understanding is the following:
The category column (named Column) is recursive, and whenever the sequence starts from the beginning (with "A" value), the record number increases with 1.
So, based on it, I don't know if there's any solution in the Power Query for that matter, but I figured out a way in Excel: if you create a new column with the following formula, you'll get the record numbers:
=IFERROR(IF(A1>A2;C1+1;C1);1)
If you load this table into PBI, you can create the pivot table in the Power Query you wanted. (with Column From Examples or even in Excel with the concatenate formula you can get the "record 1", "record 2"... format easily)
Let me know please if it solves your problem and whether it was helpful or not.
Best regards,
Andris
let Source = Web.Page(Web.Contents("https://community.powerbi.com/t5/Desktop/How-to-transform-excel-text-list-to-table/td-p/288742")), Data0 = Source{0}[Data], #"Changed Type" = Table.TransformColumnTypes(Data0,{{"Column1", type text}, {"Column2", type text}}), #"Promoted Headers" = Table.PromoteHeaders(#"Changed Type", [PromoteAllScalars=true]), #"Changed Type1" = Table.TransformColumnTypes(#"Promoted Headers",{{"Column", type text}, {"Text", type text}}), #"Grouped Rows" = Table.Group(#"Changed Type1", {"Column"}, {{"Count", each _, type table}}), #"Added Custom" = Table.AddColumn(#"Grouped Rows", "Custom", each Table.AddIndexColumn([Count],"IX",1,1)), #"Removed Columns" = Table.RemoveColumns(#"Added Custom",{"Count"}), #"Expanded Custom" = Table.ExpandTableColumn(#"Removed Columns", "Custom", {"Column", "Text", "IX"}, {"Custom.Column", "Custom.Text", "Custom.IX"}), #"Removed Columns1" = Table.RemoveColumns(#"Expanded Custom",{"Column"}), #"Added Prefix" = Table.TransformColumns(#"Removed Columns1", {{"Custom.IX", each "record" & Text.From(_, "en-US"), type text}}), #"Pivoted Column" = Table.Pivot(#"Added Prefix", List.Distinct(#"Added Prefix"[Custom.Column]), "Custom.Column", "Custom.Text") in #"Pivoted Column"
@Murat62 are you working in imported mode?
in powerquery you can pivot the data, click on the first column and from the transform menu click pivot and then select how you would like to aggregate it looks like you need dont aggregate
the only thing is what defines record 1?
Proud to be a Super User!
Thanks for your quick answer.
I already tried to pivot this excel text list after importing it with power BI desktop.
It did not work in a way i got only counting numbers and not text values as i would like
Regards
Hi @Murat62,
Do you have any more information of your source table? Such as more columns. It's hard to create a table like yours with DAX and M based on my knowledge. I have tried a lot. One simple problem is it's hard to identify which records are record1.
Best Regards!
Dale
Thanks for your answer
Source data is coming from the web page where i have a list of companies name, with address, city, phone,...
I copied & pasted it to an excel file and insert a column indicating name/category per row as "company name", "address", "city", "phone".
Nevertheless, from my knowledge, i agree with you it hard to transform this list into table. I did not find by myself any solution even if this issue or problem looks to me common one.
Logically speaking as an example every time we find an "A" category in "column" it should create a new record into the table.
Best regards
Hi @Murat62,
I'm not sure if I get this straight, but based on what you had said, my understanding is the following:
The category column (named Column) is recursive, and whenever the sequence starts from the beginning (with "A" value), the record number increases with 1.
So, based on it, I don't know if there's any solution in the Power Query for that matter, but I figured out a way in Excel: if you create a new column with the following formula, you'll get the record numbers:
=IFERROR(IF(A1>A2;C1+1;C1);1)
If you load this table into PBI, you can create the pivot table in the Power Query you wanted. (with Column From Examples or even in Excel with the concatenate formula you can get the "record 1", "record 2"... format easily)
Let me know please if it solves your problem and whether it was helpful or not.
Best regards,
Andris
Hi Andris,
Working perfectly. Thanks for your support.
I learn something new & that's great.
My best regards
In Power Quey, you can:
buffer the table,
group on "Column", chosing operation "All Rows",
adjust the generated code to have an index column added to the nested tables,
expand the nested tables,
pivot the column "Column" with advanced option "Don't Aggregate" and
as a finishing touch you may add prefix "Record " to the record numbers.
let Source = Table.Buffer(ExcelTextList), #"Grouped Rows" = Table.Group(Source, {"Column"}, {{"AllRows", each Table.AddIndexColumn(_, "Record", 1, 1), type table}}), #"Expanded AllRows" = Table.ExpandTableColumn(#"Grouped Rows", "AllRows", {"Text ", "Record"}, {"Text ", "Record"}), #"Pivoted Column" = Table.Pivot(#"Expanded AllRows", List.Distinct(#"Expanded AllRows"[Column]), "Column", "Text "), #"Added Prefix" = Table.TransformColumns(#"Pivoted Column", {{"Record", each "Record " & Text.From(_, "en-US"), type text}}) in #"Added Prefix"
Hi @MarcelBeug,
Your method is lovely, but according to my understanding, this - way because of the Group By - it gets the record numbers by the characters (so the first "A" will get 1, the second "A" get 2... the first "E" get 1), instead of the alphabetical order of the "sequences" (so the first sequence of 4 types "A, B, C, D" should get the record number 1, and after the second sequence of 5 "A, B, C, D, E" should get the record number 2).
Here's what @Murat62 showed in the original post,
And that's what you get with your formula.
Regards,
Andris
@andris_ Oops, you're right.
Your Excel solution, translated to Power Query, could look like:
let
Source = Table.Buffer(ExcelTextList),
Accumulation = List.Skip(List.Accumulate(Source[Column],{{Source[Column]{0},0}},(r,c) => r & {{c, if List.Last(r){0} >= c then List.Last(r){1} + 1 else List.Last(r){1}}})),
Tabled = Table.FromRows(Accumulation),
Combined = Table.FromColumns(Table.ToColumns(Source)&{Tabled[Column2]}, Value.Type(Table.AddColumn(Source, "Record", each null, Int64.Type))),
#"Pivoted Column" = Table.Pivot(Combined, List.Distinct(Combined[Column]), "Column", "Text "),
#"Added Prefix" = Table.TransformColumns(#"Pivoted Column", {{"Record", each "Record " & Text.From(_, "en-US"), type text}})
in
#"Added Prefix"
Thanks for your answer. It looks quite too high level for me.
Nevertheless, it gave the opportunity to discover & learn new functions as table.buffer in Power Query.
will need time to digest them...
My best regards
My code with explanatory comments:
let // The Source is buffered (i.e. cached in memory). In general List.Accumulates performs better on a buffered table. Source = Table.Buffer(ExcelTextList), /* The purpose of the Accumulation step is to create a list with record numbers, starting with 1 and increasing with 1 each time the "Column" value is less than or equal to the previous "Column" value. As this requires both the previous "Column" value and the previous record number, the list is accumulated with the "Column" and record values in 1 item, so a list of lists. With Column values A,B,C,D,A,B, the list starts with {{A,0}}; after the first iteration it is {{A,0},{A,1}} etcetera. The first argument supplied to List.Accumulate is the list with "Column" values: Source[Column]. List.Accumulate will iterate - or loop - over this list. The second argument is the initial value or {{A,0}}. The third argument is the actual accumulation function: r refers to the accumulation so far, starting with {{A,0}}. The function transforms the value of r with each iteration. c refers to the current item in the list of column values, starting with A. So the first iteration takes r {{A,0}} and concatenates: {{A, 1}}: List.Last(r) takes the last item of the accumulated list {A,0}, so: List.Last(r){0} is the A from this pair. List.Last(r){1} is the 0 from this pair. As the A from this pair is <= c (Value A), the second element of the new pair becomes 0 + 1. The second iteration takes r {{A,0},{A,1}} and concatenates {B,1}, resulting in {{A,0},{A,1},{B,1}} etcetera. After the accumulation, the first item {A,0} is removed with List.Skip. */ Accumulation = List.Skip(List.Accumulate(Source[Column],{{Source[Column]{0},0}},(r,c) => r & {{c, if List.Last(r){0} >= c then List.Last(r){1} + 1 else List.Last(r){1}}})), // The list with list is converted to a table with 2 columns (default Column1 and Column2). Tabled = Table.FromRows(Accumulation), /* Column2 of this table (the record numbers) must be added as a new column to the original table. Unfortunately, no function is available to do that. But we can use Table.FromColumns which will take a list of lists, where each inner list contains column values. So first the original table must be transformed to a list of lists, using Table.ToColumns. Tabled[Column2] is concatenated to this list of list and this ensemble is converted to a table with Table.FromColumns. The (optional) second argument of table columns, defines the columns of the result. Value.Type(Source) would give the column names and types of the Source table. As the new table has an additional column (the record numbers), in this formula a dummy column is added to Source with column name "Record" and type Int64.Type. So the Value.Type is taken from: Source with the added "Record" column. */ Combined = Table.FromColumns(Table.ToColumns(Source)&{Tabled[Column2]}, Value.Type(Table.AddColumn(Source, "Record", each null, Int64.Type))), // The rest is straightforward. #"Pivoted Column" = Table.Pivot(Combined, List.Distinct(Combined[Column]), "Column", "Text "), #"Added Prefix" = Table.TransformColumns(#"Pivoted Column", {{"Record", each "Record " & Text.From(_, "en-US"), type text}}) in #"Added Prefix"
Can you explain please how exactly the code works (until the pivot part), what does it exactly do? It's working to me, but I can't totally understand every step of the formulas.
Regards,
Andris
Starting December 3, join live sessions with database experts and the Fabric product team to learn just how easy it is to get started.
March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early Bird pricing ends December 9th.
User | Count |
---|---|
87 | |
87 | |
84 | |
66 | |
49 |
User | Count |
---|---|
127 | |
109 | |
93 | |
70 | |
67 |