cancel
Showing results for 
Search instead for 
Did you mean: 
Reply
jacek_k
Frequent Visitor

Extracting data from string - mixed column names and values

Hi Guys,

I have to extract data from strings. 100 columns and 10k+ rows.

Every row contains column names and corresponded values after semicolon but not each row contains each column.

I found solution but I think it is to complicated and can generate errors because of simply use of isOdd/isEven/FillDown functions. For example I had to add Tabel.Buffer to handle errors.

jacek_k_0-1652448116160.png

 

Is there easiest way to get the result?

Link to sample file on my onedrive:

Sample_file 

Regards

Jacek

 

1 ACCEPTED SOLUTION
AlexisOlson
Super User
Super User

Maybe this will work better. I'm not sure it will, but it's worth a try.

let
    Source = Excel.CurrentWorkbook(){[Name="Data"]}[Content],
    #"Added Index" = Table.AddIndexColumn(Source, "Index", 1, 1, Int64.Type),
    #"To Tables" = Table.TransformColumns(#"Added Index",
        {{"Data", each
            let
                Pairs = List.Split(Text.Split(_, ";"), 2),
                ColNames = List.Transform(Pairs, each _{1}),
                ColValues = List.Transform(Pairs, each _{0})
            in
                Record.ToTable(Record.FromList(ColNames, ColValues))
            , type table}}
        ),
    #"Expanded Data" = Table.ExpandTableColumn(#"To Tables", "Data", {"Name", "Value"}, {"Name", "Value"}),
    #"Pivoted Column" = Table.Pivot(#"Expanded Data", List.Distinct(#"Expanded Data"[Name]), "Name", "Value")
in
    #"Pivoted Column"

 

The idea here is that the ColName;ColRowValue pairs are similar to how a record is structured [ColName=ColRowValue]. I split each text string into a list and then use List.Split to group it into pairs, taking the names from the first element of each pair and the values from the second element of each pair. Then turn that pair of lists into a record and then a table. Expand the tables and pivot on the names.

 

There may be some possible buffering optimizations but I'll let you experiment with that.

View solution in original post

2 REPLIES 2
jacek_k
Frequent Visitor

Hi, it works perfectly, i forgot to reply 😞

AlexisOlson
Super User
Super User

Maybe this will work better. I'm not sure it will, but it's worth a try.

let
    Source = Excel.CurrentWorkbook(){[Name="Data"]}[Content],
    #"Added Index" = Table.AddIndexColumn(Source, "Index", 1, 1, Int64.Type),
    #"To Tables" = Table.TransformColumns(#"Added Index",
        {{"Data", each
            let
                Pairs = List.Split(Text.Split(_, ";"), 2),
                ColNames = List.Transform(Pairs, each _{1}),
                ColValues = List.Transform(Pairs, each _{0})
            in
                Record.ToTable(Record.FromList(ColNames, ColValues))
            , type table}}
        ),
    #"Expanded Data" = Table.ExpandTableColumn(#"To Tables", "Data", {"Name", "Value"}, {"Name", "Value"}),
    #"Pivoted Column" = Table.Pivot(#"Expanded Data", List.Distinct(#"Expanded Data"[Name]), "Name", "Value")
in
    #"Pivoted Column"

 

The idea here is that the ColName;ColRowValue pairs are similar to how a record is structured [ColName=ColRowValue]. I split each text string into a list and then use List.Split to group it into pairs, taking the names from the first element of each pair and the values from the second element of each pair. Then turn that pair of lists into a record and then a table. Expand the tables and pivot on the names.

 

There may be some possible buffering optimizations but I'll let you experiment with that.

Helpful resources

Announcements
PBI Sept Update Carousel

Power BI September 2023 Update

Take a look at the September 2023 Power BI update to learn more.

Learn Live

Learn Live: Event Series

Join Microsoft Reactor and learn from developers.

Dashboard in a day with date

Exclusive opportunity for Women!

Join us for a free, hands-on Microsoft workshop led by women trainers for women where you will learn how to build a Dashboard in a Day!

MPPC 2023 PBI Carousel

Power Platform Conference-Power BI and Fabric Sessions

Join us Oct 1 - 6 in Las Vegas for the Microsoft Power Platform Conference.

Top Solution Authors
Top Kudoed Authors