Hello all super users,
I'm wondering if anyone has seen this and has a solution for me.
I'm using the following transformation step in a transformation function when extracting data from PDF files on a SharePoint site. I use the function to dynamically adjust the transformation depending on the varying number of columns contained in the PDF files.
= Table.TransformColumns(
    Table.ExpandRecordColumn(
        #"Stage2",
        "Data",
        Table.ColumnNames(Table.FromRecords(List.Select(Table.Column(#"Stage2", "Data"), each _ <> null and _ <> ""))),
        Table.ColumnNames(Table.FromRecords(List.Select(Table.Column(#"Stage2", "Data"), each _ <> null and _ <> "")))
    ),
    List.Transform(
        Table.ColumnNames(Table.FromRecords(List.Select(Table.Column(#"Stage2", "Data"), each _ <> null and _ <> ""))),
        each {_, fnSuffix, type text}
    )
)
It works well in Power BI Desktop and in a dataflow on a Pro workspace. However, when I import it into a dataflow on a Premium Per User (PPU) workspace, I get the following error message:
"PipelineException: Attempted to read or write protected memory. This is often an indication that other memory is corrupt. . RootActivityId"
Solutions or pointers very welcome!
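For readability, the step above can also be written with the list of column names computed once and reused, instead of repeating the same expression three times. This is only a sketch and is not known to affect the PPU error: the sample Stage2 table and the body of fnSuffix are hypothetical stand-ins so that the query is self-contained, and the intermediate step names (Records, ColumnNames, Expanded, Transformed) are made up for illustration.

```powerquery
let
    // Hypothetical stand-in for the real #"Stage2" step:
    // a table whose "Data" column holds records (or null).
    Stage2 = Table.FromRecords({
        [Id = 1, Data = [Column1 = "a", Column2 = "b"]],
        [Id = 2, Data = null],
        [Id = 3, Data = [Column1 = "c", Column2 = "d"]]
    }),

    // Hypothetical stand-in for fnSuffix; the real function is not shown in the post.
    fnSuffix = (value) => if value = null then null else Text.From(value) & "_x",

    // Keep only the non-empty entries of the "Data" column (same filter as the original step).
    Records = List.Select(Table.Column(Stage2, "Data"), each _ <> null and _ <> ""),

    // Derive the column names once instead of recomputing them for every argument.
    ColumnNames = Table.ColumnNames(Table.FromRecords(Records)),

    // Expand the record column, keeping the field names as the new column names.
    Expanded = Table.ExpandRecordColumn(Stage2, "Data", ColumnNames, ColumnNames),

    // Apply fnSuffix to every expanded column and type the result as text.
    Transformed = Table.TransformColumns(
        Expanded,
        List.Transform(ColumnNames, each {_, fnSuffix, type text})
    )
in
    Transformed
```

In the real function, the first two steps would be dropped and Stage2 replaced by the existing #"Stage2" step. Splitting the expression into named steps also makes it easier to see which step the service fails on.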
Hi @MondayMorning ,
I can't reproduce your issue. I can refresh data from a PDF on SharePoint successfully.
I suggest creating a support ticket to get technical help.
Here is a similar issue: "PDF on sharepoint can't be read when scheduled refresh".
Hope it helps,
Community Support Team _ Caitlyn
If this post helps, then please consider accepting it as the solution to help other members find it more quickly.
Have you tried to reproduce this in a PPU environment? Could you send me code that extracts data from a SharePoint source in a dataflow on PPU, or does the following code work for you?
let
    Source = SharePoint.Files("https://xxxxxxxxxxxxx", [ApiVersion = 15]),
    #"Filtered rows" = Table.SelectRows(Source, each Text.StartsWith([Folder Path], "https://xxxxxxxxxxxxx")),
    Navigation = #"Filtered rows"{[Name = "0206005409180001_Vermögensausweis_per_16062020_20200617030757083256.pdf", #"Folder Path" = "https://xxxxxxxxxxxxx"]}[Content],
    #"Imported PDF" = Pdf.Tables(Navigation, [Implementation = "1.3"]),
    #"Expanded Data" = Table.ExpandTableColumn(#"Imported PDF", "Data", {"Column1", "Column2", "Column3", "Column4", "Column5", "Column6", "Column7", "Column8", "Column9", "Column10", "Column11", "Column12", "Column13", "Column14", "Column15"}, {"Column1", "Column2", "Column3", "Column4", "Column5", "Column6", "Column7", "Column8", "Column9", "Column10", "Column11", "Column12", "Column13", "Column14", "Column15"}),
    #"Changed column type" = Table.TransformColumnTypes(#"Expanded Data", {{"Id", type text}, {"Name", type text}, {"Kind", type text}, {"Column1", type text}, {"Column2", type text}, {"Column3", type text}, {"Column4", type text}, {"Column5", type text}, {"Column6", type text}, {"Column7", type text}, {"Column8", type text}, {"Column9", type text}, {"Column10", type text}, {"Column11", type text}, {"Column12", type text}, {"Column13", type text}, {"Column14", type text}, {"Column15", type text}})
in
    #"Changed column type"
I see you had a similar problem a year ago. Have you found out anything new about the error message and PDF handling in the service? I suspect there might be an issue with the PDF connector when reading from SharePoint folders in the Premium service, since everything works well in the Pro service.
Thanks!