Power BI is turning 10! Tune in for a special live episode on July 24 with behind-the-scenes stories, product evolution highlights, and a sneak peek at what’s in store for the future.
Save the dateEnhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.
Good morning,
How do I import a PDF without dividing it into columns?
Att.
Julio
Solved! Go to Solution.
The only built-in connector is Pdf.Tables, and I do not believe there is any way to prevent it from parsing the PDF content into a table of tables of page text and tables (charts, etc.). You may want to look into using Power Query R or Python.
That said, here is a general set of PQ transformations you can use to convert the default Pdf.Tables parsing into a something similar to Lines.FromBinary, which I think is what you are asking for.
let
// Load the PDF file a location
// Note: Use File.Contents for local files
Source = Web.Contents(
"https://file-examples.com/storage/feeed4f6296807c3196e058/2017/10/file-example_PDF_1MB.pdf"
),
// Parse the PDF file to extract its content
PdfParse = Pdf.Tables(Source),
// Filter the parsed content to get only the pages
GetPagesOnly = Table.SelectRows(PdfParse, each ([Kind] = "Page")),
// Combine all the separate tables of page content into one table for the whole file
CombineAllPageContent = Table.Combine(GetPagesOnly[Data]),
// Merge all columns into a single column (called "Merged")
MergeAllPageColumns = Table.CombineColumns(
CombineAllPageContent,
Table.ColumnNames(CombineAllPageContent),
Combiner.CombineTextByDelimiter("", QuoteStyle.None),
"Merged"
)
in
MergeAllPageColumns
Quick visual of the steps to see what the above is doing:
Hi @Jbuzios ,
As we haven’t heard back from you, we wanted to kindly follow up to check if the solution provided for the issue worked? or Let us know if you need any further assistance?
If our response addressed, please mark it as Accept as solution and click Yes if you found it helpful.
Regards,
The only built-in connector is Pdf.Tables, and I do not believe there is any way to prevent it from parsing the PDF content into a table of tables of page text and tables (charts, etc.). You may want to look into using Power Query R or Python.
That said, here is a general set of PQ transformations you can use to convert the default Pdf.Tables parsing into a something similar to Lines.FromBinary, which I think is what you are asking for.
let
// Load the PDF file a location
// Note: Use File.Contents for local files
Source = Web.Contents(
"https://file-examples.com/storage/feeed4f6296807c3196e058/2017/10/file-example_PDF_1MB.pdf"
),
// Parse the PDF file to extract its content
PdfParse = Pdf.Tables(Source),
// Filter the parsed content to get only the pages
GetPagesOnly = Table.SelectRows(PdfParse, each ([Kind] = "Page")),
// Combine all the separate tables of page content into one table for the whole file
CombineAllPageContent = Table.Combine(GetPagesOnly[Data]),
// Merge all columns into a single column (called "Merged")
MergeAllPageColumns = Table.CombineColumns(
CombineAllPageContent,
Table.ColumnNames(CombineAllPageContent),
Combiner.CombineTextByDelimiter("", QuoteStyle.None),
"Merged"
)
in
MergeAllPageColumns
Quick visual of the steps to see what the above is doing:
Hi @Jbuzios ,
We wanted to kindly follow up to check if the solution provided for the issue worked? or Let us know if you need any further assistance?
If our response addressed, please mark it as Accept as solution and click Yes if you found it helpful.
Regards,
Hi @Jbuzios ,
When importing a PDF into Power BI, it often auto-detects and splits the data into multiple columns based on layout. To keep everything in a single column, you can open Power Query, find the table or page you’re importing, and use the “Combine” or “Extract Text” options instead of loading structured tables. Alternatively, choose to import from the “Document” or “Page” level rather than individual tables—this way, each line or block of text stays together. You can then clean or split the data manually as needed.
Hi @Jbuzios ,
As we haven’t heard back from you, we wanted to kindly follow up to check if the solution provided for the issue worked? or Let us know if you need any further assistance?
If our response addressed, please mark it as Accept as solution and click Yes if you found it helpful.
Regards,
Assuming you have the data already inputed in PQ. Using the following steps post that should work:
= Table.FromRows( List.Combine( Table.ToColumns( [YourTableName] ) ) )
If you're looking for all the contents in different columns to be in one single column, then this should work. Thanks!
Let me know if I understood your query correctly.
hi @Jbuzios ,
probably multiple steps to this depending on the pdf structure. Potential steps:
1. Combine Data from Multiple PDF Files into a Single Excel File or Combine Data from Multiple PDFs with Inconsistent Column Names!
2. to get it into a couple of columns : Unpivot Multiple Column Groups
if this doesn't resolve the issue, kindly provide a sample input masking senstitive data and a sample output
Check out the July 2025 Power BI update to learn about new features.
This is your chance to engage directly with the engineering team behind Fabric and Power BI. Share your experiences and shape the future.