Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Join us at FabCon Vienna from September 15-18, 2025, for the ultimate Fabric, Power BI, SQL, and AI community-led learning event. Save €200 with code FABCOMM. Get registered

Tables in PDF files

I have come across so many public data repositories that hold data in PDF format. Other websites have tables within documents such as annual reports etc., also in PDF format. A data source for PDFs or tables from PDFs would be awesome!
Status: Completed
Comments
nishalit
New Member
The PDF connector is now generally available in the April release of Power BI Desktop. Learn more here: https://powerbi.microsoft.com/en-us/blog/power-bi-desktop-april-2019-feature-summary/#pdf
gerryb
New Member
I'll add a third vote for this. As Gogula indicates, PDFs are the rule for a lot of public domain data on the Web, especially from the US Gov. Personally, I hate PDFs and my choice would be to simply make them illegal 🙂 , but if we have to live with them, we're going to need a way to mine the data from that hideous file format.
fbcideas_migusr
New Member
This is huge to CFO and CMO teams. Parsing financial reports is essential task toward any competition analysis and strategic planning.
brentm1
New Member
This would be super for government data sources. Example: http://www.dfw.state.or.us/MRP/salmon/Historical_Data/docs/TrollEffTable.pdf
ksaleh
New Member
I also vote for PDF
Paulx99
Kudo Kingpin
It would be great if PBI Desktop could load PDF files - both physical and scanned.
squalleitor
New Member
Agree, sometimes you just dont have access to the nice to have CSV file. If the PDF was generated from an Excel file to begin with reverting it back would be awesome.
maurogsc
New Member
'+1
fbcideas_migusr
New Member
Since this got merged from a different thread, I just want to clarify something as the topic is not quite the same... What I'm looking for is the ability to read from a PDF. While extracting tables would be nice, my priority would be to read the PDF as a text file so that I can do my own parsing of any of the data inside. I.e. I don't want this restricted to only pulling in data that looks like a table.
fbcideas_migusr
New Member
Please bring this to Excel as well. I get this question EVERY time I teach a course on using Power Query. It's a very big need!