Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Get Fabric Certified for FREE during Fabric Data Days. Don't miss your chance! Request now

Reply
Madhumitha_V
Helper II
Helper II

How to import data from a PDF / Word Document embedded in the Excel file as a source to power BI?

Hi All,

 

The source for my Report is an Excel spreedsheet where in one of the columns, I have an embedded PDF / Word document. How can I import that PDF / Word document as an another source in the same report ? 

 

Is there any way or solution to implement this in power BI ?

 

Column with embedded PDF and Word documents in an Excel fileColumn with embedded PDF and Word documents in an Excel file

 

Please share your thoughts on this.

Any pointers on this would be of great help!

 

Thankyou in advance !!

1 ACCEPTED SOLUTION
OwenAuger
Super User
Super User

Hi @Madhumitha_V , 

 

Yes you can do this. I just tested a small example at my end.

You can treat an .xlsx file as a zip file containing the embedded files.

 

The steps I followed are:

  1. Use a function to "decompress" the contents of your .xlsx file. I used the function posted here:
    http://sql10.blogspot.com/2016/06/reading-zip-files-in-powerquery-m.html
  2. After invoking the function above, you get a table listing the files contained in the .xlsx file. The embedded files are under xl/embeddings.
    Here's my example, with the .docx file being the Word document, and the .bin file actually being a PDF.
    OwenAuger_0-1601299644765.png

     

  3. You can then click on the Binary items and transform them as you would the binary content of a standalone file.

 

Hopefully you can get this working in your case. Please post back if needed or if you would like a sample PBIX.

 

Regards,

Owen


Owen Auger
Did I answer your question? Mark my post as a solution!
Blog
LinkedIn

View solution in original post

2 REPLIES 2
OwenAuger
Super User
Super User

Hi @Madhumitha_V , 

 

Yes you can do this. I just tested a small example at my end.

You can treat an .xlsx file as a zip file containing the embedded files.

 

The steps I followed are:

  1. Use a function to "decompress" the contents of your .xlsx file. I used the function posted here:
    http://sql10.blogspot.com/2016/06/reading-zip-files-in-powerquery-m.html
  2. After invoking the function above, you get a table listing the files contained in the .xlsx file. The embedded files are under xl/embeddings.
    Here's my example, with the .docx file being the Word document, and the .bin file actually being a PDF.
    OwenAuger_0-1601299644765.png

     

  3. You can then click on the Binary items and transform them as you would the binary content of a standalone file.

 

Hopefully you can get this working in your case. Please post back if needed or if you would like a sample PBIX.

 

Regards,

Owen


Owen Auger
Did I answer your question? Mark my post as a solution!
Blog
LinkedIn

Hi @OwenAuger, Thanks for the solution. It worked in my case too.

Helpful resources

Announcements
Fabric Data Days Carousel

Fabric Data Days

Advance your Data & AI career with 50 days of live learning, contests, hands-on challenges, study groups & certifications and more!

October Power BI Update Carousel

Power BI Monthly Update - October 2025

Check out the October 2025 Power BI update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.

Top Solution Authors