Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Calling all Data Engineers! Fabric Data Engineer (Exam DP-700) live sessions are back! Starting October 16th. Sign up.

Reply
Anonymous
Not applicable

Extract tables from word document

Hi there

Does anyone manage to scrape data from word documents via PowerBI / Power Query?

Is this doable?

Thank you so much

 

1 ACCEPTED SOLUTION
Anonymous
Not applicable

@MattAllington has a pattern to extract tables into Power BI from Word saved as HTML - see http://exceleratorbi.com.au/import-tabular-data-pdf-using-power-query/

View solution in original post

2 REPLIES 2
v-caliao-msft
Microsoft Employee
Microsoft Employee

Hi @Anonymous,

 

Yes, we can achieve this requirement. You need to leverage the fact that Microsoft Word .docx files are actually ZIP files containing a group of XML files.  We will decompress the ZIP file and parse the XML to pull information into Power Query.

 

Reference
http://www.excelandpowerbi.com/?p=201
http://www.excelandpowerbi.com/?p=146

 

Regards,

Charlie Liao

Anonymous
Not applicable

@MattAllington has a pattern to extract tables into Power BI from Word saved as HTML - see http://exceleratorbi.com.au/import-tabular-data-pdf-using-power-query/

Helpful resources

Announcements
FabCon Global Hackathon Carousel

FabCon Global Hackathon

Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes!

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.

Top Solution Authors