Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now! Learn more

Reply
Anonymous
Not applicable

Cleaning up data after PDF Conversion

Hello,

 

I needed to convert a PDF of pricing agreement to an excel and after conversion it resulted in 299 tables. 

Power2you_0-1674138937583.png

as you can see these data have turned into stacked tables. I would like to make the highlighted names as column headers and the value that belongs to each column is the row after the label. 

Power2you_1-1674139193746.png

if there were 3 tables, I was able to transform them individually and append them, but being some many tables I could not do it. 

Power2you_2-1674139742581.png

 

 Could you give me some ideas in how to go about formatting the data? This is so messy.  I watched a lot of videos yesterday I still have not figured out a solution! Thank you in advance! 

2 REPLIES 2
adudani
Super User
Super User

@Anonymous ,

 

if the requirement is 3 tables, duplicate the query 3 times.

Filter each table for the different headers using ( remove top N rows).

Promote the row you want into headers.

 

This should work if the pdf is standardize.

 

Please accept this as the solution if it resolves the question.

 

Appreciate a thumbs up if this is helpful.

Did I answer your question? Mark my post as a solution, this will help others!
If my response(s) assisted you in any way, don't forget to drop me a Kudos 🙂
Kind Regards,
Avinash
Anonymous
Not applicable

@adudani Thank you for the reply. What do you mean if the PDF is standardized?  What are the prerequisites? I am afraid that I am unable to complete the task.

 

the PDF version has 55 pages but each page follows the same PDF format. 

I have 299 tables ( microsoft reads as tables when converting ) that are stacked onto each other with repeated headers in the rows. I only used 3 tables as an example BUT I actually have 299 "tables"  that are stacked on top of each other after conversion. 

 

 

 

 

Helpful resources

Announcements
Power BI DataViz World Championships

Power BI Dataviz World Championships

The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now!

December 2025 Power BI Update Carousel

Power BI Monthly Update - December 2025

Check out the December 2025 Power BI Holiday Recap!

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.