Anonymous

Cleaning up data after PDF Conversion

Hello,

 

I needed to convert a PDF of a pricing agreement to Excel, and the conversion resulted in 299 tables.

Power2you_0-1674138937583.png

As you can see, the data has turned into stacked tables. I would like to use the highlighted names as column headers, with the value for each column taken from the row after its label.

Power2you_1-1674139193746.png

With only 3 tables, I was able to transform them individually and append them, but with so many tables I could not do it.

Power2you_2-1674139742581.png

 

Could you give me some ideas on how to go about formatting the data? This is so messy. I watched a lot of videos yesterday and still have not figured out a solution! Thank you in advance!

2 REPLIES
adudani
Super User

@Anonymous ,

 

If the requirement is 3 tables, duplicate the query 3 times.

In each copy, use Remove Top Rows to remove everything above the header row that copy needs.

Promote the row you want into headers.

 

This should work if the PDF is standardized.
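
A minimal sketch of the two steps above (remove the top N rows, then promote the next row to headers), written in plain Python rather than Power Query. The function name `skip_and_promote` and the sample rows are made up for illustration and do not come from the actual PDF.

```python
def skip_and_promote(rows, skip):
    """Drop the first `skip` rows, then use the next row as column headers."""
    remaining = rows[skip:]
    if not remaining:
        return []
    header, *data = remaining
    # Pair each data cell with the header cell in the same position.
    return [dict(zip(header, row)) for row in data]


# Hypothetical rows standing in for one converted table.
sample_rows = [
    ["Pricing Agreement", "", ""],  # title row to remove
    ["Item", "Unit", "Price"],      # row to promote to headers
    ["Widget", "EA", "4.50"],
    ["Gadget", "BX", "12.00"],
]

records = skip_and_promote(sample_rows, skip=1)
```

The same `skip` value only works across tables when every table has the same number of rows above its header, which is one way a fixed "remove top N rows" step can depend on the layout being consistent.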

 

Please accept this as the solution if it resolves the question.

 

Appreciate a thumbs up if this is helpful.

Kind Regards,
Avinash
Anonymous

@adudani Thank you for the reply. What do you mean by "if the PDF is standardized"? What are the prerequisites? I am afraid that I am unable to complete the task.

 

The PDF has 55 pages, and each page follows the same format.

I have 299 tables (Microsoft reads them as tables when converting) that are stacked on top of each other, with repeated headers in the rows. I only used 3 tables as an example, but I actually have 299 "tables" stacked on top of each other after conversion.
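
For stacked tables with repeated header rows like this, one possible approach is a single pass that treats every header row as the start of a new block, so nothing has to be duplicated per table. The sketch below is plain Python, not Power Query. The names `unstack_tables`, `looks_like_header`, and `HEADER_LABELS`, along with the sample rows, are hypothetical, and it assumes a header row can be recognized by the label in its first cell.

```python
def unstack_tables(rows, is_header):
    """Turn stacked tables with repeated header rows into a list of records."""
    records = []
    header = None
    for row in rows:
        if not any(cell.strip() for cell in row):
            continue  # skip blank separator rows
        if is_header(row):
            # A header row starts a new block; remember its labels.
            header = [cell.strip() for cell in row]
        elif header is not None:
            # A value row: pair each cell with the current header, ignoring unnamed columns.
            records.append({k: v for k, v in zip(header, row) if k})
    return records


# Assumption: header rows are identified by a known label in the first cell.
HEADER_LABELS = {"Customer", "Item"}


def looks_like_header(row):
    return row[0].strip() in HEADER_LABELS


# Hypothetical stacked rows, standing in for the converted sheet.
stacked = [
    ["Customer", "Contract", ""],
    ["Acme", "C-001", ""],
    ["Item", "Unit", "Price"],
    ["Widget", "EA", "4.50"],
    ["", "", ""],
    ["Item", "Unit", "Price"],
    ["Gadget", "BX", "12.00"],
]

records = unstack_tables(stacked, looks_like_header)
```

Because the header is detected per row rather than by position, the same pass handles 3 blocks or 299, as long as the header labels are consistent from page to page.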

 

 

 

 
