Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now! Learn more

Reply
Robjr
New Member

Appending 419 pages

I have a single pdf report (consisting of 419 pages) that has been imported into PQ and the page appended.  6600 rows and 15 columns. Using Windows 10.  Fast processor and 64 mb or ram.  Even so, it takes a LONG time to update.  Is there a way I can speed this up?  Do I need to break this up into multiple appends and then append those?  

 

Any help is greatly appreciated.

1 ACCEPTED SOLUTION
Akash_Varuna
Super User
Super User

Hi @Robjr Try these please 

  • Filter pages and columns early in the query to limit data.
  • Split the PDF into smaller files, process separately, and append later.
  • Use Table.Buffer to reduce repeated data loads:
    BufferedTable = Table.Buffer(YourTable)
  • Simplify the query by removing unnecessary steps and filtering early.
  • You could also Pre - Process the pdf using Python or any other compatable tool
    If this post helped please do give a kudos and accept this as a solution
    Thanks In Advance

View solution in original post

3 REPLIES 3
Omid_Motamedise
Super User
Super User

To speed up your Power Query process with a large PDF (419 pages, 6600 rows, 15 columns), consider these optimizations: 1) Load only necessary columns to reduce data size. 2) Use Table.Buffer() after key transformation steps to store intermediate results in memory. 3) Disable "Enable Data Preview" under Query Options. 4) If merging queries, ensure they're optimized with indexed keys. 5) Load the PDF in smaller batches, process them separately, and then append them. 6) Use a folder-based approach instead of a single PDF file if possible. These steps should improve performance significantly.


If my answer helped solve your issue, please consider marking it as the accepted solution.
Akash_Varuna
Super User
Super User

Hi @Robjr Try these please 

  • Filter pages and columns early in the query to limit data.
  • Split the PDF into smaller files, process separately, and append later.
  • Use Table.Buffer to reduce repeated data loads:
    BufferedTable = Table.Buffer(YourTable)
  • Simplify the query by removing unnecessary steps and filtering early.
  • You could also Pre - Process the pdf using Python or any other compatable tool
    If this post helped please do give a kudos and accept this as a solution
    Thanks In Advance

Thank you for your sugggestions.  I had already incorporated several of them but the Table.Buffer approach was unknown to me.  I found a couple of good instructional videos on YouTube (e.g., from APQ16) that guided me through the process.

Helpful resources

Announcements
Power BI DataViz World Championships

Power BI Dataviz World Championships

The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now!

December 2025 Power BI Update Carousel

Power BI Monthly Update - December 2025

Check out the December 2025 Power BI Holiday Recap!

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.

Top Solution Authors