Join us at FabCon Atlanta from March 16 - 20, 2026, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.
Register now!Calling all Data Engineers! Fabric Data Engineer (Exam DP-700) live sessions are back! Starting October 16th. Sign up.
I have a folder full of PDFs. I'm importing and appending the tables from those PDFs - but double quote (") are causing additional columns to be generated.
Background:
I want to extract from all the files in the folder, [Data Table 1] & [Data Table 2] in the below screenshot.
When Power Query interprets the file:
Currently, I'm importing the folder of PDFs 1x, then referencing it 2x using the built-in functionality
1 time to reference [Data Table 1], and 1 time to reference [Data Table 2]
The issue is sometimes the first column of [Data Table 1] or [Data Table 2] has a " in it, which Power Query then interprets as a new column break. Well, so far it's only occured in [Data Table 2], but I'm sure " will pop up in [Data Table 1] eventually.
e.g. - Power Query when no " are present
And one with ", - an additional column is generated
Due to the additional column, it naturally messes up the Append function.
How can I work around this? I was thinking:
Not sure what the best course is though to avoid that extra column messing up the Append function (nor how to do any of the above!)
Hi,
In PowerQuery, use one of the correct PDFs without quotation marks, Ctrl+Click on all field names that you want to keep. Then right-click and select Remove Other Columns. This will remove any extra columns that are added in the future that is not in your selected set of columns. I hope this helps.
Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes!
Check out the September 2025 Power BI update to learn about new features.