
Extract PDF tables and append them when column names and structure are inconsistent
Hi,
I am trying to extract tables from more than 100 PDF files and append them, but I am facing a few challenges:
1. A single PDF file can contain multiple tables.
2. Column names are inconsistent. For example, Table1 has a column named "Age", while Table2 has "Owner Age".
3. Table structure is also inconsistent: some tables have 5 columns while others have 6, so the data in the extra column is lost during extraction.
Please let me know whether this can be resolved in Power BI.
Thanks

Is the data inconsistently structured in the source files? Or is Power BI extracting the data inconsistently?
If the source files themselves are inconsistent, you can write some conditional cleanup logic in M to make them consistent. For example, the following logic renames the "Owner Age" column to "Age", but only if such a column exists:
if Table.HasColumns(myTable, {"Owner Age"}) then Table.RenameColumns(myTable, {{"Owner Age", "Age"}}) else myTable
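
The same idea can be extended to the column-count problem. The sketch below is not a complete solution. It uses two hypothetical sample tables (Table1, Table2) in place of real PDF output, and an assumed rename list (Renames) that you would adapt to your own files. It relies on two documented behaviors: Table.RenameColumns with MissingField.Ignore skips rename pairs whose source column is absent, and Table.Combine returns the union of all columns, filling missing values with null.

```
let
    // Hypothetical sample tables standing in for tables extracted from PDFs
    Table1 = #table({"Name", "Age"}, {{"Ann", 34}}),
    Table2 = #table({"Name", "Owner Age", "City"}, {{"Bob", 51, "Pune"}}),

    // Assumed mapping of variant column names to one standard name
    Renames = {{"Owner Age", "Age"}},

    // MissingField.Ignore skips any pair whose source column does not exist,
    // so the same rename list can be applied to every table
    Normalize = (t as table) as table =>
        Table.RenameColumns(t, Renames, MissingField.Ignore),

    // Table.Combine keeps the union of columns; a table lacking a column
    // (here, Table1 has no "City") gets null in that column
    Combined = Table.Combine(List.Transform({Table1, Table2}, Normalize))
in
    Combined
```

With the sample data, the result should have the columns Name, Age, and City, with a null City for the row from Table1, so the sixth column is kept rather than dropped. One caveat: if a single table already contains both "Age" and "Owner Age", the rename would produce a duplicate column name and fail, so that case would need separate handling.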
