Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now! Learn more

Reply
nidheshtiwari
Frequent Visitor

Extract PDF tables and append them but the column name & Structure is inconsistent

Hi,

I am trying to extract the tables from more than 100 pdf files and wish to append the tables but facing few challenges i.e.

1. One pdf files have multiple tables

2. Column names are inconsistent  for example -  Table1 (Column - Age)   Table2 (Column - Owner Age)

3. Table structure is also inconsistent i.e. some tables have 5 columns while few have 6 columns. So while extracting the data, miss the information of extra column.

 

Please share if we can resolve this via power bi.

Thanks

1 REPLY 1
Ehren
Microsoft Employee
Microsoft Employee

Is the data inconsistently structured in the source files? Or is Power BI extracting the data inconsistently?

 

If the source files themselves are inconsistent, you can write some conditional cleanup logic in M to make them consistent. For example, the following logic renames the "Owner Age" column to "Age", but only if such a column exists:

 

if Table.HasColumns(myTable, {"Owner Age"}) then Table.RenameColumns(myTable, {{"Owner Age", "Age"}}) else myTable

Helpful resources

Announcements
Power BI DataViz World Championships

Power BI Dataviz World Championships

The Power BI Data Visualization World Championships is back! Get ahead of the game and start preparing now!

December 2025 Power BI Update Carousel

Power BI Monthly Update - December 2025

Check out the December 2025 Power BI Holiday Recap!

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.