Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Join us for an expert-led overview of the tools and concepts you'll need to become a Certified Power BI Data Analyst and pass exam PL-300. Register now.

Reply
nidheshtiwari
Frequent Visitor

Extract PDF tables and append them but the column name & Structure is inconsistent

Hi,

I am trying to extract the tables from more than 100 pdf files and wish to append the tables but facing few challenges i.e.

1. One pdf files have multiple tables

2. Column names are inconsistent  for example -  Table1 (Column - Age)   Table2 (Column - Owner Age)

3. Table structure is also inconsistent i.e. some tables have 5 columns while few have 6 columns. So while extracting the data, miss the information of extra column.

 

Please share if we can resolve this via power bi.

Thanks

1 REPLY 1
Ehren
Microsoft Employee
Microsoft Employee

Is the data inconsistently structured in the source files? Or is Power BI extracting the data inconsistently?

 

If the source files themselves are inconsistent, you can write some conditional cleanup logic in M to make them consistent. For example, the following logic renames the "Owner Age" column to "Age", but only if such a column exists:

 

if Table.HasColumns(myTable, {"Owner Age"}) then Table.RenameColumns(myTable, {{"Owner Age", "Age"}}) else myTable

Helpful resources

Announcements
Join our Fabric User Panel

Join our Fabric User Panel

This is your chance to engage directly with the engineering team behind Fabric and Power BI. Share your experiences and shape the future.

June 2025 Power BI Update Carousel

Power BI Monthly Update - June 2025

Check out the June 2025 Power BI update to learn about new features.

June 2025 community update carousel

Fabric Community Update - June 2025

Find out what's new and trending in the Fabric community.