Hi,
I am trying to extract tables from more than 100 PDF files and append them, but I am facing a few challenges:
1. A single PDF file can contain multiple tables.
2. Column names are inconsistent. For example, Table1 has a column "Age" while Table2 has a column "Owner Age".
3. Table structure is also inconsistent: some tables have 5 columns while others have 6, so the data in the extra column is lost during extraction.
Please let me know whether this can be resolved in Power BI.
Thanks
Is the data inconsistently structured in the source files? Or is Power BI extracting the data inconsistently?
If the source files themselves are inconsistent, you can write some conditional cleanup logic in M to make them consistent. For example, the following logic renames the "Owner Age" column to "Age", but only if such a column exists:
if Table.HasColumns(myTable, {"Owner Age"}) then Table.RenameColumns(myTable, {{"Owner Age", "Age"}}) else myTable
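To show how that conditional rename could fit into appending tables with different column counts, here is a minimal sketch in M. The two sample tables and the helper name NormalizeColumns are hypothetical stand-ins for whatever your PDF extraction step returns; only the "Age" / "Owner Age" names come from your description.

```powerquery
let
    // Hypothetical sample tables standing in for tables extracted from the PDFs
    Table1 = #table({"Name", "Age"}, {{"A", 30}}),
    Table2 = #table({"Name", "Owner Age", "City"}, {{"B", 41, "Pune"}}),

    // Hypothetical helper: rename "Owner Age" to "Age" only where that column exists
    NormalizeColumns = (t as table) as table =>
        if Table.HasColumns(t, {"Owner Age"})
        then Table.RenameColumns(t, {{"Owner Age", "Age"}})
        else t,

    // Apply the cleanup to every table before appending
    Normalized = List.Transform({Table1, Table2}, NormalizeColumns),

    // Table.Combine keeps the union of all columns and fills
    // missing values with null, so the extra column is not dropped
    Combined = Table.Combine(Normalized)
in
    Combined
```

With these sample inputs, the result should have the columns Name, Age, and City, with a null City for the row that came from the narrower table. If the rename is the only cleanup needed, passing MissingField.Ignore as the third argument of Table.RenameColumns is another way to skip tables that lack the column.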