Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Join us for an expert-led overview of the tools and concepts you'll need to become a Certified Power BI Data Analyst and pass exam PL-300. Register now.

Reply
ibmbaranski
New Member

MIME type for excel file when row count is greater than ~50K

I have a daily file I receive from a vendor, and 6 days a week it works fine in the (closed source) tool I'm sending it to. These days the row count is < 10000.

 

One day a week I get the same file with over 60K rows and my tool will not load it. It gives me an error that says:

 

MIME type mismatch for file: x.xlsx. Expected type:application/vnd.openxmlformats-officedocument.spreadsheetml.sheet, Actual type: application/x-tika-ooxml

 

Does anyone know why I would be getting this error? I'm assuming that it has to do with the number of rows, but I'm not positive. Everything else (I am assured by the vendor) is the same when generating the file - it's the same job. I am copying the file using the same method (downloading and using scp) and loading the file into the tool the same way.

 

I've used `file --mime-type` and the python magic libs and the files look to be the same from those tools. The utility that's giving me the error is closed source and I don't know how they check the MIME type.

 

Does anyone have any ideas? I'm stuck.

6 REPLIES 6
lbendlin
Super User
Super User

Are they at least admitting that they are producing OpenOffice files?  Can you ask them to send CSV instead?

I might ask for CSV - I need to see if the other team can ingest that properly.

 

The answer has been "We generate all the files the same way so it is not on our end"

lbendlin
Super User
Super User

wait, are you reading these Excel files with Python?  What made you do that?  Why not use the native Excel connector in Power Query ?

I'm not reading the files with Python. They are being delivered to me from PowerBI and I'm passing them on to the next step (via SCP) with Python.

 

The MIME type is what I'm trying to get to the bottom of.

They sent you OpenOffice files, not Excel files.

So now we are back to the original question, which is:


They are adamant that the job is the exact same every day, and that the large file is generated by the same job that generates the smaller files.

 

What would cause the MIME type to be different, the size is the obvious answer...

Helpful resources

Announcements
Join our Fabric User Panel

Join our Fabric User Panel

This is your chance to engage directly with the engineering team behind Fabric and Power BI. Share your experiences and shape the future.

June 2025 Power BI Update Carousel

Power BI Monthly Update - June 2025

Check out the June 2025 Power BI update to learn about new features.

June 2025 community update carousel

Fabric Community Update - June 2025

Find out what's new and trending in the Fabric community.