Microsoft Fabric Community Conference 2025, March 31 - April 2, Las Vegas, Nevada. Use code MSCUST for a $150 discount.
Register nowThe Power BI DataViz World Championships are on! With four chances to enter, you could win a spot in the LIVE Grand Finale in Las Vegas. Show off your skills.
I have a daily file I receive from a vendor, and 6 days a week it works fine in the (closed source) tool I'm sending it to. These days the row count is < 10000.
One day a week I get the same file with over 60K rows and my tool will not load it. It gives me an error that says:
MIME type mismatch for file: x.xlsx. Expected type:application/vnd.openxmlformats-officedocument.spreadsheetml.sheet, Actual type: application/x-tika-ooxml
Does anyone know why I would be getting this error? I'm assuming that it has to do with the number of rows, but I'm not positive. Everything else (I am assured by the vendor) is the same when generating the file - it's the same job. I am copying the file using the same method (downloading and using scp) and loading the file into the tool the same way.
I've used `file --mime-type` and the python magic libs and the files look to be the same from those tools. The utility that's giving me the error is closed source and I don't know how they check the MIME type.
Does anyone have any ideas? I'm stuck.
Are they at least admitting that they are producing OpenOffice files? Can you ask them to send CSV instead?
I might ask for CSV - I need to see if the other team can ingest that properly.
The answer has been "We generate all the files the same way so it is not on our end"
wait, are you reading these Excel files with Python? What made you do that? Why not use the native Excel connector in Power Query ?
I'm not reading the files with Python. They are being delivered to me from PowerBI and I'm passing them on to the next step (via SCP) with Python.
The MIME type is what I'm trying to get to the bottom of.
They sent you OpenOffice files, not Excel files.
So now we are back to the original question, which is:
They are adamant that the job is the exact same every day, and that the large file is generated by the same job that generates the smaller files.
What would cause the MIME type to be different, the size is the obvious answer...
March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Prices go up Feb. 11th.
If you love stickers, then you will definitely want to check out our Community Sticker Challenge!
User | Count |
---|---|
3 | |
2 | |
1 | |
1 | |
1 |