Power BI is turning 10! Tune in for a special live episode on July 24 with behind-the-scenes stories, product evolution highlights, and a sneak peek at what’s in store for the future.
Save the dateEnhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.
I'm trying to import an XLSX file (with a size of about 300 MB) into a table using a pipline in Microsoft Fabric. There are 700k rows of data in the file. When I try to run the pipeline I get the below error. I tried to run it for the same file but with rows of data reduced to c. 1000 and it run just fine. This is the error I get when I run the pipeline for the original file:
ErrorCode=ExcelUnsupportedFormat,'Type=Microsoft.DataTransfer.Common.Shared.HybridDeliveryException,Message=Only '.xls' and '.xlsx' format is supported in reading excel file while error is ' at NPOI.OpenXml4Net.Util.ZipInputStreamZipEntrySource.FakeZipEntry..ctor(ZipEntry entry, ZipInputStream inp)> at NPOI.OpenXml4Net.Util.ZipInputStreamZipEntrySource..ctor(ZipInputStream inp)> at NPOI.OpenXml4Net.OPC.ZipPackage..ctor(Stream filestream, PackageAccess access)> at NPOI.OpenXml4Net.OPC.OPCPackage.Open(Stream in1)> at NPOI.Util.PackageHelper.Open(Stream is1)> at NPOI.XSSF.UserModel.XSSFWorkbook..ctor(Stream is1)> at Microsoft.DataTransfer.ClientLibrary.ExcelUtility.GetExcelWorkbook(String fileExtension, TransferStream stream)'.,Source=Microsoft.DataTransfer.ClientLibrary,''Type=System.IO.IOException,Message=ZIP entry size is too large,Source=NPOI.OpenXml4Net,'
Solved! Go to Solution.
Can you try using Dataflow Gen 2 to read the original file and confirm?
The issue is similar in PaaS ADF, similar thread :
https://stackoverflow.com/questions/67376837/how-to-access-power-bi-service-in-multi-tenant-scenario
Glad that it helped!!
https://learn.microsoft.com/en-us/fabric/get-started/decision-guide-pipeline-dataflow-spark
Please upvote and accept as solution as it might benefit other community members as well
Can you try using Dataflow Gen 2 to read the original file and confirm?
The issue is similar in PaaS ADF, similar thread :
https://stackoverflow.com/questions/67376837/how-to-access-power-bi-service-in-multi-tenant-scenario
Hi, I've just tried Dataflow Gen 2, as per your suggestion, and it worked fine - the data was loaded successfully to the Lakehouse.