Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Calling all Data Engineers! Fabric Data Engineer (Exam DP-700) live sessions are back! Starting October 16th. Sign up.

Reply
IgorM
Frequent Visitor

Error for a data copy activity in a pipeline within Microsoft Fabric - ZIP entry size is too large

I'm trying to import an XLSX file (with a size of about 300 MB) into a table using a pipline in Microsoft Fabric. There are 700k rows of data in the file. When I try to run the pipeline I get the below error. I tried to run it for the same file but with rows of data reduced to c. 1000 and it run just fine. This is the error I get when I run the pipeline for the original file:

 

ErrorCode=ExcelUnsupportedFormat,'Type=Microsoft.DataTransfer.Common.Shared.HybridDeliveryException,Message=Only '.xls' and '.xlsx' format is supported in reading excel file while error is ' at NPOI.OpenXml4Net.Util.ZipInputStreamZipEntrySource.FakeZipEntry..ctor(ZipEntry entry, ZipInputStream inp)> at NPOI.OpenXml4Net.Util.ZipInputStreamZipEntrySource..ctor(ZipInputStream inp)> at NPOI.OpenXml4Net.OPC.ZipPackage..ctor(Stream filestream, PackageAccess access)> at NPOI.OpenXml4Net.OPC.OPCPackage.Open(Stream in1)> at NPOI.Util.PackageHelper.Open(Stream is1)> at NPOI.XSSF.UserModel.XSSFWorkbook..ctor(Stream is1)> at Microsoft.DataTransfer.ClientLibrary.ExcelUtility.GetExcelWorkbook(String fileExtension, TransferStream stream)'.,Source=Microsoft.DataTransfer.ClientLibrary,''Type=System.IO.IOException,Message=ZIP entry size is too large,Source=NPOI.OpenXml4Net,'

 

 

1 ACCEPTED SOLUTION
NandanHegde
Super User
Super User

Can you try using Dataflow Gen 2 to read the original file and confirm?

The issue is similar in PaaS ADF, similar thread :

https://stackoverflow.com/questions/67376837/how-to-access-power-bi-service-in-multi-tenant-scenario

 




----------------------------------------------------------------------------------------------
Nandan Hegde (MSFT Data MVP)
LinkedIn Profile : www.linkedin.com/in/nandan-hegde-4a195a66
GitHUB Profile : https://github.com/NandanHegde15
Twitter Profile : @nandan_hegde15
MSFT MVP Profile : https://mvp.microsoft.com/en-US/MVP/profile/8977819f-95fb-ed11-8f6d-000d3a560942
Topmate : https://topmate.io/nandan_hegde
Blog :https://datasharkx.wordpress.com

View solution in original post

3 REPLIES 3
NandanHegde
Super User
Super User

Glad that it helped!! 

https://learn.microsoft.com/en-us/fabric/get-started/decision-guide-pipeline-dataflow-spark

 

Please upvote and accept as solution as it might benefit other community members as well




----------------------------------------------------------------------------------------------
Nandan Hegde (MSFT Data MVP)
LinkedIn Profile : www.linkedin.com/in/nandan-hegde-4a195a66
GitHUB Profile : https://github.com/NandanHegde15
Twitter Profile : @nandan_hegde15
MSFT MVP Profile : https://mvp.microsoft.com/en-US/MVP/profile/8977819f-95fb-ed11-8f6d-000d3a560942
Topmate : https://topmate.io/nandan_hegde
Blog :https://datasharkx.wordpress.com
NandanHegde
Super User
Super User

Can you try using Dataflow Gen 2 to read the original file and confirm?

The issue is similar in PaaS ADF, similar thread :

https://stackoverflow.com/questions/67376837/how-to-access-power-bi-service-in-multi-tenant-scenario

 




----------------------------------------------------------------------------------------------
Nandan Hegde (MSFT Data MVP)
LinkedIn Profile : www.linkedin.com/in/nandan-hegde-4a195a66
GitHUB Profile : https://github.com/NandanHegde15
Twitter Profile : @nandan_hegde15
MSFT MVP Profile : https://mvp.microsoft.com/en-US/MVP/profile/8977819f-95fb-ed11-8f6d-000d3a560942
Topmate : https://topmate.io/nandan_hegde
Blog :https://datasharkx.wordpress.com

Hi, I've just tried Dataflow Gen 2, as per your suggestion, and it worked fine - the data was loaded successfully to the Lakehouse.

Helpful resources

Announcements
FabCon Global Hackathon Carousel

FabCon Global Hackathon

Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes!

September Fabric Update Carousel

Fabric Monthly Update - September 2025

Check out the September 2025 Fabric update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.