<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Copy job to on-prem Oracle fails if source has more rows than batch size in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4257490#M4789</link>
    <description>&lt;P&gt;Hello&amp;nbsp;@Anonymous&lt;/a&gt;,&amp;nbsp;thank you for your suggestion. Unfortunately, I cannot see a solution to the problem in there.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 25 Oct 2024 06:28:18 GMT</pubDate>
    <dc:creator>AwadFabric</dc:creator>
    <dc:date>2024-10-25T06:28:18Z</dc:date>
    <item>
      <title>Copy job to on-prem Oracle fails if source has more rows than batch size</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4256490#M4782</link>
      <description>&lt;P&gt;Hello everybody,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are transitioning our data pipelines from Synapse to Fabric and have encountered an error that wasn't present in Synapse.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The pipeline that we implemented in Fabric uses a copy activity with ADLS Gen2 as source and an on-prem Oracle Database as the destination. We are experiencing the error that if the source data contains more rows than specified in the "Write batch size" option, the copy activity fails:&lt;/P&gt;&lt;PRE&gt;&lt;SPAN&gt;Failure happened on 'destination' side. ErrorCode=OracleTableNotExistError,'Type=Microsoft.DataTransfer.Common.Shared.HybridDeliveryException,Message=The specified table &amp;lt;OUR_TABLE_NAME&amp;gt; doesn't exist.,Source=Microsoft.DataTransfer.Connectors.OracleV2Core,'&lt;/SPAN&gt;&lt;/PRE&gt;&lt;P&gt;However, the copy activity actually writes data to the table with the number of rows of the specified batch size to the Oracle table before raising the error.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Since we used the same table as a destination in our Synapse pipeline, we can assume that the error is not on the database-side.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here are some screenshots with test data that was used to recreate this behavior. The test data has 100 rows of data:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;STRONG&gt;1) Working pipeline if the batch size is bigger than the source data:&lt;/STRONG&gt;&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="AwadFabric_0-1729780926177.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1188737i28E64B72797A8E83/image-size/medium?v=v2&amp;amp;px=400" role="button" title="AwadFabric_0-1729780926177.png" alt="AwadFabric_0-1729780926177.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="AwadFabric_1-1729780968172.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1188738iD4D6A716E749424F/image-size/medium?v=v2&amp;amp;px=400" role="button" title="AwadFabric_1-1729780968172.png" alt="AwadFabric_1-1729780968172.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="AwadFabric_2-1729781004234.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1188739i21406FE887BCEB46/image-size/medium?v=v2&amp;amp;px=400" role="button" title="AwadFabric_2-1729781004234.png" alt="AwadFabric_2-1729781004234.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;STRONG&gt;2) Error if batch size if less than rows in source dataset:&lt;/STRONG&gt;&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="AwadFabric_3-1729781065584.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1188742iAD8809B142540DCD/image-size/medium?v=v2&amp;amp;px=400" role="button" title="AwadFabric_3-1729781065584.png" alt="AwadFabric_3-1729781065584.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="AwadFabric_0-1729781226266.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1188745i0D7D37F9C571EFCC/image-size/medium?v=v2&amp;amp;px=400" role="button" title="AwadFabric_0-1729781226266.png" alt="AwadFabric_0-1729781226266.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="AwadFabric_5-1729781123514.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1188744i765341974083A021/image-size/medium?v=v2&amp;amp;px=400" role="button" title="AwadFabric_5-1729781123514.png" alt="AwadFabric_5-1729781123514.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Attempted solutions include:&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Enable staging in copy activity&lt;/LI&gt;&lt;LI&gt;Limit number of max. concurrent connections to 1&lt;/LI&gt;&lt;LI&gt;Set "&lt;SPAN&gt;Degree of copy parallelism" to 1&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;Trying another source data set and other source data types (tried JSON, CSV)&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;SPAN&gt;All of these attempted solutions gave the same error.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As a temporary workaround, we set the batch size to a maximum. However, this batch size seems to be capped at around one million rows, and datasets larger than this threshold still result in errors&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does anyone have insights on why this error occurs or how to fix it? Any help would be greatly appreciated!&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Thu, 24 Oct 2024 14:47:19 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4256490#M4782</guid>
      <dc:creator>AwadFabric</dc:creator>
      <dc:date>2024-10-24T14:47:19Z</dc:date>
    </item>
    <item>
      <title>Re: Copy job to on-prem Oracle fails if source has more rows than batch size</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4256768#M4784</link>
      <description>&lt;P&gt;Have you tried setting a value for the 'write batch timeout' on your destination config?&lt;BR /&gt;&lt;BR /&gt;I don't see a default value listed anywhere so I'm not sure how it will behave when it has written one batch and is waiting to write another batch. Maybe it isn't waiting at all, and closing the connection before the entire transfer is complete?&lt;BR /&gt;&lt;BR /&gt;Try setting it to something like&amp;nbsp;&lt;SPAN&gt;00:01:00 for small batch sizes, or really however long you think it should/could reasonbly take to write the number of rows in your batch.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 24 Oct 2024 18:24:57 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4256768#M4784</guid>
      <dc:creator>IntegrateGuru</dc:creator>
      <dc:date>2024-10-24T18:24:57Z</dc:date>
    </item>
    <item>
      <title>Re: Copy job to on-prem Oracle fails if source has more rows than batch size</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4256794#M4785</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/802656"&gt;@IntegrateGuru&lt;/a&gt;,&amp;nbsp;thank you for your idea. I tried it again using different times for write batch timeout on the destination. I tried 1 second, 10 seconds, 30 seconds, 1 minute, and 10 minutes for a batch size of 15 rows. However, for all of these runs, the original error occured a few seconds after the pipeline was started.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Oct 2024 18:54:29 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4256794#M4785</guid>
      <dc:creator>AwadFabric</dc:creator>
      <dc:date>2024-10-24T18:54:29Z</dc:date>
    </item>
    <item>
      <title>Re: Copy job to on-prem Oracle fails if source has more rows than batch size</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4257116#M4786</link>
      <description>&lt;P&gt;HI&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/815945"&gt;@AwadFabric&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;I'd like to suggest you take al look the following document about data factory feature limitations if they meets to your scenario:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/fabric/data-factory/data-factory-limitations#data-pipeline-resource-limits" target="_blank"&gt;Data Factory limitations overview - Microsoft Fabric | Microsoft Learn&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Xiaoxin Sheng&lt;/P&gt;</description>
      <pubDate>Fri, 25 Oct 2024 01:42:19 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4257116#M4786</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-10-25T01:42:19Z</dc:date>
    </item>
    <item>
      <title>Re: Copy job to on-prem Oracle fails if source has more rows than batch size</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4257490#M4789</link>
      <description>&lt;P&gt;Hello&amp;nbsp;@Anonymous&lt;/a&gt;,&amp;nbsp;thank you for your suggestion. Unfortunately, I cannot see a solution to the problem in there.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 25 Oct 2024 06:28:18 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4257490#M4789</guid>
      <dc:creator>AwadFabric</dc:creator>
      <dc:date>2024-10-25T06:28:18Z</dc:date>
    </item>
    <item>
      <title>Re: Copy job to on-prem Oracle fails if source has more rows than batch size</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4260040#M4861</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/815945"&gt;@AwadFabric&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Perhaps you can take a look the following link that told about the similar issue if it help with your scenario:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/answers/questions/1384469/copy-activity-successfully-loads-more-rows-than-wr" target="_blank"&gt;Copy activity successfully loads more rows than Write Batch Size in Azure pipeline - Microsoft Q&amp;amp;A&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Xiaoxin Sheng&lt;/P&gt;</description>
      <pubDate>Mon, 28 Oct 2024 07:12:19 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Copy-job-to-on-prem-Oracle-fails-if-source-has-more-rows-than/m-p/4260040#M4861</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-10-28T07:12:19Z</dc:date>
    </item>
  </channel>
</rss>

