<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Data Flow Gen2 ODBC connection to AWS for Incremental Refresh in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5011957#M15083</link>
    <description>&lt;P&gt;Hi, I am in higher ed and we are not currently on a Fabric license.&lt;BR /&gt;I have a query against an AWS Athena table using an AWS ODBC connection.&lt;/P&gt;&lt;P&gt;The table has upward of 17 million rows and approximately 70 columns, many of which are ID columns. I would like to incrementally update this data.&lt;/P&gt;&lt;P&gt;I have tried many ways to do this, all unsuccessful for one reason or another. Either it does not fold or it does not do an incremental refresh.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My current state:&lt;/STRONG&gt; The DSN, catalog, and table are in the source (a suggestion from troubleshooting).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Primary query:&lt;/STRONG&gt; Set to allow staging; it has no destination (per the Microsoft doc).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Reference query:&lt;/STRONG&gt; Does not allow staging; it is set to not require folding (it does in Power Query). This is where the incremental update is set. I want to refresh an entire year when the update field has changed.&lt;/P&gt;&lt;P&gt;The destination is a Lakehouse table; columns are not dynamic, and options are set to limit 10.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Parameters&lt;/STRONG&gt; are RangeStart and RangeEnd; RangeEnd points to a query that returns today's date.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would really love to hear from anyone who has had this issue with AWS and figured it out. (The dates are set as they are per another blog post that described date issues.)&lt;/P&gt;&lt;P&gt;This query runs, but it does not appear to do incremental updates. The second run, after adding additional data, takes longer than the first.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; // 1. Format dates for Athena&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; StartText = DateTime.ToText(RangeStart, "yyyy-MM-dd"),&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; EndText &amp;nbsp; = DateTime.ToText(RangeEnd, &amp;nbsp; "yyyy-MM-dd"),&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; // 2. The SQL Statement with manual folding built-in&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; FullQuery = "&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; SELECT&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; journal_line_wid,&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; as_entered_company, (lots of fields)&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; FROM edl_workday_fin.cur_journal_transactions_vw&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; WHERE accounting_date &amp;gt;= (select date '" &amp;amp; StartText &amp;amp; "') and accounting_date &amp;lt; (select date '" &amp;amp; EndText &amp;amp; "')",&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; // 3. The Actual Data Connection&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; Source = Odbc.Query("dsn=DSNName;Catalog=AwsDataCatalog;Schema=SchemaName;",&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; FullQuery&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; )&lt;/P&gt;&lt;P&gt;in&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; Source&lt;/P&gt;</description>
    <pubDate>Thu, 12 Feb 2026 22:21:05 GMT</pubDate>
    <dc:creator>Suzanne_Shuford</dc:creator>
    <dc:date>2026-02-12T22:21:05Z</dc:date>
    <item>
      <title>Data Flow Gen2 ODBC connection to AWS for Incremental Refresh</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5011957#M15083</link>
      <description>&lt;P&gt;Hi, I am in higher ed and we are not currently on a Fabric license.&lt;BR /&gt;I have a query against an AWS Athena table using an AWS ODBC connection.&lt;/P&gt;&lt;P&gt;The table has upward of 17 million rows and approximately 70 columns, many of which are ID columns. I would like to incrementally update this data.&lt;/P&gt;&lt;P&gt;I have tried many ways to do this, all unsuccessful for one reason or another. Either it does not fold or it does not do an incremental refresh.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My current state:&lt;/STRONG&gt; The DSN, catalog, and table are in the source (a suggestion from troubleshooting).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Primary query:&lt;/STRONG&gt; Set to allow staging; it has no destination (per the Microsoft doc).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Reference query:&lt;/STRONG&gt; Does not allow staging; it is set to not require folding (it does in Power Query). This is where the incremental update is set. I want to refresh an entire year when the update field has changed.&lt;/P&gt;&lt;P&gt;The destination is a Lakehouse table; columns are not dynamic, and options are set to limit 10.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Parameters&lt;/STRONG&gt; are RangeStart and RangeEnd; RangeEnd points to a query that returns today's date.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would really love to hear from anyone who has had this issue with AWS and figured it out. (The dates are set as they are per another blog post that described date issues.)&lt;/P&gt;&lt;P&gt;This query runs, but it does not appear to do incremental updates. The second run, after adding additional data, takes longer than the first.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; // 1. Format dates for Athena&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; StartText = DateTime.ToText(RangeStart, "yyyy-MM-dd"),&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; EndText &amp;nbsp; = DateTime.ToText(RangeEnd, &amp;nbsp; "yyyy-MM-dd"),&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; // 2. The SQL Statement with manual folding built-in&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; FullQuery = "&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; SELECT&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; journal_line_wid,&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; as_entered_company, (lots of fields)&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; FROM edl_workday_fin.cur_journal_transactions_vw&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; WHERE accounting_date &amp;gt;= (select date '" &amp;amp; StartText &amp;amp; "') and accounting_date &amp;lt; (select date '" &amp;amp; EndText &amp;amp; "')",&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; // 3. The Actual Data Connection&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; Source = Odbc.Query("dsn=DSNName;Catalog=AwsDataCatalog;Schema=SchemaName;",&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; FullQuery&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; )&lt;/P&gt;&lt;P&gt;in&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; Source&lt;/P&gt;</description>
      <pubDate>Thu, 12 Feb 2026 22:21:05 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5011957#M15083</guid>
      <dc:creator>Suzanne_Shuford</dc:creator>
      <dc:date>2026-02-12T22:21:05Z</dc:date>
    </item>
    <item>
      <title>Re: Data Flow Gen2 ODBC connection to AWS for Incremental Refresh</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5041279#M15132</link>
      <description>&lt;P&gt;Hi Suzanne,&lt;/P&gt;&lt;P&gt;What you’re seeing is usually expected with ODBC + “hand-built SQL” in Dataflow Gen2: the incremental refresh feature only behaves incrementally when the &lt;STRONG&gt;RangeStart/RangeEnd filter step can be query-folded back to the source&lt;/STRONG&gt;. When you inject RangeStart/RangeEnd into a SQL string (or use Odbc.Query directly), the engine often can’t detect or validate folding, so it ends up re-reading more data than you expect.&lt;/P&gt;&lt;P&gt;Here are two practical paths:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Option A (try to make it fold):&lt;/STRONG&gt;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Keep RangeStart / RangeEnd as &lt;EM&gt;Date/DateTime parameters&lt;/EM&gt; (don’t convert them to text for the main filter logic).&lt;/LI&gt;&lt;LI&gt;Apply a normal Power Query filter step on your date column first (e.g., Table.SelectRows with &amp;gt;= RangeStart and &amp;lt; RangeEnd), and only then rely on folding.&lt;/LI&gt;&lt;LI&gt;If you must use a native query, wrap it with Value.NativeQuery and pass the folding option (where supported) instead of calling Odbc.Query. This is the pattern Microsoft documents for enabling folding with native queries:&lt;UL&gt;&lt;LI&gt;Value.NativeQuery(Source, SqlText, null, [EnableFolding=true])&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;If Athena’s ODBC driver still doesn’t fold reliably, incremental refresh will not be effective (and refresh times can even grow on subsequent runs).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Option B (recommended for Athena/ODBC): do the incremental load outside Dataflow incremental refresh&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;If folding can’t be guaranteed through the Athena ODBC driver, the reliable approach is:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Land data into OneLake/Lakehouse via a pipeline/Copy activity (or another ingestion method that can push the predicate down to Athena),&lt;/LI&gt;&lt;LI&gt;Use a &lt;STRONG&gt;watermark (max accounting_date / last_updated)&lt;/STRONG&gt; and load only new/changed rows,&lt;/LI&gt;&lt;LI&gt;Then MERGE into a Lakehouse Delta table. This avoids depending on ODBC folding and gives you deterministic incremental behavior.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Also note: Dataflow Gen2 incremental refresh updates destination buckets using &lt;STRONG&gt;replace&lt;/STRONG&gt;, not append, so make sure your bucket size is appropriate (e.g., daily/monthly) and that your “change detection” column is a true last-updated timestamp if you want updates within the same date range handled correctly.&lt;/P&gt;&lt;P&gt;Docs reference for Dataflow Gen2 incremental refresh behavior (bucket replace): &lt;A href="https://learn.microsoft.com/fabric/data-factory/dataflow-gen2-incremental-refresh" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/fabric/data-factory/dataflow-gen2-incremental-refresh&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Docs reference for enabling folding on native queries (Value.NativeQuery + EnableFolding): &lt;A href="https://learn.microsoft.com/power-query/native-query-folding" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/power-query/native-query-folding&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Hope this helps. If you can confirm whether “View Native Query” is enabled or disabled on the filter step (or whether Athena ODBC supports folding for your query), we’ll know immediately whether Option A is feasible or Option B is the best route.&lt;/P&gt;</description>
      <pubDate>Tue, 17 Feb 2026 06:41:48 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5041279#M15132</guid>
      <dc:creator>bariscihan</dc:creator>
      <dc:date>2026-02-17T06:41:48Z</dc:date>
    </item>
    <item>
      <title>Re: Data Flow Gen2 ODBC connection to AWS for Incremental Refresh</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5104848#M15168</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/518586"&gt;@Suzanne_Shuford&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;Thank you for reaching out to the Microsoft Fabric Community Forum. Thanks also to&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/833169"&gt;@bariscihan&lt;/a&gt; for the input on this thread.&lt;/P&gt;
&lt;P&gt;Has your issue been resolved? If the response provided by the community member&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/833169"&gt;@bariscihan&lt;/a&gt; addressed your query, could you please confirm? It helps us ensure that the solutions provided are effective and beneficial for everyone.&lt;/P&gt;
&lt;P&gt;I hope this helps clarify things. Let me know what you find after giving these steps a try; I am happy to help you investigate further.&lt;BR /&gt;&lt;BR /&gt;Thank you for using the Microsoft Community Forum.&lt;/P&gt;
</description>
      <pubDate>Fri, 20 Feb 2026 04:49:42 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5104848#M15168</guid>
      <dc:creator>v-kpoloju-msft</dc:creator>
      <dc:date>2026-02-20T04:49:42Z</dc:date>
    </item>
    <item>
      <title>Re: Data Flow Gen2 ODBC connection to AWS for Incremental Refresh</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5110351#M15190</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/518586"&gt;@Suzanne_Shuford&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;Just wanted to follow up. If the shared guidance worked for you, that’s wonderful; hopefully it also helps others looking for similar answers. If there’s anything else you'd like to explore or clarify, don’t hesitate to reach out.&lt;/P&gt;
&lt;P&gt;Thank you.&lt;/P&gt;
</description>
      <pubDate>Mon, 23 Feb 2026 05:17:38 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Data-Flow-Gen2-ODBC-connection-to-AWS-for-Incremental-Refresh/m-p/5110351#M15190</guid>
      <dc:creator>v-kpoloju-msft</dc:creator>
      <dc:date>2026-02-23T05:17:38Z</dc:date>
    </item>
  </channel>
</rss>

