Description:
We are experiencing inconsistent row loading behavior when querying Delta tables through DeltaLake.Table() in Power Query (ADLS Gen2).
Context:
Workarounds (all return 19 rows as expected):
https://github.com/delta-io/delta/issues/1978
Expected behavior:
DeltaLake.Table() should return all 19 rows with the filter InvoiceDate > #date(2021,12,31), consistent with both Databricks and the GitHub function.
Observed behavior:
One row is missing when using DeltaLake.Table() with the > operator.
Notes:
For this reason, we are concerned about a potential issue in how DeltaLake.Table() applies filters compared with the underlying Delta engine, especially because the > filter behaves as expected with the GitHub function.
The missing row is due to a bug in DeltaLake.Table() filter folding: using #date with > causes a precision mismatch.
How to fix:
Use #datetime(2021,12,31,0,0,0) instead of #date.
Or use >=.
Or apply the filter locally in Power Query so that it is not folded to the connector.
Or use the GitHub function (works correctly).
It's a connector bug, so it is safe to log with Microsoft.
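For anyone applying these workarounds, here is a minimal sketch of the first and third options in M. The storage URL is a placeholder and the step names are made up for illustration; only the InvoiceDate column and the filter values come from this thread. The thread does not state the data type of InvoiceDate, so treat the literals as assumptions: when a filter is evaluated locally, the literal type may need to match the column type.

```powerquery
let
    // Placeholder URL (assumption): point this at the ADLS Gen2 folder that holds the Delta table.
    Files = AzureStorage.DataLake("https://<account>.dfs.core.windows.net/<container>/<table-folder>"),
    Source = DeltaLake.Table(Files),

    // Option 1: keep the > operator but compare against a datetime literal
    // instead of a date literal, as suggested above.
    FilteredWithDateTime = Table.SelectRows(
        Source,
        each [InvoiceDate] > #datetime(2021, 12, 31, 0, 0, 0)
    ),

    // Option 3: buffer the table first so the filter is evaluated locally
    // by the mashup engine rather than being folded to the connector.
    // Table.Buffer loads the whole table into memory, so this suits small tables.
    Buffered = Table.Buffer(Source),
    FilteredLocally = Table.SelectRows(
        Buffered,
        each [InvoiceDate] > #date(2021, 12, 31)
    )
in
    FilteredWithDateTime
```

Change the final `in` expression to `FilteredLocally` to try the unfolded variant, then compare the row counts of both against the 19 rows expected.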
Hi @MatthewBI ,
Just wanted to check if you got a chance to review the suggestion provided and were able to resolve the issue?
If you are still facing any problem, let us know.
Thank you @Shahid12523 for your input.
The solution seems to be what we suggested ourselves and has worked. However, I am still concerned that > filtering is not applied correctly.