This time we’re going bigger than ever. Fabric, Power BI, SQL, AI and more. We're covering it all. You won't want to miss it.
Learn moreDid you hear? There's a new SQL AI Developer certification (DP-800). Start preparing now and be one of the first to get certified. Register now
Introduce catalog-level “join groups” in Fabric/SQL to enable multiple columns across different tables (e.g., fact and dimension tables) to share a consistent dictionary encoding for join keys. This would ensure that identical string or UUID values are assigned the same internal integer code, making joins and group-bys on these keys significantly faster and more memory-efficient.
DuckDB Feature Request I made: Shared / Common Dictionary Encoding for Join Keys includes more details on the motivation and potent...https://issuetracker.google.com/issues/442600668
-- Define a join group to align dictionary codes across tables CREATE JOIN GROUP customer_id_group ( fact_customer_daily.customer_id, fact_customer_orders_daily.customer_id ); -- Optionally rebuild or refresh the dictionary ALTER JOIN GROUP customer_id_group REBUILD;
Internally, Fabric/SQL would persist and reuse a common dictionary for the grouped columns, ensuring that equivalent string/UUID values map to identical encoded integer IDs across tables and partitions.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.