Solved: Pre-filter Fact Tables for reporting

ObungiNiels · ‎03-11-2025

Hi fellow fabricators,

I was asked to provide options for the following scenario. Our customer has some very large fact tables in their gold lakehouse. To maintain performance in their PBI reports, they want to filter the fact table based on date (for instance include only the data of the last fiscal year) prior to using the data in the report. They are using direct lake semantic models.

I checked as best as I can and came up with the following:

Set up a view on the fact table with the desired filter: This would be my preferred way. However, when my research is correct this would mean that the semantic model will fall back to DirectQuery since direct lake does not work on views.
Create a duplicate table: This would probably work, but is not ideal due to data redundancy concerns.
Switch to import mode: This way, we could leverage Power Query to realize a filter at data read-in. However, this would rule out direct lake.

So far, none of the above methods seem to suffice the requirements I got. Do you have other ideas on how to achieve this or do you see errors in my thought process? I am tahnkful for any sort of input.

Kind regards,

Niels

ObungiNiels · ‎03-20-2025

Hi there,

quick update on this.

We eventually went with the option of creating duplicate tables. We followed the following steps to ensure data integrity:

For the warehouse tables we appended a CTAS statement to the stored procedure which handles the transformations to the gold layer. The CTAS select data from the complete gold table and applies the desired filtering. Appending this to the stored procedure ensure that we are selecting the most recent version of the data.

We are hoping that the additional storage costs will be justified by faster queries on the filtered fact tables.

View solution in original post

Anonymous · ‎03-18-2025

Hi @ObungiNiels
Thank you for reaching out microsoft fabric community forum.

May I ask if you have resolved this issue? If so, please mark the helpful reply and accept it as the solution. This will be helpful for other community members who have similar problems to solve it faster.

Thank you.

ObungiNiels · ‎03-20-2025

Hi there,

quick update on this.

We eventually went with the option of creating duplicate tables. We followed the following steps to ensure data integrity:

For the warehouse tables we appended a CTAS statement to the stored procedure which handles the transformations to the gold layer. The CTAS select data from the complete gold table and applies the desired filtering. Appending this to the stored procedure ensure that we are selecting the most recent version of the data.

We are hoping that the additional storage costs will be justified by faster queries on the filtered fact tables.

Anonymous · ‎03-20-2025

Hi @ObungiNiels
Good to hear that you found a workaround. Please accept it as a solution so that it might be helpful for other community members with similar problems to solve it faster.
Thank you.