The ultimate Fabric, Power BI, SQL, and AI community-led learning event. Save €200 with code FABCOMM.
Get registeredEnhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.
Hi,
My team is trying to work with branched workspaces, backed up by the git integration.
We have our development workspace, which is connected to the develop branch. Then, each developer branches off into a new workspace to develop privately in their own branch and workspace.
What we've found that works up to now:
What we are finding challenging & need help with is in our development workspace, we have a lakehouse and datawarehouse. We would like to create a View or a Stored Procedure that references a Lakehouse table. When a workspace is branched, the lakehouse is empty (as described in the documentation), so to get around this, we created some shortcuts to the development workspace Lakehouse table from a branched workspace, so that a view could be created in the branched workspace datawarehouse and then merged into the development workspace/ develop branch. First off, we got errors due to shortcuts having the same names as existing development Lakehouse tables. Therefore, we removed the shortcuts from the develop branch and decided not to commit shortcuts from branched workspaces, which could easily be done. However, we found that when new developers pulled code or branched off to a new workspace, with a view referencing a Lakehouse table, this caused errors in the new branched workspace as there was no shortcut or Lakehouse table to reference, and was just not workable.
So, I next resorted to removing the datawarehouse from the git versioning by adding the datawarehouse's repo path to the .gitignore file e.g. 'datawarehouse.Warehouse/'. This works when I have the repo open locally in VS code, however, in the Fabric UI when something is changed in the datawarehouse, we still get a Source Control Changes alert icon appear, with the option to commit. The develop branch is protected, as we'd like to use the PR process to merge code.
So it seems that the .gitignore file is not being recognised either, which is frustrating. Is this right? Or am I doing something wrong? If this option worked then we were just going to control the datawarehouse deployments with its own db project deployment.
I know in the forum and documentation, people recommend creating lakehouse tables in their branched workspaces to be referenced, however, this just does not seem feasible when you have 1000s of lakehouse tables to reference.
Any recommendations or guidance on this would be most helpful.
Thanks!
Hi @caseybks
There is no views in Fabric DataWarehouse. you can find Tables only that are generated from data lake ( apache spark ) and delta lake. You can connect with me in Linkedin ( https://www.linkedin.com/in/bhavesh-patel-472829106/ ) and I will go through each and individual steps of Delta Lake. Cheers
This is what we have done so far and it is successful outcome project.
Bronze Tables -- > Silver Tables -- > Gold Tables. All these is happening in In memory engine of Delta Lake.
Hi @BhaveshPatel, thanks for your reply, but you can create Views in Fabric datawarehouse.
A simple example is:
CREAT VIEW datawarehouse.dbo.view
AS
SELECT* FROM lakehouse.gold.table1