Advance your Data & AI career with 50 days of live learning, dataviz contests, hands-on challenges, study groups & certifications and more!
Get registeredJoin us at FabCon Atlanta from March 16 - 20, 2026, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM. Register now.
Hello everyone,
My team is in the process of implementing a Medallion architecture in Microsoft Fabric, ingesting data from multiple ERP systems. One of these systems will serve as the “master,” and all dimensions must align with a Single Version of Truth (SVOT) based on its schema.
So far, we have ingested and transformed data from one source system, which includes both Finance and Sales data. Although the core SVOT mapping logic is nearly identical across functions, there are a handful of dimensions that require function-specific handling.
Today, we maintain separate PySpark notebooks for each dataset (Finance, Sales, etc.) to apply the SVOT mapping. We’d like to consolidate this into a single, parameterized PySpark notebook that can handle all functional datasets while still accommodating their slight dimension differences.
Does anyone have recommendations on how we could design and orchestrate one reusable SVOT mapping notebook in Microsoft Fabric?
Solved! Go to Solution.
Hello @Mirdula
not official MS recommendations, but these are few which I try to follow in my deployments
Centralize SVOT Logic: Keep your mapping logic modular and in shared modules to avoid duplication.
• Parameterize Notebooks: Use parameters for dataset type, source/target paths, and configuration settings.
• Handle Function-Specific Needs: Apply conditional logic or configuration-driven mappings for each functional dataset (e.g., Finance, Sales).
• Orchestrate with Pipelines: Use Microsoft Fabric pipelines or notebook widgets to pass parameters and manage execution.
• Follow Medallion Architecture: Structure your data into Bronze (raw), Silver (SVOT-mapped), and Gold (curated) layers.
• Prioritize Maintainability: Use the DRY (Don’t Repeat Yourself) principle so updates are easy and consistent across datasets.
Hello @Mirdula
not official MS recommendations, but these are few which I try to follow in my deployments
Centralize SVOT Logic: Keep your mapping logic modular and in shared modules to avoid duplication.
• Parameterize Notebooks: Use parameters for dataset type, source/target paths, and configuration settings.
• Handle Function-Specific Needs: Apply conditional logic or configuration-driven mappings for each functional dataset (e.g., Finance, Sales).
• Orchestrate with Pipelines: Use Microsoft Fabric pipelines or notebook widgets to pass parameters and manage execution.
• Follow Medallion Architecture: Structure your data into Bronze (raw), Silver (SVOT-mapped), and Gold (curated) layers.
• Prioritize Maintainability: Use the DRY (Don’t Repeat Yourself) principle so updates are easy and consistent across datasets.
Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes!
Check out the September 2025 Fabric update to learn about new features.
User | Count |
---|---|
14 | |
7 | |
3 | |
2 | |
2 |