High Concurrency mode for notebooks in pipelines is now generally available (GA)! This powerful feature enhances enterprise data ingestion and transformation by optimizing session sharing within one of the most widely used orchestration mechanisms. With this release, we’re also introducing Comprehensive Monitoring for High-Concurrency Spark Applications, bringing deeper visibility and control to your workloads.
High Concurrency mode dramatically improves the session start experience, reducing session start time to ~5 seconds for notebooks that join a shared session. This is approximately 30 times faster than starting a separate Spark session for each notebook, leading to significant performance improvements in pipeline execution.
We’ve introduced session tags, allowing users to efficiently assign notebooks to specific High Concurrency sessions for better organization and resource utilization.
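As a rough mental model of how tags partition notebooks into sessions, here is a minimal Python sketch. It is not the Fabric API: the function `group_by_session_tag`, the notebook names, the tag values, and the behavior for untagged notebooks (placing them in one `default` group) are all assumptions made for illustration.

```python
from collections import defaultdict


def group_by_session_tag(notebook_steps, default_tag="default"):
    """Group notebook names by session tag.

    notebook_steps: list of (notebook_name, tag_or_None) pairs.
    Hypothetical model only: untagged notebooks are assumed to share
    one "default" group, which may differ from actual Fabric behavior.
    """
    sessions = defaultdict(list)
    for name, tag in notebook_steps:
        sessions[tag or default_tag].append(name)
    return dict(sessions)


# Hypothetical pipeline steps and tags.
steps = [
    ("ingest_orders", "ingestion"),
    ("ingest_customers", "ingestion"),
    ("build_report", "reporting"),
    ("cleanup", None),
]

sessions = group_by_session_tag(steps)
for tag, notebooks in sessions.items():
    print(f"{tag}: {notebooks}")
```

In this model, notebooks with the same tag land in the same group, which mirrors the idea of steering related notebooks into the same High Concurrency session.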
By sharing a single session across multiple notebooks, High Concurrency mode reduces both compute costs and execution times. You pay for only a single session, and because fewer sessions are started, queuing during peak usage hours is reduced, ensuring a smoother workflow.
For example, consider a pipeline with five notebook steps, each taking 5 minutes to execute. Traditionally, each step requires its own Spark session startup (3 minutes each), for a total runtime of ~40 minutes (5 × (5 + 3)). With High Concurrency mode, only one session startup is needed, so the total drops to ~28 minutes (3 + 5 × 5), a 30% reduction in runtime.
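The arithmetic behind this example can be checked with a short Python sketch. It assumes the notebook steps run sequentially, that a shared session pays one full 3-minute startup, and that subsequent attach time is negligible; the function name `pipeline_runtime_minutes` is made up for illustration.

```python
def pipeline_runtime_minutes(num_notebooks, run_minutes, startup_minutes, shared_session):
    """Estimate total runtime for sequential notebook steps.

    Assumption: with a shared session only one startup is paid;
    otherwise every notebook pays its own startup.
    """
    startups = 1 if shared_session else num_notebooks
    return num_notebooks * run_minutes + startups * startup_minutes


# Figures from the example: 5 notebooks, 5 minutes each, 3-minute startup.
traditional = pipeline_runtime_minutes(5, 5, 3, shared_session=False)
shared = pipeline_runtime_minutes(5, 5, 3, shared_session=True)
improvement = (traditional - shared) / traditional

print(f"Traditional: {traditional} min, shared session: {shared} min")
print(f"Runtime reduction: {improvement:.0%}")
```

The savings grow with the number of notebook steps, since each additional step avoids one more session startup.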
With the GA release, we are introducing Comprehensive Monitoring for High-Concurrency Spark Applications, providing enhanced visibility and logging capabilities.
Understand the performance of individual notebook steps by mapping Spark jobs and execution stages directly to their originating notebooks. This granular insight enables better debugging and performance tuning.
Each notebook now maintains its own distinct logs, making it easier to track execution progress, troubleshoot errors, and analyze results without interference from other notebooks sharing the same session.
Snapshots of all notebooks involved in a High Concurrency session are now captured, so users can debug both active and historical executions, explore statement-level results, and understand past runs with greater clarity.
To enable High Concurrency mode for your Fabric Spark workspace, follow these steps:
Once enabled, all Spark sessions triggered by notebook steps within a pipeline will be automatically packed into a shared High Concurrency session, boosting efficiency and performance.
By adopting High Concurrency mode, you can achieve faster session starts, lower compute costs, and shorter pipeline runtimes.
To learn more, please refer to our documentation, High Concurrency Mode for notebooks in pipelines.
For more information on High Concurrency mode, check out the Overview of High Concurrency Mode in Microsoft Fabric documentation and join the conversation on the Fabric Community.