This is best Fabric, Power BI, SQL and AI community event. How do we know? The last event sold out! Save €200 with code FABCMTY200.
Register nowA new Data Days event is coming soon! This time we’re going bigger than ever. Fabric, Power BI, SQL, AI and more. Don't miss out.
OneLake is a single, unified, logical data lake for your whole organization. OneLake comes automatically with every Microsoft Fabric tenant and is designed to be the single place for all your analytics data. You can access your data in OneLake through any API, SDK, or tool compatible with Azure Blob Storage or Azure Data Lake Storage (ADLS) just by using a OneLake URI instead. OneLake supports the same APIs as ADLS and Azure Blob Storage.
BlobFuse is a virtual file system driver that enables you to mount Azure Storage as a file system to Linux-based virtual machines. It uses the libfuse open-source library (fuse3) to communicate with the Linux FUSE kernel module and implements the filesystem operations using the Azure Storage REST APIs. Because OneLake supports these APIs, BlobFuse works with OneLake!
OneLake unifies data via shortcuts and applies OneLake security end‑to‑end, so you can bring external data into a single, logical namespace and control access consistently. Shortcuts currently reference internal OneLake locations as well as ADLS Gen2, Amazon S3, S3‑compatible stores, Google Cloud Storage (GCS), Microsoft Dataverse, and Azure Blob Storage—with no data movement. When you mount OneLake with BlobFuse, those shortcut folders surface just like regular folders, with the correct permissions, in the OneLake path.
With BlobFuse, you can mount OneLake directly onto a virtual machine as a filesystem.
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
Ubuntu 24.04.2
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
AI-generated content may be incorrect." class="wp-image-27203" />
Unless you plan to run compute-heavy operations on the VM, you do not require a large VM.
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
AI-generated content may be incorrect." class="wp-image-27204" />
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
# Logger configuration
logging:
type: base
level: log_debug
file-path: /home/azureuser/ols-logs.log
# Pipeline configuration
components:
- libfuse
- attr_cache
- azstorage
# Azure storage configuration
libfuse:
direct-io: true
negative-entry-expiration-sec: 0
allow-other: true
uid: 1000
gid: 1000
streaming:
enabled: true
block_size_mb: 4
parallelism: 4
attr_cache:
entry_timeout: 240
negative_timeout: 120
no-symlinks: true
timeout-sec: 0
cleanup-on-start: true
azstorage:
type: adls
account-name: onelake
container: <replace-with-fabric-workspace-id>
subdirectory: <replace-with-fabric-lakehouse-id>/Files
max-concurrency: 20
endpoint: onelake.dfs.fabric.microsoft.com
mode: spn
tenantid: <redacted>
clientid: <redacted>
clientsecret: <redacted>
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
AI-generated content may be incorrect." class="wp-image-27208" />
blobfuse2 mount ./ols-files/ --config-file=ols-ms-config.yaml
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
AI-generated content may be incorrect." class="wp-image-27211" />
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
AI-generated content may be incorrect." class="wp-image-27212" />
Note – performance of dd is dependent on resources available on the VM.
Note – network bandwidth on Azure compute is generally a function of the size (SKU) of the Azure VM. While OneLake offers large network bandwidth and throughput, writes to OneLake or reads from OneLake via an Azure VM will be constrained by the network bandwidth a VM can support. Please refer to Azure compute size
Note – Latency is dependent on the region of the VM and the region where the Fabric capacity is provisioned. In this case, the VM and capacity are NOT in the same region.
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
Mount_Microsoft_OneLake_on_Linux_VMs_with_BlobFuse
AI-generated content may be incorrect." class="wp-image-27214" />
While BlobFuse makes it easy to mount OneLake as a local filesystem, it’s important to note a few limitations. BlobFuse is not fully POSIX-compliant—certain operations, such as atomic renames or concurrent writes, may behave differently than on a traditional disk. For most analytics and data engineering scenarios, these differences are minor, but it’s worth designing your workflows with them in mind.
For performance, choose the access pattern that matches your workload. Local caching can be worth enabling if you expect random access and repeated reads of the same data; otherwise, consider streaming to avoid unnecessary disk overhead for large, sequential transfers. The right choice depends on your mix of rereads vs. one‑pass flows and the VM’s local storage characteristics.
OneLake’s API compatibility means you can use familiar tools and patterns—like BlobFuse—without major changes to your applications or infrastructure. Whether you’re moving data into Fabric for analytics, sharing files across VMs, or integrating with existing pipelines, the process feels natural and straightforward. This example highlights how OneLake’s design lets you leverage your existing skills and workflows, making the transition to Fabric seamless for both developers and data engineers.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.