Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Get certified in Microsoft Fabric—for free! For a limited time, get a free DP-600 exam voucher to use by the end of 2024. Register now

Reply
HamidBee
Power Participant
Power Participant

Why do I need to create an Azure ADLS Gen2 storage account when creating an Azure Synapse Analytics?

When creating an Azure Synapse resource one is forced to either create or use an existing ADLS Gen 2 storage account. I'd kindly like to ask:

1) Why do we have to create/use an ADLS Gen 2 storage account? Why can't we use an ordinary storage account?

2) Also, let's assume one did create an ADLS Gen 2 storage account and created the Synapse resource. What would happen if the storage account was deleted?

3) Lastly, why is a serverless SQL pool created by default?

1 ACCEPTED SOLUTION
HamidBee
Power Participant
Power Participant

I managed to find the answer:

 

When creating an Azure Synapse Analytics resource, you need to create or use an existing ADLS Gen 2 storage account because it is required to store the metadata for the workspace. Additionally, the storage account is used to store the data for the dedicated SQL pools and serverless SQL pools.

 

1. An ordinary storage account cannot be used because it does not support hierarchical namespace, which is required for ADLS Gen 2. ADLS Gen 2 provides a hierarchical namespace that allows you to organize your data into a hierarchy of directories and files, which is important for big data workloads.

 

2. If you delete the ADLS Gen 2 storage account associated with your Synapse workspace, you will lose all the data stored in the account, including the metadata for the workspace. This will make it impossible to access the workspace and its resources. Therefore, it is important to be careful when deleting storage accounts associated with Synapse workspaces.

 

3, A serverless SQL pool is created by default because it provides a cost-effective way to query data stored in ADLS Gen 2 without having to provision and manage dedicated resources. It allows you to pay only for the queries you run, and it automatically scales up or down based on the workload. However, you can choose to delete the serverless SQL pool if you do not need it.

 

In order to use T-SQL on a storage account, hierarchical namespace needs to be enabled so that the blobs can be organized in a way that allows for easy querying of the data using T-SQL. This is not possible with an ordinary storage account (non-Gen2) due to the blob storage structure.

 

The metadata for the workspace is stored in the ADLS Gen2 storage account because it provides a hierarchical namespace that allows for efficient organization and management of the metadata. An ordinary storage account does not provide this feature, which is why it cannot be used to store the metadata for a Synapse workspace.

 

View solution in original post

2 REPLIES 2
HamidBee
Power Participant
Power Participant

I managed to find the answer:

 

When creating an Azure Synapse Analytics resource, you need to create or use an existing ADLS Gen 2 storage account because it is required to store the metadata for the workspace. Additionally, the storage account is used to store the data for the dedicated SQL pools and serverless SQL pools.

 

1. An ordinary storage account cannot be used because it does not support hierarchical namespace, which is required for ADLS Gen 2. ADLS Gen 2 provides a hierarchical namespace that allows you to organize your data into a hierarchy of directories and files, which is important for big data workloads.

 

2. If you delete the ADLS Gen 2 storage account associated with your Synapse workspace, you will lose all the data stored in the account, including the metadata for the workspace. This will make it impossible to access the workspace and its resources. Therefore, it is important to be careful when deleting storage accounts associated with Synapse workspaces.

 

3, A serverless SQL pool is created by default because it provides a cost-effective way to query data stored in ADLS Gen 2 without having to provision and manage dedicated resources. It allows you to pay only for the queries you run, and it automatically scales up or down based on the workload. However, you can choose to delete the serverless SQL pool if you do not need it.

 

In order to use T-SQL on a storage account, hierarchical namespace needs to be enabled so that the blobs can be organized in a way that allows for easy querying of the data using T-SQL. This is not possible with an ordinary storage account (non-Gen2) due to the blob storage structure.

 

The metadata for the workspace is stored in the ADLS Gen2 storage account because it provides a hierarchical namespace that allows for efficient organization and management of the metadata. An ordinary storage account does not provide this feature, which is why it cannot be used to store the metadata for a Synapse workspace.

 

@HamidBee Thanks for sharing the solution with the community and accepting the solution.

 

Regards

Geetha

Helpful resources

Announcements
November Carousel

Fabric Community Update - November 2024

Find out what's new and trending in the Fabric Community.

Live Sessions with Fabric DB

Be one of the first to start using Fabric Databases

Starting December 3, join live sessions with database experts and the Fabric product team to learn just how easy it is to get started.

November Update

Fabric Monthly Update - November 2024

Check out the November 2024 Fabric update to learn about new features.

Las Vegas 2025

Join us at the Microsoft Fabric Community Conference

March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early Bird pricing ends December 9th.