Migrate Azure Data Lake Storage from Gen1 to Gen2 by using the Azure portal
Date: 30/05/2022 Categories: azure

To migrate from Gen1 to Gen2 by using the Azure portal, follow these steps:

Step 1: Assess readiness

Step 2: Create a storage account with Gen2 capabilities

Step 3: Migrate data using the Azure portal

Step 4: Migrate workloads and applications

Step 1:

Verify RBAC role assignments

For Gen2, ensure that the Storage Blob Data Owner role has been assigned to your Azure Active Directory (Azure AD) user identity in the scope of the storage account, parent resource group, or subscription.

For Gen1, ensure that the Owner role has been assigned to your Azure AD identity in the scope of the Gen1 account, parent resource group, or subscription.
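The scope rule above (account, parent resource group, or subscription) can be checked offline against a list of role assignments. The following is a minimal sketch, assuming the assignments are already available as parsed JSON with `roleDefinitionName` and `scope` fields, for example as exported from `az role assignment list`. The function name and the resource IDs are hypothetical and only illustrate the idea.

```python
def has_role_in_scope(assignments, role_name, resource_id):
    """Return True if role_name is assigned on resource_id or on a parent scope.

    assignments: list of dicts with "roleDefinitionName" and "scope" keys
                 (assumed shape of exported role-assignment JSON).
    resource_id: full Azure resource ID of the storage or Gen1 account.
    """
    target = resource_id.rstrip("/").lower()
    for assignment in assignments:
        if assignment.get("roleDefinitionName") != role_name:
            continue
        scope = assignment.get("scope")
        if scope is None:
            continue
        scope = scope.rstrip("/").lower()
        # The assignment applies when its scope is the resource itself
        # or a path prefix of it (resource group, subscription, root).
        if target == scope or target.startswith(scope + "/"):
            return True
    return False


# Hypothetical IDs, for illustration only.
subscription = "/subscriptions/00000000-0000-0000-0000-000000000000"
resource_group = subscription + "/resourceGroups/my-rg"
gen2_account = (
    resource_group + "/providers/Microsoft.Storage/storageAccounts/mygen2account"
)

sample = [{"roleDefinitionName": "Storage Blob Data Owner", "scope": resource_group}]
print(has_role_in_scope(sample, "Storage Blob Data Owner", gen2_account))
```

The same helper can be reused for the Gen1 check by passing `"Owner"` as the role name and the Gen1 account's resource ID.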

Migrate Azure Data Lake Analytics workloads

Azure Data Lake Storage Gen2 doesn’t support Azure Data Lake Analytics. Azure Data Lake Analytics will be retired on February 29, 2024. If you attempt to use the Azure portal to migrate an Azure Data Lake Storage Gen1 account that is used for Azure Data Lake Analytics, it’s possible that you’ll break your Azure Data Lake Analytics workloads. You must first migrate your Azure Data Lake Analytics workloads to Azure Synapse Analytics or another supported compute platform before attempting to migrate your Gen1 account.

Step 2:

Create a storage account with Gen2 capabilities

Azure Data Lake Storage Gen2 is not a dedicated storage account or service type. It’s a set of capabilities that you can obtain by enabling the Hierarchical namespace feature of an Azure storage account. To create an account that has Gen2 capabilities, see Create a storage account to use with Azure Data Lake Storage Gen2.
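If you prefer scripting over the portal for this step, the account can also be created with the Azure CLI by enabling the hierarchical namespace at creation time. The sketch below only builds and prints the command so you can review it before running it. The account name, resource group, and region are hypothetical placeholders, and the SKU default is an assumption you should adjust.

```python
import re
import shlex

# Storage account names must be 3-24 characters, lowercase letters and digits.
_ACCOUNT_NAME = re.compile(r"^[a-z0-9]{3,24}$")


def build_create_gen2_account_command(name, resource_group, location,
                                      sku="Standard_LRS"):
    """Build an `az storage account create` command with Gen2 capabilities."""
    if not _ACCOUNT_NAME.match(name):
        raise ValueError(
            "storage account name must be 3-24 lowercase letters or digits"
        )
    return [
        "az", "storage", "account", "create",
        "--name", name,
        "--resource-group", resource_group,
        "--location", location,
        "--sku", sku,
        "--kind", "StorageV2",
        # Enabling the hierarchical namespace is what gives the account
        # its Gen2 capabilities.
        "--enable-hierarchical-namespace", "true",
    ]


if __name__ == "__main__":
    # Hypothetical values; replace with your own.
    cmd = build_create_gen2_account_command("mygen2account", "my-rg", "westeurope")
    print(shlex.join(cmd))
```

Printing the command rather than executing it keeps the sketch safe to run anywhere; pass the list to `subprocess.run` only after checking the values.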

Step 3:

Choose a migration option

Option 1: Copy data only (recommended) – Copy data from Gen1 to Gen2

  1. Sign in to the Azure portal to get started.
  2. Locate your Data Lake Storage Gen1 account and display the account overview.
  3. Select the Migrate data button.
  4. Select Copy data to a new Gen2 account.
  5. Give Microsoft consent to perform the data migration by selecting the checkbox. Then, select the Apply button.

Option 2: Perform a complete migration

  1. Sign in to the Azure portal to get started.
  2. Locate your Data Lake Storage Gen1 account and display the account overview.
  3. Select the Migrate data button.
  4. Select Complete migration to a new Gen2 account.
  5. Give Microsoft consent to perform the data migration by selecting the checkbox. Then, select the Apply button.

Step 4:

Migrate workloads and applications

  1. Configure services in your workloads to point to your Gen2 endpoint. For links to articles that help you configure Azure Databricks, HDInsight, and other Azure services to use Gen2, see Azure services that support Azure Data Lake Storage Gen2.
  2. Update scripts to use the Data Lake Storage Gen2 PowerShell cmdlets and Azure CLI commands.
  3. Search for URI references that contain the string adl:// in code files, Databricks notebooks, Apache Hive HQL files, or any other file used as part of your workloads. Replace these references with the Gen2-formatted URI of your new storage account. For example, the Gen1 URI adl://mydatalakestore.azuredatalakestore.net/mydirectory/myfile might become abfss://<file-system>@<storage-account>.dfs.core.windows.net/mydirectory/myfile, where <file-system> is the container name and <storage-account> is the name of the Gen2 account.
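The search-and-replace in the last step can be sketched in a few lines of Python. This is a rough illustration, not a complete migration tool: the regular expression only recognises URIs on the azuredatalakestore.net host, and the container and account names passed in are hypothetical values that you would replace with your own. Review every rewritten reference before committing it, since a URI followed directly by punctuation may be matched too greedily.

```python
import re

# Matches adl://<account>.azuredatalakestore.net followed by an optional path.
# The path stops at whitespace, quotes, or angle brackets.
ADL_URI = re.compile(
    r"adl://([a-z0-9-]+)\.azuredatalakestore\.net(/[^\s'\"<>]*)?",
    re.IGNORECASE,
)


def find_adl_uris(text):
    """Return every Gen1 adl:// URI found in text, in order of appearance."""
    return [match.group(0) for match in ADL_URI.finditer(text)]


def rewrite_adl_uris(text, container, account):
    """Replace each Gen1 adl:// URI in text with the Gen2 abfss:// form.

    container and account are the Gen2 container (file system) and storage
    account names; both are supplied by the caller.
    """
    def to_abfss(match):
        path = match.group(2) or ""
        return f"abfss://{container}@{account}.dfs.core.windows.net{path}"

    return ADL_URI.sub(to_abfss, text)


if __name__ == "__main__":
    line = 'df = spark.read.csv("adl://mydatalakestore.azuredatalakestore.net/mydirectory/myfile")'
    print(find_adl_uris(line))
    # Hypothetical Gen2 names, for illustration only.
    print(rewrite_adl_uris(line, container="myfilesystem", account="mygen2account"))
```

Running `find_adl_uris` over each file first gives you a list of references to review, and `rewrite_adl_uris` can then be applied file by file once the target container and account are confirmed.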