Prajwal214 30a091b466
Docs: Updating datalake & dbt Cloud docs (#17983)
Co-authored-by: Prajwal Pandit <prajwalpandit@Prajwals-MacBook-Air.local>
2024-09-25 10:49:44 +05:30

2.8 KiB

title slug
ADLS Datalake /connectors/database/adls-datalake

{% connectorDetailsHeader name="ADLS Datalake" stage="PROD" platform="OpenMetadata" availableFeatures=["Metadata", "Data Profiler", "Data Quality"] unavailableFeatures=["Query Usage", "Lineage", "Column-level Lineage", "Owners", "dbt", "Tags", "Stored Procedures"] / %}

In this section, we provide guides and references to use the ADLS Datalake connector.

Configure and schedule Datalake metadata and profiler workflows from the OpenMetadata UI:

{% partial file="/v1.6/connectors/ingestion-modes-tiles.md" variables={yamlPath: "/connectors/database/adls-datalake/yaml"} /%}

Requirements

{% note %} The ADLS Datalake connector supports extracting metadata from file types JSON, CSV, TSV & Parquet. {% /note %}

ADLS Permissions

To extract metadata from Azure ADLS (Storage Account - StorageV2), you will need an App Registration with the following permissions on the Storage Account:

  • Storage Blob Data Contributor
  • Storage Queue Data Contributor

Metadata Ingestion

{% partial file="/v1.6/connectors/metadata-ingestion-ui.md" variables={ connector: "Datalake", selectServicePath: "/images/v1.6/connectors/datalake/select-service.png", addNewServicePath: "/images/v1.6/connectors/datalake/add-new-service.png", serviceConnectionPath: "/images/v1.6/connectors/datalake/service-connection.png", } /%}

{% stepsContainer %} {% extraContent parentTagName="stepsContainer" %}

Connection Details for Azure

  • Azure Credentials

    • Client ID : Client ID of the data storage account
    • Client Secret : Client Secret of the account
    • Tenant ID : Tenant ID under which the data storage account falls
    • Account Name : Account Name of the data Storage
  • Required Roles

    Please make sure the following roles associated with the data storage account.

    • Storage Blob Data Contributor
    • Storage Queue Data Contributor

The current approach for authentication is based on app registration, reach out to us on slack if you find the need for another auth system

{% partial file="/v1.6/connectors/database/advanced-configuration.md" /%}

{% /extraContent %}

{% partial file="/v1.6/connectors/test-connection.md" /%}

{% partial file="/v1.6/connectors/database/configure-ingestion.md" /%}

{% partial file="/v1.6/connectors/ingestion-schedule-and-deploy.md" /%}

{% /stepsContainer %}

{% partial file="/v1.6/connectors/troubleshooting.md" /%}

{% partial file="/v1.6/connectors/database/related.md" /%}