2024-06-18 15:53:06 +02:00
---
title: Databricks
slug: /connectors/database/databricks
---
{% connectorDetailsHeader
name="Databricks"
stage="PROD"
platform="OpenMetadata"
2025-04-03 11:37:57 +05:30
availableFeatures=["Metadata", "Query Usage", "Lineage", "Column-level Lineage", "Data Profiler", "Data Quality", "dbt", "Tags", "Sample Data", "Reverse Metadata Ingestion"]
2024-06-18 15:53:06 +02:00
unavailableFeatures=["Owners", "Stored Procedures"]
/ %}
{% note %}
As per the [documentation ](https://docs.databricks.com/en/data-governance/unity-catalog/tags.html#manage-tags-with-sql-commands ) here, note that we only support metadata `tag` extraction for databricks version 13.3 version and higher.
{% /note %}
In this section, we provide guides and references to use the Databricks connector.
Configure and schedule Databricks metadata and profiler workflows from the OpenMetadata UI:
- [Unity Catalog ](#unity-catalog )
- [Metadata Ingestion ](#metadata-ingestion )
- [Query Usage ](/connectors/ingestion/workflows/usage )
2024-09-02 10:22:51 +05:30
- [Data Profiler ](/how-to-guides/data-quality-observability/profiler/workflow )
- [Data Quality ](/how-to-guides/data-quality-observability/quality )
2025-01-03 16:07:38 +05:30
- [Lineage ](/how-to-guides/data-lineage/workflow )
2024-06-18 15:53:06 +02:00
- [dbt Integration ](/connectors/ingestion/workflows/dbt )
2025-04-09 12:28:13 +02:00
- [Reverse Metadata ](#reverse-metadata )
2025-04-04 14:33:30 +05:30
- [Troubleshooting ](/connectors/database/databricks/troubleshooting )
2024-12-12 11:34:09 +05:30
{% partial file="/v1.7/connectors/ingestion-modes-tiles.md" variables={yamlPath: "/connectors/database/databricks/yaml"} /%}
2024-06-18 15:53:06 +02:00
2024-12-12 11:34:09 +05:30
{% partial file="/v1.7/connectors/external-ingestion-deployment.md" /%}
2024-06-18 15:53:06 +02:00
## Unity Catalog
If you are using unity catalog in Databricks, then checkout the [Unity Catalog ](/connectors/database/unity-catalog ) connector.
## Metadata Ingestion
{% partial
2024-12-12 11:34:09 +05:30
file="/v1.7/connectors/metadata-ingestion-ui.md"
2024-06-18 15:53:06 +02:00
variables={
connector: "Databricks",
2024-12-12 11:34:09 +05:30
selectServicePath: "/images/v1.7/connectors/databricks/select-service.png",
addNewServicePath: "/images/v1.7/connectors/databricks/add-new-service.png",
serviceConnectionPath: "/images/v1.7/connectors/databricks/service-connection.png",
2024-06-18 15:53:06 +02:00
}
/%}
{% stepsContainer %}
{% extraContent parentTagName="stepsContainer" %}
#### Connection Details
- **Host and Port**: Enter the fully qualified hostname and port number for your Databricks deployment in the Host and Port field.
- **Token**: Generated Token to connect to Databricks.
- **HTTP Path**: Databricks compute resources URL.
- **connectionTimeout**: The maximum amount of time (in seconds) to wait for a successful connection to the data source. If the connection attempt takes longer than this timeout period, an error will be returned.
- **Catalog**: Catalog of the data source(Example: hive_metastore). This is optional parameter, if you would like to restrict the metadata reading to a single catalog. When left blank, OpenMetadata Ingestion attempts to scan all the catalog.
- **DatabaseSchema**: databaseSchema of the data source. This is optional parameter, if you would like to restrict the metadata reading to a single databaseSchema. When left blank, OpenMetadata Ingestion attempts to scan all the databaseSchema.
2024-12-12 11:34:09 +05:30
{% partial file="/v1.7/connectors/database/advanced-configuration.md" /%}
2024-06-18 15:53:06 +02:00
{% /extraContent %}
2024-12-12 11:34:09 +05:30
{% partial file="/v1.7/connectors/test-connection.md" /%}
2024-06-18 15:53:06 +02:00
2024-12-12 11:34:09 +05:30
{% partial file="/v1.7/connectors/database/configure-ingestion.md" /%}
2024-06-18 15:53:06 +02:00
2024-12-12 11:34:09 +05:30
{% partial file="/v1.7/connectors/ingestion-schedule-and-deploy.md" /%}
2024-06-18 15:53:06 +02:00
{% /stepsContainer %}
2024-12-12 11:34:09 +05:30
{% partial file="/v1.7/connectors/database/related.md" /%}
2025-04-03 11:37:57 +05:30
2025-04-09 12:28:13 +02:00
## Reverse Metadata
2025-04-03 11:37:57 +05:30
{% note %}
This feature is specific to Collate and requires the Collate Enterprise License.
{% /note %}
2025-04-09 12:28:13 +02:00
### Description Management
Databricks supports description updates at all levels:
- Database level
- Schema level
- Table level
- Column level
### Owner Management
Databricks supports owner management at the following levels:
- Database level
- Schema level
- Table level
### Tag Management
Databricks supports tag management at all levels:
- Database level
- Schema level
- Table level
- Column level
### Custom SQL Template
Databricks supports custom SQL templates for metadata changes. The template is interpreted using python f-strings.
2025-04-09 14:03:12 +02:00
Here are examples of custom SQL queries for metadata changes:
```sql
-- Set table tags
ALTER TABLE {database}.{schema}.{table} SET TAGS {tags};
```
The list of variables for custom SQL can be found [here ](/connectors/ingestion/workflows/reverse-metadata#custom-sql-template ).
2025-04-03 11:37:57 +05:30
For more details about reverse metadata ingestion, visit our [Reverse Metadata Documentation ](/connectors/ingestion/workflows/reverse-metadata ).