mirror of
				https://github.com/open-metadata/OpenMetadata.git
				synced 2025-10-31 02:29:03 +00:00 
			
		
		
		
	
		
			
	
	
		
			118 lines
		
	
	
		
			5.8 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
		
		
			
		
	
	
			118 lines
		
	
	
		
			5.8 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
|   | --- | ||
|  | title: Hive | ||
|  | slug: /connectors/database/hive | ||
|  | --- | ||
|  | 
 | ||
|  | # Hive
 | ||
|  | 
 | ||
|  | {% multiTablesWrapper %} | ||
|  | 
 | ||
|  | | Feature            | Status                       | | ||
|  | | :----------------- | :--------------------------- | | ||
|  | | Stage              | PROD                         | | ||
|  | | Metadata           | {% icon iconName="check" /%} | | ||
|  | | Query Usage        | {% icon iconName="cross" /%} | | ||
|  | | Data Profiler      | {% icon iconName="check" /%} | | ||
|  | | Data Quality       | {% icon iconName="check" /%} | | ||
|  | | Lineage            | Manual          | | ||
|  | | DBT                | {% icon iconName="cross" /%} | | ||
|  | | Supported Versions | Hive >= 2.0                         | | ||
|  | 
 | ||
|  | | Feature      | Status                       | | ||
|  | | :----------- | :--------------------------- | | ||
|  | | Lineage      | Manual          | | ||
|  | | Table-level  | {% icon iconName="check" /%} | | ||
|  | | Column-level | {% icon iconName="check" /%} | | ||
|  | 
 | ||
|  | {% /multiTablesWrapper %} | ||
|  | 
 | ||
|  | 
 | ||
|  | In this section, we provide guides and references to use the Hive connector. | ||
|  | 
 | ||
|  | Configure and schedule Hive metadata and profiler workflows from the OpenMetadata UI: | ||
|  | - [Requirements](#requirements) | ||
|  | - [Metadata Ingestion](#metadata-ingestion) | ||
|  | - [Data Profiler](/connectors/ingestion/workflows/profiler) | ||
|  | - [Data Quality](/connectors/ingestion/workflows/data-quality) | ||
|  | - [dbt Integration](/connectors/ingestion/workflows/dbt) | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/ingestion-modes-tiles.md" variables={yamlPath: "/connectors/database/hive/yaml"} /%} | ||
|  | 
 | ||
|  | ## Requirements
 | ||
|  | 
 | ||
|  | {%inlineCallout icon="description" bold="OpenMetadata 0.12 or later" href="/deployment"%} | ||
|  | To deploy OpenMetadata, check the Deployment guides. | ||
|  | {%/inlineCallout%} | ||
|  | 
 | ||
|  | ### Metadata
 | ||
|  | 
 | ||
|  | To extract metadata, the user used in the connection needs to be able to perform `SELECT`, `SHOW`, and `DESCRIBE` operations in the database/schema where the metadata needs to be extracted from. | ||
|  | 
 | ||
|  | ### Profiler & Data Quality
 | ||
|  | Executing the profiler workflow or data quality tests, will require the user to have `SELECT` permission on the tables/schemas where the profiler/tests will be executed. More information on the profiler workflow setup can be found [here](/connectors/ingestion/workflows/profiler) and data quality tests [here](/connectors/ingestion/workflows/data-quality). | ||
|  | 
 | ||
|  | ## Metadata Ingestion
 | ||
|  | 
 | ||
|  | {% partial  | ||
|  |   file="/v1.2.0/connectors/metadata-ingestion-ui.md"  | ||
|  |   variables={ | ||
|  |     connector: "Hive",  | ||
|  |     selectServicePath: "/images/v1.2.0/connectors/hive/select-service.png", | ||
|  |     addNewServicePath: "/images/v1.2.0/connectors/hive/add-new-service.png", | ||
|  |     serviceConnectionPath: "/images/v1.2.0/connectors/hive/service-connection.png", | ||
|  | }  | ||
|  | /%} | ||
|  | 
 | ||
|  | {% stepsContainer %} | ||
|  | {% extraContent parentTagName="stepsContainer" %} | ||
|  | 
 | ||
|  | #### Connection Details
 | ||
|  | 
 | ||
|  | - **Username**: Specify the User to connect to Hive. It should have enough privileges to read all the metadata. | ||
|  | - **Password**: Password to connect to Hive. | ||
|  | - **Host and Port**: This parameter specifies the host and port of the Hive server instance. This should be specified as a string in the format `hostname:port`. For example, you might set the hostPort parameter to `myhivehost:10000`. | ||
|  | - **Auth Options (Optional)**: The auth parameter specifies the authentication method to use when connecting to the Hive server. Possible values are `LDAP`, `NONE`, `CUSTOM`, or `KERBEROS`. If you are using Kerberos authentication, you should set auth to `KERBEROS`. If you are using custom authentication, you should set auth to `CUSTOM` and provide additional options in the `authOptions` parameter. | ||
|  | - **Kerberos Service Name**: This parameter specifies the Kerberos service name to use for authentication. This should only be specified if using Kerberos authentication. The default value is `hive`. | ||
|  | - **Database Schema**: Schema of the data source. This is optional parameter, if you would like to restrict the metadata reading to a single schema. When left blank, OpenMetadata Ingestion attempts to scan all the schemas. | ||
|  | - **Database Name**: Optional name to give to the database in OpenMetadata. If left blank, we will use default as the database name. | ||
|  | 
 | ||
|  | 
 | ||
|  | #### For MySQL Metastore Connection
 | ||
|  | 
 | ||
|  | You can also ingest the metadata using Mysql metastore. This step is optional if metastore details are not provided then we will query the hive server directly. | ||
|  | 
 | ||
|  | - **Username**: Specify the User to connect to MySQL Metastore. It should have enough privileges to read all the metadata. | ||
|  | - **Password**: Password to connect to MySQL. | ||
|  | - **Host and Port**: Enter the fully qualified hostname and port number for your MySQL Metastore deployment in the Host and Port field in the format `hostname:port`. | ||
|  | - **databaseSchema**: Enter the database schema which is associated with the metastore. | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/database/advanced-configuration.md" /%} | ||
|  | 
 | ||
|  | #### For Postgres Metastore Connection
 | ||
|  | 
 | ||
|  | You can also ingest the metadata using Postgres metastore. This step is optional if metastore details are not provided then we will query the hive server directly. | ||
|  | 
 | ||
|  | - **Username**: Specify the User to connect to Postgres Metastore. It should have enough privileges to read all the metadata. | ||
|  | - **Password**: Password to connect to Postgres. | ||
|  | - **Host and Port**: Enter the fully qualified hostname and port number for your Postgres deployment in the Host and Port field in the format `hostname:port`. | ||
|  | - **Database**: Initial Postgres database to connect to. Specify the name of database associated with metastore instance. | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/database/advanced-configuration.md" /%} | ||
|  | 
 | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/database/advanced-configuration.md" /%} | ||
|  | 
 | ||
|  | {% /extraContent %} | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/test-connection.md" /%} | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/database/configure-ingestion.md" /%} | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/ingestion-schedule-and-deploy.md" /%} | ||
|  | 
 | ||
|  | {% /stepsContainer %} | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/troubleshooting.md" /%} | ||
|  | 
 | ||
|  | {% partial file="/v1.2.0/connectors/database/related.md" /%} |