mirror of
https://github.com/open-metadata/OpenMetadata.git
synced 2026-01-06 12:36:56 +00:00
2.2 KiB
2.2 KiB
| description |
|---|
| This guide will help install Glue connector and run manually |
Glue Catalog
{% hint style="info" %} Prerequisites
OpenMetadata is built using Java, DropWizard, Jetty, and MySQL.
- Python 3.7 or above {% endhint %}
Install from PyPI
{% tabs %} {% tab title="Install Using PyPI" %}
pip install 'openmetadata-ingestion[glue]'
{% endtab %} {% endtabs %}
Run Manually
metadata ingest -c ./examples/workflows/glue.json
Configuration
{% code title="glue.json" %}
{
"source": {
"type": "glue",
"config": {
"aws_access_key_id": "aws_access_key_id",
"aws_secret_access_key": "aws_secret_access_key",
"db_service_name": "local_glue_db",
"pipeline_service_name": "local_glue_pipeline",
"region_name": "region_name",
"endpoint_url": "endpoint_url",
"service_name": "local_glue"
}
},
...
{% endcode %}
- aws_access_key_id - Access Key for AWS.
- aws_secret_access_key - Secret Key for AWS.
- db_service_name - Service Name for this Glue Database cluster.
- pipeline_service_name - Service Name for this Glue Pipeline cluster.
- region_name - AWS account region.
- endpoint_url - Service Endpoints from AWS.
Publish to OpenMetadata
Below is the configuration to publish Glue data into the OpenMetadata service.
Add optionally pii processor and metadata-rest sink along with metadata-server config
{% code title="glue.json" %}
{
"source": {
"type": "glue",
"config": {
"aws_access_key_id": "aws_access_key_id",
"aws_secret_access_key": "aws_secret_access_key",
"db_service_name": "local_glue_db",
"pipeline_service_name": "local_glue_pipeline",
"region_name": "region_name",
"endpoint_url": "endpoint_url",
"service_name": "local_glue"
}
},
"sink": {
"type": "metadata-rest",
"config": {}
},
"metadata_server": {
"type": "metadata-server",
"config": {
"api_endpoint": "http://localhost:8585/api",
"auth_provider_type": "no-auth"
}
}
}
{% endcode %}