mirror of
https://github.com/open-metadata/OpenMetadata.git
synced 2025-10-06 06:16:21 +00:00
GitBook: [#31] Release version changed from 0.5.0 to 0.6.0, added connectors
This commit is contained in:
parent
a1fc3c6ee9
commit
ed4119af2c
@ -10,7 +10,11 @@
|
||||
* [Athena](openmetadata/connectors/athena.md)
|
||||
* [BigQuery](openmetadata/connectors/bigquery.md)
|
||||
* [BigQuery Usage](openmetadata/connectors/bigquery-usage.md)
|
||||
* [Data Model](connectors/data-model/README.md)
|
||||
* [DBT](connectors/data-model/dbt.md)
|
||||
* [MariaDB](connectors/data-model/mariadb.md)
|
||||
* [ElasticSearch](openmetadata/connectors/elastic-search.md)
|
||||
* [Glue Catalog](connectors/glue-catalog.md)
|
||||
* [Hive](openmetadata/connectors/hive.md)
|
||||
* [Kafka](openmetadata/connectors/kafka.md)
|
||||
* [Looker](openmetadata/connectors/looker.md)
|
||||
|
10
docs/connectors/data-model/README.md
Normal file
10
docs/connectors/data-model/README.md
Normal file
@ -0,0 +1,10 @@
|
||||
# Data Model
|
||||
|
||||
{% content-ref url="dbt.md" %}
|
||||
[dbt.md](dbt.md)
|
||||
{% endcontent-ref %}
|
||||
|
||||
{% content-ref url="mariadb.md" %}
|
||||
[mariadb.md](mariadb.md)
|
||||
{% endcontent-ref %}
|
||||
|
90
docs/connectors/data-model/dbt.md
Normal file
90
docs/connectors/data-model/dbt.md
Normal file
@ -0,0 +1,90 @@
|
||||
---
|
||||
description: This guide will help install DBT connector and run manually
|
||||
---
|
||||
|
||||
# DBT
|
||||
|
||||
{% hint style="info" %}
|
||||
**Prerequisites**
|
||||
|
||||
OpenMetadata is built using Java, DropWizard, Jetty, and MySQL.
|
||||
|
||||
1. Python 3.7 or above
|
||||
{% endhint %}
|
||||
|
||||
### Install from PyPI
|
||||
|
||||
{% tabs %}
|
||||
{% tab title="Install Using PyPI" %}
|
||||
```bash
|
||||
pip install 'openmetadata-ingestion[dbt]'
|
||||
```
|
||||
{% endtab %}
|
||||
{% endtabs %}
|
||||
|
||||
### Run Manually
|
||||
|
||||
```bash
|
||||
metadata ingest -c ./examples/workflows/dbt.json
|
||||
```
|
||||
|
||||
### Configuration
|
||||
|
||||
{% code title="dbt.json" %}
|
||||
```javascript
|
||||
{
|
||||
"source": {
|
||||
"type": "dbt",
|
||||
"config": {
|
||||
"service_name": "bigquery",
|
||||
"service_type": "BigQuery",
|
||||
"catalog_file": "./examples/sample_data/dbt/catalog.json",
|
||||
"manifest_file": "./examples/sample_data/dbt/manifest.json",
|
||||
"run_results_file": "./examples/sample_data/dbt/run_results.json",
|
||||
"database": "shopify"
|
||||
}
|
||||
}
|
||||
...
|
||||
```
|
||||
{% endcode %}
|
||||
|
||||
1. **service\_name** - Service Name for this MySQL cluster. If you added MySQL cluster through OpenMetadata UI, make sure the service name matches the same.
|
||||
2. **catalog\_file** - It contains includes, excludes options to choose which pattern of datasets you want to ingest into OpenMetadata
|
||||
3. **manifest\_file** - Enable data-profiling (Optional). It will provide you the newly ingested data.
|
||||
4. **run\_results\_file** - Specify offset.
|
||||
5. **database** - Specify limit.
|
||||
|
||||
## Publish to OpenMetadata
|
||||
|
||||
Below is the configuration to publish DBT data into the OpenMetadata service.
|
||||
|
||||
Add optionally `pii` processor and `metadata-rest` sink along with `metadata-server` config
|
||||
|
||||
{% code title="dbt.json" %}
|
||||
```javascript
|
||||
{
|
||||
"source": {
|
||||
"type": "dbt",
|
||||
"config": {
|
||||
"service_name": "bigquery",
|
||||
"service_type": "BigQuery",
|
||||
"catalog_file": "./examples/sample_data/dbt/catalog.json",
|
||||
"manifest_file": "./examples/sample_data/dbt/manifest.json",
|
||||
"run_results_file": "./examples/sample_data/dbt/run_results.json",
|
||||
"database": "shopify"
|
||||
}
|
||||
},
|
||||
"sink": {
|
||||
"type": "metadata-rest",
|
||||
"config": {}
|
||||
},
|
||||
"metadata_server": {
|
||||
"type": "metadata-server",
|
||||
"config": {
|
||||
"api_endpoint": "http://localhost:8585/api",
|
||||
"auth_provider_type": "no-auth"
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
{% endcode %}
|
94
docs/connectors/data-model/mariadb.md
Normal file
94
docs/connectors/data-model/mariadb.md
Normal file
@ -0,0 +1,94 @@
|
||||
---
|
||||
description: This guide will help install MariaDB connector and run manually
|
||||
---
|
||||
|
||||
# MariaDB
|
||||
|
||||
{% hint style="info" %}
|
||||
**Prerequisites**
|
||||
|
||||
OpenMetadata is built using Java, DropWizard, Jetty, and MySQL.
|
||||
|
||||
1. Python 3.7 or above
|
||||
{% endhint %}
|
||||
|
||||
### Install from PyPI
|
||||
|
||||
{% tabs %}
|
||||
{% tab title="Install Using PyPI" %}
|
||||
```bash
|
||||
pip install 'openmetadata-ingestion[mysql]'
|
||||
```
|
||||
{% endtab %}
|
||||
{% endtabs %}
|
||||
|
||||
### Run Manually
|
||||
|
||||
```bash
|
||||
metadata ingest -c ./examples/workflows/mariadb.json
|
||||
```
|
||||
|
||||
### Configuration
|
||||
|
||||
{% code title="mariadb.json" %}
|
||||
```javascript
|
||||
{
|
||||
"source": {
|
||||
"type": "mariadb",
|
||||
"config": {
|
||||
"username": "openmetadata_user",
|
||||
"password": "openmetadata_password",
|
||||
"database": "openmetadata_db",
|
||||
"service_name": "local_mysql",
|
||||
"filter_pattern": {
|
||||
"excludes": ["mysql.*", "information_schema.*", "performance_schema.*", "sys.*"]
|
||||
}
|
||||
}
|
||||
},
|
||||
...
|
||||
```
|
||||
{% endcode %}
|
||||
|
||||
1. **username** - pass the MariaDB username.
|
||||
2. **password** - password for the username
|
||||
3. **service\_name** - Service Name for this MariaDB cluster. If you added MariaDB cluster through OpenMetadata UI, make sure the service name matches the same.
|
||||
4. **filter\_pattern** - It contains includes, excludes options to choose which pattern of datasets you want to ingest into OpenMetadata
|
||||
5. **data\_profiler\_enabled** - Enable data-profiling (Optional). It will provide you the newly ingested data.
|
||||
6. **data\_profiler\_offset** - Specify offset.
|
||||
7. **data\_profiler\_limit** - Specify limit.
|
||||
|
||||
## Publish to OpenMetadata
|
||||
|
||||
Below is the configuration to publish MariaDB data into the OpenMetadata service.
|
||||
|
||||
Add optionally `pii` processor and `metadata-rest` sink along with `metadata-server` config
|
||||
|
||||
{% code title="mariadb.json" %}
|
||||
```javascript
|
||||
{
|
||||
"source": {
|
||||
"type": "mariadb",
|
||||
"config": {
|
||||
"username": "openmetadata_user",
|
||||
"password": "openmetadata_password",
|
||||
"database": "openmetadata_db",
|
||||
"service_name": "local_mysql",
|
||||
"filter_pattern": {
|
||||
"excludes": ["mysql.*", "information_schema.*", "performance_schema.*", "sys.*"]
|
||||
}
|
||||
}
|
||||
},
|
||||
"sink": {
|
||||
"type": "metadata-rest",
|
||||
"config": {}
|
||||
},
|
||||
"metadata_server": {
|
||||
"type": "metadata-server",
|
||||
"config": {
|
||||
"api_endpoint": "http://localhost:8585/api",
|
||||
"auth_provider_type": "no-auth"
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
{% endcode %}
|
93
docs/connectors/glue-catalog.md
Normal file
93
docs/connectors/glue-catalog.md
Normal file
@ -0,0 +1,93 @@
|
||||
---
|
||||
description: This guide will help install Glue connector and run manually
|
||||
---
|
||||
|
||||
# Glue Catalog
|
||||
|
||||
{% hint style="info" %}
|
||||
**Prerequisites**
|
||||
|
||||
OpenMetadata is built using Java, DropWizard, Jetty, and MySQL.
|
||||
|
||||
1. Python 3.7 or above
|
||||
{% endhint %}
|
||||
|
||||
### Install from PyPI
|
||||
|
||||
{% tabs %}
|
||||
{% tab title="Install Using PyPI" %}
|
||||
```bash
|
||||
pip install 'openmetadata-ingestion[glue]'
|
||||
```
|
||||
{% endtab %}
|
||||
{% endtabs %}
|
||||
|
||||
### Run Manually
|
||||
|
||||
```bash
|
||||
metadata ingest -c ./examples/workflows/glue.json
|
||||
```
|
||||
|
||||
### Configuration
|
||||
|
||||
{% code title="glue.json" %}
|
||||
```javascript
|
||||
{
|
||||
"source": {
|
||||
"type": "glue",
|
||||
"config": {
|
||||
"aws_access_key_id": "aws_access_key_id",
|
||||
"aws_secret_access_key": "aws_secret_access_key",
|
||||
"db_service_name": "local_glue_db",
|
||||
"pipeline_service_name": "local_glue_pipeline",
|
||||
"region_name": "region_name",
|
||||
"endpoint_url": "endpoint_url",
|
||||
"service_name": "local_glue"
|
||||
}
|
||||
},
|
||||
...
|
||||
```
|
||||
{% endcode %}
|
||||
|
||||
1. **aws\_access\_key\_id** - Access Key for AWS.
|
||||
2. **aws\_secret\_access\_key** - Secret Key for AWS.
|
||||
3. **db\_service\_name** - Service Name for this Glue Database cluster.
|
||||
4. **pipeline\_service\_name** - Service Name for this Glue Pipeline cluster.
|
||||
5. **region\_name** - AWS account region.
|
||||
6. **endpoint\_url** - Service Endpoints from [AWS](https://docs.aws.amazon.com/general/latest/gr/glue.html).
|
||||
|
||||
## Publish to OpenMetadata
|
||||
|
||||
Below is the configuration to publish Glue data into the OpenMetadata service.
|
||||
|
||||
Add optionally `pii` processor and `metadata-rest` sink along with `metadata-server` config
|
||||
|
||||
{% code title="glue.json" %}
|
||||
```javascript
|
||||
{
|
||||
"source": {
|
||||
"type": "glue",
|
||||
"config": {
|
||||
"aws_access_key_id": "aws_access_key_id",
|
||||
"aws_secret_access_key": "aws_secret_access_key",
|
||||
"db_service_name": "local_glue_db",
|
||||
"pipeline_service_name": "local_glue_pipeline",
|
||||
"region_name": "region_name",
|
||||
"endpoint_url": "endpoint_url",
|
||||
"service_name": "local_glue"
|
||||
}
|
||||
},
|
||||
"sink": {
|
||||
"type": "metadata-rest",
|
||||
"config": {}
|
||||
},
|
||||
"metadata_server": {
|
||||
"type": "metadata-server",
|
||||
"config": {
|
||||
"api_endpoint": "http://localhost:8585/api",
|
||||
"auth_provider_type": "no-auth"
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
{% endcode %}
|
@ -7,7 +7,11 @@ OpenMetadata supports connectors to some popular services. We will continue as a
|
||||
* [Athena](../../../openmetadata/connectors/athena.md)
|
||||
* [BigQuery](../../../openmetadata/connectors/bigquery.md)
|
||||
* [BigQuery Usage](../../../openmetadata/connectors/bigquery-usage.md)
|
||||
* [Data Model](../../../connectors/data-model/)
|
||||
* [DBT](../../../connectors/data-model/dbt.md)
|
||||
* [MariaDB](../../../connectors/data-model/mariadb.md)
|
||||
* [ElasticSearch](../../../openmetadata/connectors/elastic-search.md)
|
||||
* Glue Catalog
|
||||
* [MSSQL](../../../openmetadata/connectors/mssql.md)
|
||||
* [MySQL](../../../openmetadata/connectors/mysql.md)
|
||||
* [Hive](../../../openmetadata/connectors/hive.md)
|
||||
@ -38,3 +42,4 @@ OpenMetadata supports connectors to some popular services. We will continue as a
|
||||
|
||||
* Airflow
|
||||
* Prefect
|
||||
* Glue
|
||||
|
@ -36,10 +36,9 @@ Sample Data, Tables, Usage, Users, Topics, and Dashboards.
|
||||
|
||||
```bash
|
||||
#Make sure the OpenMetadata Server is up and running
|
||||
cd openmetadata-0.5.0/ingestion
|
||||
cd openmetadata-0.6.0/ingestion
|
||||
metadata ingest -c ./pipelines/sample_data.json
|
||||
metadata ingest -c ./pipelines/sample_usage.json
|
||||
metadata ingest -c ./pipelines/sample_users.json
|
||||
```
|
||||
|
||||
### Index Sample Data into ElasticSearch
|
||||
@ -58,6 +57,6 @@ Index sample data in ElasticSearch:
|
||||
|
||||
```bash
|
||||
#Make sure the OpenMetadata Server is up and running
|
||||
cd openmetadata-0.5.0/ingestion
|
||||
cd openmetadata-0.6.0/ingestion
|
||||
metadata ingest -c ./pipelines/metadata_to_es.json
|
||||
```
|
||||
|
@ -21,7 +21,7 @@ Please refer to the previous section [Run Openmetadata](run-openmetadata.md) for
|
||||
OpenMetadata release ships with `./bin/openmetadata` init.d style script.
|
||||
|
||||
```
|
||||
cd openmetdata-0.5.0
|
||||
cd openmetdata-0.6.0
|
||||
./bin/openmetdata start
|
||||
```
|
||||
|
||||
|
@ -51,7 +51,8 @@ Preferences -> Resources -> Advanced
|
||||
Create a new directory for OpenMetadata and navigate into that directory.
|
||||
|
||||
```
|
||||
mkdir openmetadata-docker; cd openmetadata-docker
|
||||
mkdir openmetadata-docker
|
||||
cd openmetadata-docker
|
||||
```
|
||||
|
||||
### 2. Create a Python virtual environment
|
||||
|
Loading…
x
Reference in New Issue
Block a user