---
description: This guide will help you install the Postgres connector and run it manually
---
# Postgres
{% hint style="info" %}
**Prerequisites**
OpenMetadata is built using Java, DropWizard, Jetty, and MySQL.
1. Python 3.7 or above
{% endhint %}
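To confirm the Python prerequisite before installing, a check like the one below may help. This is a minimal sketch: `meets_minimum_python` is a hypothetical helper written for illustration and is not part of OpenMetadata.

```python
import sys

# Minimum version taken from the prerequisites above.
MIN_PYTHON = (3, 7)


def meets_minimum_python(version_info=None, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version is at least `minimum`.

    Hypothetical helper for illustration only.
    """
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    if meets_minimum_python():
        print("Python version is sufficient")
    else:
        print("Python 3.7 or above is required")
```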
## Install from PyPI
{% tabs %}
{% tab title="Install Using PyPI" %}
```bash
pip install 'openmetadata-ingestion[postgres]'
```
{% endtab %}
{% endtabs %}
## Run Manually
```bash
metadata ingest -c ./examples/workflows/postgres.json
```
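If you would rather launch the same command from a Python script, something along these lines could work. It is only a sketch: it assumes the `metadata` CLI is on your `PATH` after installation, and `build_ingest_command` and `run_ingestion` are hypothetical helpers, not functions provided by the OpenMetadata package.

```python
import shutil
import subprocess


def build_ingest_command(config_path):
    """Build the argument list for the `metadata ingest` command shown above."""
    return ["metadata", "ingest", "-c", config_path]


def run_ingestion(config_path):
    """Run the ingestion workflow and return the process exit code.

    Assumes the `metadata` executable is available on PATH.
    """
    if shutil.which("metadata") is None:
        raise RuntimeError(
            "`metadata` CLI not found; install openmetadata-ingestion first"
        )
    completed = subprocess.run(build_ingest_command(config_path))
    return completed.returncode
```

Calling `run_ingestion("./examples/workflows/postgres.json")` would then mirror the shell command, with a non-zero return value signalling that the command failed.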
## Configuration
{% code title="postgres.json" %}
```javascript
{
  "source": {
    "type": "postgres",
    "config": {
      "username": "openmetadata_user",
      "password": "openmetadata_password",
      "host_port": "localhost:5432",
      "database": "pagila",
      "service_name": "local_postgres",
      "data_profiler_enabled": "true",
      "data_profiler_offset": "0",
      "data_profiler_limit": "50000"
    }
  },
...
```
{% endcode %}
1. **username** - The Postgres username.
2. **password** - The password for the Postgres username.
3. **service\_name** - The service name for this Postgres cluster. If you added the cluster through the OpenMetadata UI, make sure this value matches the name used there.
4. **filter\_pattern** - Contains `includes` and `excludes` options that select, by pattern, which datasets to ingest into OpenMetadata.
5. **database** - The name of the database to fetch data from.
6. **data\_profiler\_enabled** - Optional. Enables data profiling for the newly ingested data.
7. **data\_profiler\_offset** - The offset for the data profiler.
8. **data\_profiler\_limit** - The limit for the data profiler.
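Before running the workflow, it can be useful to check that the source config carries the fields described above. The sketch below does that for a config loaded from JSON. Treating these particular keys as required is an assumption made for illustration, and `missing_source_keys` is a hypothetical helper rather than part of the ingestion package.

```python
import json

# Keys that appear in the sample source config above. Treating them as
# required is an assumption of this sketch, not a documented rule.
EXPECTED_KEYS = ("username", "password", "host_port", "database", "service_name")


def missing_source_keys(workflow):
    """Return the expected keys absent from workflow["source"]["config"]."""
    config = workflow.get("source", {}).get("config", {})
    return [key for key in EXPECTED_KEYS if key not in config]


sample = json.loads("""
{
  "source": {
    "type": "postgres",
    "config": {
      "username": "openmetadata_user",
      "password": "openmetadata_password",
      "host_port": "localhost:5432",
      "database": "pagila",
      "service_name": "local_postgres"
    }
  }
}
""")

print(missing_source_keys(sample))  # prints []
```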
## Publish to OpenMetadata
Below is the configuration to publish Postgres data into the OpenMetadata service.
Optionally, add the `pii` processor and the `metadata-rest` sink along with the `metadata-server` config.
{% code title="postgres.json" %}
```javascript
{
  "source": {
    "type": "postgres",
    "config": {
      "username": "openmetadata_user",
      "password": "openmetadata_password",
      "host_port": "localhost:5432",
      "database": "pagila",
      "service_name": "local_postgres",
      "data_profiler_enabled": "true",
      "data_profiler_offset": "0",
      "data_profiler_limit": "50000"
    }
  },
  "sink": {
    "type": "metadata-rest",
    "config": {}
  },
  "metadata_server": {
    "type": "metadata-server",
    "config": {
      "api_endpoint": "http://localhost:8585/api",
      "auth_provider_type": "no-auth"
    }
  }
}
```
{% endcode %}
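If you generate this file for several databases, you could assemble it programmatically. The following is a minimal sketch that reproduces the sections of the `postgres.json` above. `build_workflow` is a hypothetical helper, and the default endpoint simply repeats the value from the sample config.

```python
import json


def build_workflow(source_config, api_endpoint="http://localhost:8585/api"):
    """Assemble a workflow dict with the same sections as postgres.json above."""
    return {
        "source": {"type": "postgres", "config": source_config},
        "sink": {"type": "metadata-rest", "config": {}},
        "metadata_server": {
            "type": "metadata-server",
            "config": {
                "api_endpoint": api_endpoint,
                "auth_provider_type": "no-auth",
            },
        },
    }


workflow = build_workflow(
    {
        "username": "openmetadata_user",
        "password": "openmetadata_password",
        "host_port": "localhost:5432",
        "database": "pagila",
        "service_name": "local_postgres",
    }
)

# Serialize to text; write this to a file and pass it to `metadata ingest -c`.
workflow_json = json.dumps(workflow, indent=2)
print(workflow_json)
```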