mirror of
https://github.com/datahub-project/datahub.git
synced 2025-07-13 12:10:23 +00:00
2.5 KiB
2.5 KiB
Elastic Search
For context on getting started with ingestion, check out our metadata ingestion guide.
Setup
To install this plugin, run pip install 'acryl-datahub[elasticsearch]'
.
Capabilities
This plugin extracts the following:
- Metadata for indexes
- Column types associated with each index field
Capability | Status | Details |
---|---|---|
Platform Instance | ✔️ | link |
Quickstart recipe
Check out the following recipe to get started with ingestion! See below for full configuration options.
For general pointers on writing and running a recipe, see our main recipe guide.
source:
type: "elasticsearch"
config:
# Coordinates
host: 'localhost:9200'
# Credentials
username: ""
password: ""
# Options
env: "PROD"
index_pattern:
allow: [".*some_index_name_pattern*"]
deny: [".*skip_index_name_pattern*"]
sink:
# sink configs
Config details
Note that a .
is used to denote nested fields in the YAML recipe.
Field | Required | Default | Description |
---|---|---|---|
host |
"localhost:9092" | The elastic search host URI. | |
username |
"" | The username credential. | |
password |
"" | The password credential. | |
env |
"PROD" |
Environment to use in namespace when constructing URNs. | |
platform_instance |
None | The Platform instance to use while constructing URNs. | |
index_pattern.allow |
List of regex patterns for indexes to include in ingestion. | ||
index_pattern.deny |
List of regex patterns for indexes to exclude from ingestion. | ||
index_pattern.ignoreCase |
True |
Whether regex matching should ignore case or not |
Compatibility
Coming soon!
Questions
If you've got any questions on configuring this source, feel free to ping us on our Slack!