mirror of
https://github.com/datahub-project/datahub.git
synced 2025-07-04 23:57:03 +00:00
25 lines
1.6 KiB
Markdown
25 lines
1.6 KiB
Markdown
## Integration Details
|
|
|
|
This plugin extracts the following:
|
|
|
|
- Source and Sink Connectors in Kafka Connect as Data Pipelines
|
|
- For Source connectors - Data Jobs to represent lineage information between source dataset to Kafka topic per `{connector_name}:{source_dataset}` combination
|
|
- For Sink connectors - Data Jobs to represent lineage information between Kafka topic to destination dataset per `{connector_name}:{topic}` combination
|
|
|
|
### Concept Mapping
|
|
|
|
This ingestion source maps the following Source System Concepts to DataHub Concepts:
|
|
|
|
| Source Concept | DataHub Concept | Notes |
|
|
| ------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------- | ----- |
|
|
| `"kafka-connect"` | [Data Platform](https://docs.datahub.com/docs/generated/metamodel/entities/dataplatform/) | |
|
|
| [Connector](https://kafka.apache.org/documentation/#connect_connectorsandtasks) | [DataFlow](https://docs.datahub.com/docs/generated/metamodel/entities/dataflow/) | |
|
|
| Kafka Topic | [Dataset](https://docs.datahub.com/docs/generated/metamodel/entities/dataset/) | |
|
|
|
|
## Current limitations
|
|
|
|
Works only for
|
|
|
|
- Source connectors: JDBC, Debezium, Mongo and Generic connectors with user-defined lineage graph
|
|
- Sink connectors: BigQuery, Confluent, S3, Snowflake
|