MXE Processing Jobs
DataHub uses Kafka as the pub-sub message queue in its backend. It relies on two Kafka topics:
MetadataChangeEvent and MetadataAuditEvent.
- MetadataChangeEvent: Emitted by any data platform or crawler whenever its metadata changes.
- MetadataAuditEvent: Emitted by DataHub GMS to notify that a metadata change has been registered.
To consume from these two topics, DataHub uses two Kafka Streams jobs:
- MCE Consumer Job: Writes to DataHub GMS
- MAE Consumer Job: Writes to Elasticsearch & Neo4j
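
The flow described above can be illustrated with a small in-memory sketch. This is not DataHub's actual code: the broker, the `FakeGms` class, the message fields (`urn`, `aspect`, `old`, `new`), and the example URN are all hypothetical simplifications, and plain dictionaries stand in for Elasticsearch and Neo4j. It only shows the direction of data: an MCE is consumed and written to GMS, GMS emits an MAE, and the MAE is consumed to update the search and graph indices.

```python
from collections import defaultdict, deque

MCE_TOPIC = "MetadataChangeEvent"
MAE_TOPIC = "MetadataAuditEvent"


class InMemoryBroker:
    """Toy stand-in for Kafka: one FIFO queue per topic (assumption, no real Kafka)."""

    def __init__(self):
        self.topics = defaultdict(deque)

    def publish(self, topic, message):
        self.topics[topic].append(message)

    def drain(self, topic):
        # Yield messages in arrival order until the topic is empty.
        queue = self.topics[topic]
        while queue:
            yield queue.popleft()


class FakeGms:
    """Hypothetical stand-in for DataHub GMS: stores metadata and emits an MAE per change."""

    def __init__(self, broker):
        self.broker = broker
        self.store = {}

    def ingest(self, urn, aspect):
        old = self.store.get(urn)
        self.store[urn] = aspect
        # Notify downstream consumers that the change has been registered.
        self.broker.publish(MAE_TOPIC, {"urn": urn, "old": old, "new": aspect})


def run_mce_consumer(broker, gms):
    """MCE Consumer Job (sketch): reads change events and writes them to GMS."""
    for event in broker.drain(MCE_TOPIC):
        gms.ingest(event["urn"], event["aspect"])


def run_mae_consumer(broker, search_index, graph_index):
    """MAE Consumer Job (sketch): reads audit events and updates both indices."""
    for event in broker.drain(MAE_TOPIC):
        search_index[event["urn"]] = event["new"]  # stands in for Elasticsearch
        graph_index[event["urn"]] = event["new"]   # stands in for Neo4j


def demo():
    broker = InMemoryBroker()
    gms = FakeGms(broker)
    search_index, graph_index = {}, {}

    # A crawler reports a metadata change (hypothetical URN and payload).
    broker.publish(MCE_TOPIC, {"urn": "urn:example:dataset:1", "aspect": {"owner": "alice"}})

    run_mce_consumer(broker, gms)
    run_mae_consumer(broker, search_index, graph_index)
    return gms.store, search_index, graph_index
```

Calling `demo()` returns the GMS store and the two index dictionaries, which should all hold the same entry once both jobs have run. In a real deployment the two jobs run continuously as separate processes instead of being invoked once in sequence.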