Suman Maharana a8d288cc32
Fix #11351 Added Kafka Connect connector docs (#15747)
* Added Kafka Connect JSON Schemas

* Added Kafka Connect Documentation

* Revert "Added Kafka Connect JSON Schemas"

This reverts commit 89cf700e93f1391f24156a1235e5d5e70b067704.

* Added webp image formats for kafkaconnect docs

---------

Co-authored-by: Suman Maharana <suman@Reyna.local>
2024-04-01 11:34:38 +05:30


title: Run the KafkaConnect Connector Externally
slug: /connectors/pipeline/kafkaconnect/yaml

{% connectorDetailsHeader name="KafkaConnect" stage="PROD" platform="Collate" availableFeatures=["Pipelines", "Pipeline Status", "Tags"] unavailableFeatures=["Owners", "Lineage"] / %}

In this section, we provide guides and references to use the KafkaConnect connector.

Configure and schedule KafkaConnect metadata workflows externally, using a YAML configuration:

{% partial file="/v1.3/connectors/external-ingestion-deployment.md" /%}

Requirements

Python Requirements

To run the KafkaConnect ingestion, you will need to install:

pip3 install "openmetadata-ingestion[kafkaconnect]"
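After installing, it can be useful to confirm that the package is visible to the Python interpreter that will run the workflow. The following is a minimal sketch using only the standard library; the helper name `ingestion_version` is hypothetical and not part of OpenMetadata.

```python
from importlib import metadata


def ingestion_version(package="openmetadata-ingestion"):
    """Return the installed version of the package, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = ingestion_version()
    if version is None:
        print('Not installed: run pip3 install "openmetadata-ingestion[kafkaconnect]"')
    else:
        print(f"openmetadata-ingestion {version} is installed")
```

The check only confirms that the base distribution is present; it does not verify that the `kafkaconnect` extra dependencies were resolved.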

Metadata Ingestion

All connectors are defined as JSON Schemas. The KafkaConnect connection schema defines the structure needed to create a connection to KafkaConnect.

To create and run a Metadata Ingestion workflow, follow the steps below to build a YAML configuration that connects to the source, processes the Entities if needed, and reaches the OpenMetadata server.

The workflow itself is modeled around the workflow JSON Schema.

1. Define the YAML Config

This is a sample config for KafkaConnect:

{% codePreview %}

{% codeInfoContainer %}

Source Configuration - Service Connection

{% codeInfo srNumber=1 %}

hostPort: The URI of the Kafka Connect worker with the REST API enabled, including the scheme and port, for example http://localhost:8083.

{% /codeInfo %}

{% codeInfo srNumber=2 %}

verifySSL: Whether SSL verification should be performed when authenticating.

{% /codeInfo %}

{% codeInfo srNumber=3 %}

Kafka Connect Config: OpenMetadata supports either username/password (basic) authentication or no authentication.

Basic Authentication
- Username: Username to connect to Kafka Connect. This user should be able to send requests to the Kafka Connect API and access the REST API GET endpoints.
- Password: Password to connect to Kafka Connect.

{% /codeInfo %}

{% partial file="/v1.3/connectors/yaml/pipeline/source-config-def.md" /%}

{% partial file="/v1.3/connectors/yaml/ingestion-sink-def.md" /%}

{% partial file="/v1.3/connectors/yaml/workflow-config-def.md" /%}

{% /codeInfoContainer %}

{% codeBlock fileName="filename.yaml" %}

source:
  type: kafkaconnect
  serviceName: kafka_connect_source
  serviceConnection:
    config:
      type: KafkaConnect
      hostPort: "https://<yourkafkaconnectresturihere>" # or http://localhost:8083 or http://127.0.0.1:8083
      verifySSL: true
      authType:
        username: username
        password: password

{% partial file="/v1.3/connectors/yaml/pipeline/source-config.md" /%}

{% partial file="/v1.3/connectors/yaml/ingestion-sink.md" /%}

{% partial file="/v1.3/connectors/yaml/workflow-config.md" /%}

{% /codeBlock %}

{% /codePreview %}
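Before running the workflow, it may help to confirm that the configured user can actually reach the REST API GET endpoints mentioned above. The sketch below is not the connector's own implementation: it is a standalone check, using only the standard library, that calls the Kafka Connect `GET /connectors` endpoint with optional basic authentication. The function names and the default timeout are assumptions made for illustration.

```python
import base64
import json
import ssl
import urllib.request


def build_connectors_request(host_port, username=None, password=None):
    """Build a GET request for the Kafka Connect /connectors endpoint."""
    url = host_port.rstrip("/") + "/connectors"
    request = urllib.request.Request(url, method="GET")
    request.add_header("Accept", "application/json")
    # Only attach credentials when both are given (the "no authentication" case skips this).
    if username is not None and password is not None:
        raw = f"{username}:{password}".encode("utf-8")
        token = base64.b64encode(raw).decode("ascii")
        request.add_header("Authorization", f"Basic {token}")
    return request


def list_connectors(host_port, username=None, password=None, verify_ssl=True, timeout=10):
    """Return the list of connector names reported by the worker."""
    context = ssl.create_default_context()
    if not verify_ssl:
        # Mirrors verifySSL: false; hostname checking must be disabled first.
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    request = build_connectors_request(host_port, username, password)
    with urllib.request.urlopen(request, timeout=timeout, context=context) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `list_connectors("http://localhost:8083", "username", "password")` against a reachable worker should return a JSON array of connector names; an HTTP 401 or 403 error suggests that the credentials or permissions need attention before ingestion is attempted.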

{% partial file="/v1.3/connectors/yaml/ingestion-cli.md" /%}
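When the ingestion is triggered from an external scheduler, the CLI call can be wrapped in a small Python helper. This is a minimal sketch that assumes the `metadata` command installed by `openmetadata-ingestion` is on the PATH and that the workflow is started with `metadata ingest -c <path>`; the helper names are hypothetical.

```python
import shutil
import subprocess


def build_ingest_command(config_path):
    """Return the CLI invocation for a given workflow YAML file."""
    return ["metadata", "ingest", "-c", str(config_path)]


def run_ingestion(config_path):
    """Run the ingestion CLI and raise if it is missing or exits with an error."""
    if shutil.which("metadata") is None:
        raise RuntimeError("The metadata CLI was not found on PATH")
    # check=True raises CalledProcessError on a non-zero exit code,
    # so a scheduler can mark the run as failed.
    return subprocess.run(build_ingest_command(config_path), check=True)
```

For example, `run_ingestion("filename.yaml")` would run the workflow defined in the sample config above and propagate a failure to the caller.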