2022-03-22 11:44:28 -07:00

Pipeline

This schema defines the Pipeline entity. A pipeline moves data from a source to a destination through a series of processing steps. ETL is one type of pipeline, in which the steps extract, transform, and load the data.

$id: https://open-metadata.org/schema/entity/data/pipeline.json

Type: object

This schema does not accept additional properties.

Properties

Type definitions in this schema

This document was updated on: Tuesday, January 25, 2022

statusType

  • Enum defining the possible Status.
  • Type: string
  • The value is restricted to the following:
    1. "Successful"
    2. "Failed"
    3. "Pending"
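
As a minimal sketch, the three allowed values above could be mirrored in Python as shown here. The class name `StatusType` and the helper `parse_status` are illustrative assumptions and are not part of the schema.

```python
from enum import Enum


class StatusType(str, Enum):
    """Mirror of the statusType enum (hypothetical class name)."""

    SUCCESSFUL = "Successful"
    FAILED = "Failed"
    PENDING = "Pending"


def parse_status(value: str) -> StatusType:
    """Return the matching member; raise ValueError for any other string."""
    return StatusType(value)
```

Matching is case-sensitive, so a value such as "failed" is rejected in this sketch, as it would be by the enum restriction.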

taskStatus

  • This schema defines a time series of the status of a Pipeline or Task.
  • Type: object
  • This schema does not accept additional properties.
  • Properties
    • name
      • Name of the Task.
      • Type: string
    • executionStatus

task

  • Type: object
  • This schema does not accept additional properties.
  • Properties
    • name required
      • Name that identifies this task instance uniquely.
      • Type: string
    • displayName
      • Display name that identifies this Task. It can be a title or label from the pipeline service.
      • Type: string
    • fullyQualifiedName
      • A unique name that identifies a task in the format 'ServiceName.PipelineName.TaskName'.
      • Type: string
    • description
      • Description of this Task.
      • Type: string
    • taskUrl
      • Task URL to visit/manage. This URL points to the respective pipeline service UI.
      • Type: string
      • String format must be a "uri"
    • downstreamTasks
      • All the tasks that are downstream of this task.
      • Type: array
        • Items
        • Type: string
    • taskType
      • Type of the Task. Usually refers to the class it implements.
      • Type: string
    • taskSQL
    • tags
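
The constraints listed for the task type can be sketched as a small hand-written check. This is an illustration only, not the project's validator: the function name `validate_task` and the sample values are hypothetical, the "uri" format of taskUrl is not verified, and taskSQL and tags are accepted without checking their values because their types are not documented above.

```python
from typing import Any, Dict, List

# Properties documented above with Type: string.
STRING_FIELDS = {
    "name", "displayName", "fullyQualifiedName",
    "description", "taskUrl", "taskType",
}
# taskSQL and tags are listed without a type, so only their names are allowed.
ALLOWED_FIELDS = STRING_FIELDS | {"downstreamTasks", "taskSQL", "tags"}


def validate_task(task: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means these checks passed."""
    errors: List[str] = []
    if "name" not in task:
        errors.append("missing required property: name")
    # The task type does not accept additional properties.
    for key in sorted(set(task) - ALLOWED_FIELDS):
        errors.append(f"additional property not allowed: {key}")
    for key in sorted(STRING_FIELDS & set(task)):
        if not isinstance(task[key], str):
            errors.append(f"{key} must be a string")
    downstream = task.get("downstreamTasks")
    if downstream is not None:
        is_string_list = isinstance(downstream, list) and all(
            isinstance(item, str) for item in downstream
        )
        if not is_string_list:
            errors.append("downstreamTasks must be an array of strings")
    return errors


# Hypothetical task instance used only to exercise the checks.
example_task = {
    "name": "load_orders",
    "taskType": "PythonOperator",
    "downstreamTasks": ["publish_report"],
}
problems = validate_task(example_task)
```

Collecting every problem in a list, rather than raising on the first one, makes it easier to report all violations for a task at once.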

pipelineStatus

This document was updated on: Monday, March 7, 2022