Sriharsha Chintalapani 876ac24e44
Docs (#3276)
* GitBook: [#50] BigQuery, Glue, MSSQL, Postgres, Redshift, Snowflake - V2

* GitBook: [#62] No subject

* GitBook: [#63] No subject

* GitBook: [#64] Beta

* GitBook: [#65] Make Harsha's requested changes to connectors section organization

* GitBook: [#66] Kerberos authentication with Hive

* GitBook: [#67] Fix procedure overview links

* GitBook: [#68] Fix procedure overview links

* GitBook: [#69] correct step reference

* GitBook: [#70] Add Kerberos connection troubleshooting

* updated json schema and schema docs (#3219)

* updated json schema and schema docs

* added glossay to readme

* GitBook: [#72] Metrics & Tests

Co-authored-by: Parth Panchal <parth.panchal@deuexsolutions.com>
Co-authored-by: Shilpa V <vernekar.shilpa@gmail.com>
Co-authored-by: Shannon Bradshaw <shannon.bradshaw@arrikto.com>
Co-authored-by: parthp2107 <83201188+parthp2107@users.noreply.github.com>
Co-authored-by: pmbrull <peremiquelbrull@gmail.com>
2022-03-08 08:13:37 -08:00

11 KiB

Pipeline

This schema defines the Pipeline entity. A pipeline enables the flow of data from source to destination through a series of processing steps. ETL is a type of pipeline where the series of steps Extract, Transform and Load the data.

$id:https://open-metadata.org/schema/entity/data/pipeline.json

Type: object

This schema does not accept additional properties.

Properties

<<<<<<< HEAD

a07bc411 (updated json schema and schema docs (#3219))

Type definitions in this schema

statusType

  • Enum defining the possible Status.
  • Type: string
  • The value is restricted to the following:
    1. "Successful"
    2. "Failed"
    3. "Pending"

taskStatus

  • This schema defines a time series of the status of a Pipeline or Task.
  • Type: object
  • This schema does not accept additional properties.
  • Properties
    • name
      • Name of the Task.
      • Type: string
    • executionStatus

task

  • Type: object
  • This schema does not accept additional properties.
  • Properties
    • name required
      • Name that identifies this task instance uniquely.
      • Type: string
    • displayName
      • Display Name that identifies this Task. It could be title or label from the pipeline services.
      • Type: string
    • fullyQualifiedName
      • A unique name that identifies a pipeline in the format 'ServiceName.PipelineName.TaskName'.
      • Type: string
    • description
      • Description of this Task.
      • Type: string
    • taskUrl
      • Task URL to visit/manage. This URL points to respective pipeline service UI.
      • Type: string
      • String format must be a "uri"
    • downstreamTasks
      • All the tasks that are downstream of this task.
      • Type: array
        • Items
        • Type: string
    • taskType
      • Type of the Task. Usually refers to the class it implements.
      • Type: string
    • taskSQL
    • tags

This document was updated on: Tuesday, January 25, 2022

statusType

  • Enum defining the possible Status.
  • Type: string
  • The value is restricted to the following:
    1. "Successful"
    2. "Failed"
    3. "Pending"

taskStatus

  • This schema defines a time series of the status of a Pipeline or Task.
  • Type: object
  • This schema does not accept additional properties.
  • Properties
    • name
      • Name of the Task.
      • Type: string
    • executionStatus

task

  • Type: object
  • This schema does not accept additional properties.
  • Properties
    • name required
      • Name that identifies this task instance uniquely.
      • Type: string
    • displayName
      • Display Name that identifies this Task. It could be title or label from the pipeline services.
      • Type: string
    • fullyQualifiedName
      • A unique name that identifies a pipeline in the format 'ServiceName.PipelineName.TaskName'.
      • Type: string
    • description
      • Description of this Task.
      • Type: string
    • taskUrl
      • Task URL to visit/manage. This URL points to respective pipeline service UI.
      • Type: string
      • String format must be a "uri"
    • downstreamTasks
      • All the tasks that are downstream of this task.
      • Type: array
        • Items
        • Type: string
    • taskType
      • Type of the Task. Usually refers to the class it implements.
      • Type: string
    • taskSQL
    • tags

pipelineStatus

This document was updated on: Monday, March 7, 2022

a07bc411 (updated json schema and schema docs (#3219))