datahub/docs/api/tutorials/lineage.md

import Tabs from '@theme/Tabs';
import TabItem from '@theme/TabItem';

# Data Lineage

## Why Would You Use Lineage?

Data lineage is used to capture data dependencies within an organization. It allows you to track the inputs from which a data asset is derived, along with the data assets that depend on it downstream.

For more information about data lineage, refer to [About DataHub Lineage](/docs/generated/lineage/lineage-feature-guide.md).

### Goal Of This Guide

This guide will show you how to

- Add lineage between datasets.
- Add column-level lineage between datasets.

## Prerequisites

For this tutorial, you need to deploy DataHub Quickstart and ingest sample data.
For detailed steps, please refer to [Datahub Quickstart Guide](/docs/quickstart.md).

:::note
Before adding lineage, you need to ensure the targeted dataset is already present in your datahub.
If you attempt to manipulate entities that do not exist, your operation will fail.
In this guide, we will be using data from sample ingestion.
:::

## Add Lineage

<Tabs>
<TabItem value="graphql" label="GraphQL" default>

```json
mutation updateLineage {
  updateLineage(
    input: {
      edgesToAdd: [
        {
          downstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,logging_events,PROD)"
          upstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)"
        }
      ]
      edgesToRemove: []
    }
  )
}
```

Note that you can create a list of edges. For example, if you want to assign multiple upstream entities to a downstream entity, you can do the following.

```json
mutation updateLineage {
  updateLineage(
    input: {
      edgesToAdd: [
        {
          downstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,logging_events,PROD)"
          upstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)"
        }
        {
          downstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,logging_events,PROD)"
          upstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_created,PROD)"
        }
      ]
      edgesToRemove: []
    }
  )
}

```

For more information about the `updateLineage` mutation, please refer to [updateLineage](https://datahubproject.io/docs/graphql/mutations/#updatelineage).

If you see the following response, the operation was successful:

```python
{
  "data": {
    "updateLineage": true
  },
  "extensions": {}
}
```

</TabItem>
<TabItem value="curl" label="Curl">

```shell
curl --location --request POST 'http://localhost:8080/api/graphql' \
--header 'Authorization: Bearer <my-access-token>' \
--header 'Content-Type: application/json'  --data-raw '{ "query": "mutation updateLineage { updateLineage( input:{ edgesToAdd : { downstreamUrn: \"urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)\", upstreamUrn : \"urn:li:dataset:(urn:li:dataPlatform:hive,logging_events,PROD)\"}, edgesToRemove :{downstreamUrn: \"urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)\",upstreamUrn : \"urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)\" } })}", "variables":{}}'
```

Expected Response:

```json
{ "data": { "updateLineage": true }, "extensions": {} }
```

</TabItem>
<TabItem value="python" label="Python">

```python
{{ inline /metadata-ingestion/examples/library/lineage_emitter_rest.py show_path_as_comment }}
```

</TabItem>
</Tabs>

### Expected Outcomes of Adding Lineage

You can now see the lineage between `fct_users_deleted` and `logging_events`.

<p align="center">
  <img width="70%"  src="https://raw.githubusercontent.com/datahub-project/static-assets/main/imgs/apis/tutorials/lineage-added.png"/>
</p>

## Add Column-level Lineage

<Tabs>
<TabItem value="python" label="Python">

```python
{{ inline /metadata-ingestion/examples/library/lineage_emitter_dataset_finegrained_sample.py show_path_as_comment }}
```

</TabItem>
</Tabs>

### Expected Outcome of Adding Column Level Lineage

You can now see the column-level lineage between datasets. Note that you have to enable `Show Columns` to be able to see the column-level lineage.

<p align="center">
  <img width="70%"  src="https://raw.githubusercontent.com/datahub-project/static-assets/main/imgs/apis/tutorials/column-level-lineage-added.png"/>
</p>

## Read Table Lineage

<Tabs>
<TabItem value="graphql" label="GraphQL" default>

```graphql
query searchAcrossLineage {
  searchAcrossLineage(
    input: {
      query: "*"
      urn: "urn:li:dataset:(urn:li:dataPlatform:dbt,long_tail_companions.adoption.human_profiles,PROD)"
      start: 0
      count: 10
      direction: DOWNSTREAM
      orFilters: [
        {
          and: [
            {
              condition: EQUAL
              negated: false
              field: "degree"
              values: ["1", "2", "3+"]
            }
          ]
        }
      ]
    }
  ) {
    searchResults {
      degree
      entity {
        urn
        type
      }
    }
  }
}
```

This example shows using lineage degrees as a filter, but additional search filters can be included here as well.

</TabItem>
<TabItem value="curl" label="Curl">

```shell
curl --location --request POST 'http://localhost:8080/api/graphql' \
--header 'Authorization: Bearer <my-access-token>' \
--header 'Content-Type: application/json'  --data-raw '{ { "query": "query searchAcrossLineage { searchAcrossLineage( input: { query: \"*\" urn: \"urn:li:dataset:(urn:li:dataPlatform:dbt,long_tail_companions.adoption.human_profiles,PROD)\" start: 0 count: 10 direction: DOWNSTREAM orFilters: [ { and: [ { condition: EQUAL negated: false field: \"degree\" values: [\"1\", \"2\", \"3+\"] } ] } ] } ) { searchResults { degree entity { urn type } } }}"
}}'
```

</TabItem>
<TabItem value="python" label="Python">

```python
{{ inline /metadata-ingestion/examples/library/read_lineage_rest.py show_path_as_comment }}
```

</TabItem>
</Tabs>

This will perform a multi-hop lineage search on the urn specified. For more information about the `searchAcrossLineage` mutation, please refer to [searchAcrossLineage](https://datahubproject.io/docs/graphql/queries/#searchacrosslineage).

## Read Column Lineage

<Tabs>
<TabItem value="graphql" label="GraphQL" default>

```graphql
query searchAcrossLineage {
  searchAcrossLineage(
    input: {
      query: "*"
      urn: "urn:li:schemaField(urn:li:dataset:(urn:li:dataPlatform:dbt,long_tail_companions.adoption.human_profiles,PROD),profile_id)"
      start: 0
      count: 10
      direction: DOWNSTREAM
      orFilters: [
        {
          and: [
            {
              condition: EQUAL
              negated: false
              field: "degree"
              values: ["1", "2", "3+"]
            }
          ]
        }
      ]
    }
  ) {
    searchResults {
      degree
      entity {
        urn
        type
      }
    }
  }
}
```

This example shows using lineage degrees as a filter, but additional search filters can be included here as well.

</TabItem>
<TabItem value="curl" label="Curl">

```shell
curl --location --request POST 'http://localhost:8080/api/graphql' \
--header 'Authorization: Bearer <my-access-token>' \
--header 'Content-Type: application/json'  --data-raw '{ { "query": "query searchAcrossLineage { searchAcrossLineage( input: { query: \"*\" urn: \"urn:li:schemaField(urn:li:dataset:(urn:li:dataPlatform:dbt,long_tail_companions.adoption.human_profiles,PROD),profile_id)\" start: 0 count: 10 direction: DOWNSTREAM orFilters: [ { and: [ { condition: EQUAL negated: false field: \"degree\" values: [\"1\", \"2\", \"3+\"] } ] } ] } ) { searchResults { degree entity { urn type } } }}"
}}'
```

</TabItem>
</Tabs>

This will perform a multi-hop lineage search on the urn specified. You can see schemaField URNs are made up of two parts: first the table they are a column of, and second the path of the column. For more information about the `searchAcrossLineage` mutation, please refer to [searchAcrossLineage](https://datahubproject.io/docs/graphql/queries/#searchacrosslineage).
feat(docs): consolidate api guides (#7857) Co-authored-by: socar-dini <dini@socar.kr> 2023-04-20 12:17:11 +09:00			`import Tabs from '@theme/Tabs';`
			`import TabItem from '@theme/TabItem';`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00
feat: add keywords for SEO (#10358) 2024-04-30 08:12:32 +09:00			`# Data Lineage`
feat(docs): consolidate api guides (#7857) Co-authored-by: socar-dini <dini@socar.kr> 2023-04-20 12:17:11 +09:00
			`## Why Would You Use Lineage?`
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00
feat: add keywords for SEO (#10358) 2024-04-30 08:12:32 +09:00			`Data lineage is used to capture data dependencies within an organization. It allows you to track the inputs from which a data asset is derived, along with the data assets that depend on it downstream.`
docs(lineage): Lineage docs refactoring (#8899) 2023-10-04 17:43:59 +09:00
feat: add keywords for SEO (#10358) 2024-04-30 08:12:32 +09:00			`For more information about data lineage, refer to [About DataHub Lineage](/docs/generated/lineage/lineage-feature-guide.md).`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00
feat: add docs on creating tags/terms/datasets (#7608) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Pedro Silva <pedro@acryl.io> 2023-03-17 06:12:35 +09:00			`### Goal Of This Guide`
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00
feat: add missing python sdk guides based on DatahubGraph (#7875) Co-authored-by: socar-dini <dini@socar.kr> Co-authored-by: Shirshanka Das <shirshanka@apache.org> 2023-05-03 07:32:23 +09:00			`This guide will show you how to`

feat: add docs on column-level linage (#8062) 2023-05-19 07:59:30 +09:00			`- Add lineage between datasets.`
			`- Add column-level lineage between datasets.`
feat: add docs on creating tags/terms/datasets (#7608) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Pedro Silva <pedro@acryl.io> 2023-03-17 06:12:35 +09:00
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			`## Prerequisites`
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00
			`For this tutorial, you need to deploy DataHub Quickstart and ingest sample data.`
			`For detailed steps, please refer to [Datahub Quickstart Guide](/docs/quickstart.md).`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00
			`:::note`
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00			`Before adding lineage, you need to ensure the targeted dataset is already present in your datahub.`
			`If you attempt to manipulate entities that do not exist, your operation will fail.`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			`In this guide, we will be using data from sample ingestion.`
			`:::`

feat: add missing python sdk guides based on DatahubGraph (#7875) Co-authored-by: socar-dini <dini@socar.kr> Co-authored-by: Shirshanka Das <shirshanka@apache.org> 2023-05-03 07:32:23 +09:00			`## Add Lineage`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00
feat(docs): consolidate api guides (#7857) Co-authored-by: socar-dini <dini@socar.kr> 2023-04-20 12:17:11 +09:00			`<Tabs>`
			`<TabItem value="graphql" label="GraphQL" default>`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00
			```json
			`mutation updateLineage {`
			`updateLineage(`
			`input: {`
			`edgesToAdd: [`
			`{`
			`downstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,logging_events,PROD)"`
			`upstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)"`
			`}`
			`]`
			`edgesToRemove: []`
			`}`
			`)`
			`}`
			```

			`Note that you can create a list of edges. For example, if you want to assign multiple upstream entities to a downstream entity, you can do the following.`

			```json
			`mutation updateLineage {`
			`updateLineage(`
			`input: {`
			`edgesToAdd: [`
			`{`
			`downstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,logging_events,PROD)"`
			`upstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)"`
			`}`
			`{`
			`downstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,logging_events,PROD)"`
			`upstreamUrn: "urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_created,PROD)"`
			`}`
			`]`
			`edgesToRemove: []`
			`}`
			`)`
			`}`

			```

feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00			For more information about the `updateLineage` mutation, please refer to [updateLineage](https://datahubproject.io/docs/graphql/mutations/#updatelineage).
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00
			`If you see the following response, the operation was successful:`
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			```python
			`{`
			`"data": {`
			`"updateLineage": true`
			`},`
			`"extensions": {}`
			`}`
			```
feat: add missing python sdk guides based on DatahubGraph (#7875) Co-authored-by: socar-dini <dini@socar.kr> Co-authored-by: Shirshanka Das <shirshanka@apache.org> 2023-05-03 07:32:23 +09:00
feat(docs): consolidate api guides (#7857) Co-authored-by: socar-dini <dini@socar.kr> 2023-04-20 12:17:11 +09:00			`</TabItem>`
			`<TabItem value="curl" label="Curl">`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00
			```shell
			`curl --location --request POST 'http://localhost:8080/api/graphql' \`
			`--header 'Authorization: Bearer <my-access-token>' \`
			--header 'Content-Type: application/json' --data-raw '{ "query": "mutation updateLineage { updateLineage( input:{ edgesToAdd : { downstreamUrn: \"urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)\", upstreamUrn : \"urn:li:dataset:(urn:li:dataPlatform:hive,logging_events,PROD)\"}, edgesToRemove :{downstreamUrn: \"urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)\",upstreamUrn : \"urn:li:dataset:(urn:li:dataPlatform:hive,fct_users_deleted,PROD)\" } })}", "variables":{}}'
			```
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			`Expected Response:`
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			```json
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00			`{ "data": { "updateLineage": true }, "extensions": {} }`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			```
feat: add missing python sdk guides based on DatahubGraph (#7875) Co-authored-by: socar-dini <dini@socar.kr> Co-authored-by: Shirshanka Das <shirshanka@apache.org> 2023-05-03 07:32:23 +09:00
feat(docs): consolidate api guides (#7857) Co-authored-by: socar-dini <dini@socar.kr> 2023-04-20 12:17:11 +09:00			`</TabItem>`
			`<TabItem value="python" label="Python">`
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			```python
feat: enriching guide on creating dataset (#7777) Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> Co-authored-by: socar-dini <dini@socar.kr> 2023-04-19 12:58:03 +09:00			`{{ inline /metadata-ingestion/examples/library/lineage_emitter_rest.py show_path_as_comment }}`
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			```

feat(docs): consolidate api guides (#7857) Co-authored-by: socar-dini <dini@socar.kr> 2023-04-20 12:17:11 +09:00			`</TabItem>`
			`</Tabs>`

			`### Expected Outcomes of Adding Lineage`
feat(docs): refactor guide on graphql (#7745) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> Co-authored-by: Hyejin Yoon <hyejin.yoon@acryl.io> 2023-04-08 08:26:58 +09:00
feat(docs): add docs on lineage (#7576) Co-authored-by: Hyejin Yoon <yoonhyejin@Hyejins-MacBook-Pro.local> 2023-03-16 08:19:31 +09:00			You can now see the lineage between `fct_users_deleted` and `logging_events`.

docs(docs): add native versioning (#8714) 2023-08-26 06:10:13 +09:00			`<p align="center">`
			`<img width="70%" src="https://raw.githubusercontent.com/datahub-project/static-assets/main/imgs/apis/tutorials/lineage-added.png"/>`
			`</p>`

feat: add docs on column-level linage (#8062) 2023-05-19 07:59:30 +09:00			`## Add Column-level Lineage`

			`<Tabs>`
			`<TabItem value="python" label="Python">`

			```python
			`{{ inline /metadata-ingestion/examples/library/lineage_emitter_dataset_finegrained_sample.py show_path_as_comment }}`
			```

			`</TabItem>`
			`</Tabs>`

			`### Expected Outcome of Adding Column Level Lineage`

			You can now see the column-level lineage between datasets. Note that you have to enable `Show Columns` to be able to see the column-level lineage.

docs(docs): add native versioning (#8714) 2023-08-26 06:10:13 +09:00			`<p align="center">`
			`<img width="70%" src="https://raw.githubusercontent.com/datahub-project/static-assets/main/imgs/apis/tutorials/column-level-lineage-added.png"/>`
			`</p>`

docs(impact analysis): Add column level impact analysis graphql example (#10427) 2024-05-09 13:57:44 -07:00			`## Read Table Lineage`
docs(lineage): add read lineage example (#8322) 2023-06-30 08:48:05 -07:00
			`<Tabs>`
			`<TabItem value="graphql" label="GraphQL" default>`

fix:small typo on graphql tutorial (#8741) 2023-09-01 18:14:28 +09:00			```graphql
			`query searchAcrossLineage {`
docs(lineage): add read lineage example (#8322) 2023-06-30 08:48:05 -07:00			`searchAcrossLineage(`
			`input: {`
			`query: "*"`
			`urn: "urn:li:dataset:(urn:li:dataPlatform:dbt,long_tail_companions.adoption.human_profiles,PROD)"`
			`start: 0`
			`count: 10`
			`direction: DOWNSTREAM`
			`orFilters: [`
			`{`
			`and: [`
			`{`
			`condition: EQUAL`
			`negated: false`
			`field: "degree"`
			`values: ["1", "2", "3+"]`
			`}`
			`]`
			`}`
			`]`
			`}`
			`) {`
			`searchResults {`
			`degree`
			`entity {`
			`urn`
			`type`
			`}`
			`}`
			`}`
			`}`
			```

docs(graphql): Correct mutation -> query for searchAcrossLineage examples (#9134) 2023-10-27 20:18:31 -07:00			`This example shows using lineage degrees as a filter, but additional search filters can be included here as well.`
docs(lineage): add read lineage example (#8322) 2023-06-30 08:48:05 -07:00
			`</TabItem>`
			`<TabItem value="curl" label="Curl">`

			```shell
			`curl --location --request POST 'http://localhost:8080/api/graphql' \`
			`--header 'Authorization: Bearer <my-access-token>' \`
docs(graphql): Correct mutation -> query for searchAcrossLineage examples (#9134) 2023-10-27 20:18:31 -07:00			`--header 'Content-Type: application/json' --data-raw '{ { "query": "query searchAcrossLineage { searchAcrossLineage( input: { query: \"*\" urn: \"urn:li:dataset:(urn:li:dataPlatform:dbt,long_tail_companions.adoption.human_profiles,PROD)\" start: 0 count: 10 direction: DOWNSTREAM orFilters: [ { and: [ { condition: EQUAL negated: false field: \"degree\" values: [\"1\", \"2\", \"3+\"] } ] } ] } ) { searchResults { degree entity { urn type } } }}"`
docs(lineage): add read lineage example (#8322) 2023-06-30 08:48:05 -07:00			`}}'`
			```

			`</TabItem>`
			`<TabItem value="python" label="Python">`

			```python
			`{{ inline /metadata-ingestion/examples/library/read_lineage_rest.py show_path_as_comment }}`
			```

			`</TabItem>`
			`</Tabs>`

			This will perform a multi-hop lineage search on the urn specified. For more information about the `searchAcrossLineage` mutation, please refer to [searchAcrossLineage](https://datahubproject.io/docs/graphql/queries/#searchacrosslineage).
docs(impact analysis): Add column level impact analysis graphql example (#10427) 2024-05-09 13:57:44 -07:00
			`## Read Column Lineage`

			`<Tabs>`
			`<TabItem value="graphql" label="GraphQL" default>`

			```graphql
			`query searchAcrossLineage {`
			`searchAcrossLineage(`
			`input: {`
			`query: "*"`
			`urn: "urn:li:schemaField(urn:li:dataset:(urn:li:dataPlatform:dbt,long_tail_companions.adoption.human_profiles,PROD),profile_id)"`
			`start: 0`
			`count: 10`
			`direction: DOWNSTREAM`
			`orFilters: [`
			`{`
			`and: [`
			`{`
			`condition: EQUAL`
			`negated: false`
			`field: "degree"`
			`values: ["1", "2", "3+"]`
			`}`
			`]`
			`}`
			`]`
			`}`
			`) {`
			`searchResults {`
			`degree`
			`entity {`
			`urn`
			`type`
			`}`
			`}`
			`}`
			`}`
			```

			`This example shows using lineage degrees as a filter, but additional search filters can be included here as well.`

			`</TabItem>`
			`<TabItem value="curl" label="Curl">`

			```shell
			`curl --location --request POST 'http://localhost:8080/api/graphql' \`
			`--header 'Authorization: Bearer <my-access-token>' \`
			`--header 'Content-Type: application/json' --data-raw '{ { "query": "query searchAcrossLineage { searchAcrossLineage( input: { query: \"*\" urn: \"urn:li:schemaField(urn:li:dataset:(urn:li:dataPlatform:dbt,long_tail_companions.adoption.human_profiles,PROD),profile_id)\" start: 0 count: 10 direction: DOWNSTREAM orFilters: [ { and: [ { condition: EQUAL negated: false field: \"degree\" values: [\"1\", \"2\", \"3+\"] } ] } ] } ) { searchResults { degree entity { urn type } } }}"`
			`}}'`
			```

			`</TabItem>`
			`</Tabs>`

			This will perform a multi-hop lineage search on the urn specified. You can see schemaField URNs are made up of two parts: first the table they are a column of, and second the path of the column. For more information about the `searchAcrossLineage` mutation, please refer to [searchAcrossLineage](https://datahubproject.io/docs/graphql/queries/#searchacrosslineage).