mirror of
https://github.com/datahub-project/datahub.git
synced 2025-07-04 07:34:44 +00:00
149 lines
4.6 KiB
Markdown
149 lines
4.6 KiB
Markdown
import Tabs from '@theme/Tabs';
|
||
import TabItem from '@theme/TabItem';
|
||
|
||
# Documentation Forms
|
||
|
||
## Why Would You Use Documentation Forms?
|
||
|
||
Documentation Forms are a way for end-users to fill out all mandatory attributes associated with a data asset. The form will be dynamically generated based on the definitions provided by administrators and stewards and matching rules.
|
||
|
||
Learn more about forms in the [Documentation Forms Feature Guide](../../../docs/features/feature-guides/documentation-forms.md).
|
||
|
||
|
||
### Goal Of This Guide
|
||
This guide will show you how to create and read forms.
|
||
|
||
## Prerequisites
|
||
|
||
For this tutorial, you need to deploy DataHub Quickstart and ingest sample data.
|
||
For detailed information, please refer to [Datahub Quickstart Guide](/docs/quickstart.md).
|
||
|
||
|
||
<Tabs>
|
||
<TabItem value="CLI" label="CLI">
|
||
|
||
Install the relevant CLI version. Forms are available as of CLI version `0.13.1`. The corresponding DataHub Cloud release version is `v0.2.16.5`
|
||
Connect to your instance via [init](https://datahubproject.io/docs/cli/#init):
|
||
|
||
1. Run `datahub init` to update the instance you want to load into
|
||
2. Set the server to your sandbox instance, `https://{your-instance-address}/gms`
|
||
3. Set the token to your access token
|
||
|
||
|
||
</TabItem>
|
||
</Tabs>
|
||
|
||
|
||
## Create a Form
|
||
|
||
<Tabs>
|
||
<TabItem value="CLI" label="CLI">
|
||
|
||
Create a yaml file representing the forms you’d like to load.
|
||
For example, below file represents a form `123456` You can see the full example [here](https://github.com/datahub-project/datahub/blob/example-yaml-sp/metadata-ingestion/examples/forms/forms.yaml).
|
||
|
||
|
||
```yaml
|
||
- id: 123456
|
||
# urn: "urn:li:form:123456" # optional if id is provided
|
||
type: VERIFICATION # Supported Types: DOCUMENTATION, VERIFICATION
|
||
name: "Metadata Initiative 2023"
|
||
description: "How we want to ensure the most important data assets in our organization have all of the most important and expected pieces of metadata filled out"
|
||
prompts:
|
||
- id: "123"
|
||
title: "Retention Time"
|
||
description: "Apply Retention Time structured property to form"
|
||
type: STRUCTURED_PROPERTY
|
||
structured_property_id: io.acryl.privacy.retentionTime
|
||
required: True # optional, will default to True
|
||
entities: # Either pass a list of urns or a group of filters. This example shows a list of urns
|
||
urns:
|
||
- urn:li:dataset:(urn:li:dataPlatform:hdfs,SampleHdfsDataset,PROD)
|
||
# optionally assign the form to a specific set of users and/or groups
|
||
# when omitted, form will be assigned to Asset owners
|
||
actors:
|
||
users:
|
||
- urn:li:corpuser:jane@email.com # note: these should be urns
|
||
- urn:li:corpuser:john@email.com
|
||
groups:
|
||
- urn:li:corpGroup:team@email.com # note: these should be urns
|
||
```
|
||
|
||
:::note
|
||
Note that the structured properties and related entities should be created before you create the form.
|
||
Please refer to the [Structured Properties Tutorial](/docs/api/tutorials/structured-properties.md) for more information.
|
||
:::
|
||
|
||
|
||
You can apply forms to either a list of entity urns, or a list of filters. For a list of entity urns, use this structure:
|
||
|
||
```
|
||
entities:
|
||
urns:
|
||
- urn:li:dataset:...
|
||
```
|
||
|
||
For a list of filters, use this structure:
|
||
|
||
```
|
||
entities:
|
||
filters:
|
||
types:
|
||
- dataset # you can use entity type name or urn
|
||
platforms:
|
||
- snowflake # you can use platform name or urn
|
||
domains:
|
||
- urn:li:domain:finance # you must use domain urn
|
||
containers:
|
||
- urn:li:container:my_container # you must use container urn
|
||
```
|
||
|
||
Note that you can filter to entity types, platforms, domains, and/or containers.
|
||
|
||
Use the CLI to create your properties:
|
||
|
||
```commandline
|
||
datahub forms upsert -f {forms_yaml}
|
||
```
|
||
|
||
If successful, you should see `Created form urn:li:form:...`
|
||
|
||
</TabItem>
|
||
</Tabs>
|
||
|
||
## Read Property Definition
|
||
|
||
|
||
<Tabs>
|
||
<TabItem value="CLI" label="CLI">
|
||
|
||
You can see the properties you created by running the following command:
|
||
|
||
```commandline
|
||
datahub forms get --urn {urn}
|
||
```
|
||
For example, you can run `datahub forms get --urn urn:li:form:123456`.
|
||
|
||
If successful, you should see metadata about your form returned like below.
|
||
|
||
```json
|
||
{
|
||
"urn": "urn:li:form:123456",
|
||
"name": "Metadata Initiative 2023",
|
||
"description": "How we want to ensure the most important data assets in our organization have all of the most important and expected pieces of metadata filled out",
|
||
"prompts": [
|
||
{
|
||
"id": "123",
|
||
"title": "Retention Time",
|
||
"description": "Apply Retention Time structured property to form",
|
||
"type": "STRUCTURED_PROPERTY",
|
||
"structured_property_urn": "urn:li:structuredProperty:io.acryl.privacy.retentionTime"
|
||
}
|
||
],
|
||
"type": "VERIFICATION"
|
||
}
|
||
```
|
||
|
||
</TabItem>
|
||
</Tabs>
|