### Business Glossary File Format
The business glossary source file should be a `.yml` file with the following top-level keys:

**Glossary**: the top-level keys of the business glossary file

- **version**: the version of the business glossary file format that the file conforms to. Currently, the only released version is `1`.
- **source**: the source format of the terms. Currently, only `DataHub` is supported.
- **owners**: owners of the glossary, given as two nested fields:
  - **users**: (optional) a list of user ids
  - **groups**: (optional) a list of group ids
- **url**: (optional) URL pointing to where the glossary is defined externally, if applicable
- **nodes**: (optional) list of child **GlossaryNode** objects
- **terms**: (optional) list of child **GlossaryTerm** objects
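
As an illustration of the top-level keys listed above, a minimal glossary file might look like the sketch below. All names, ids, and the URL are hypothetical placeholders, and the exact value types accepted (for example, whether `version` is written as a number or a string) are an assumption to check against the example file in the repository.

```yaml
# Hypothetical top-level skeleton; every name and value is a placeholder.
version: 1
source: DataHub
owners:
  users:
    - jdoe              # placeholder user id
  groups:
    - data-stewards     # placeholder group id
url: "https://example.com/glossary"   # optional
nodes:                  # optional list of GlossaryNode objects
  - name: Finance
    description: Terms used by the finance team
terms:                  # optional list of GlossaryTerm objects
  - name: Revenue
    description: Income from normal business operations
    term_source: INTERNAL
```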

**GlossaryNode**: a container of **GlossaryNode** and **GlossaryTerm** objects

- **name**: name of the node
- **description**: description of the node
- **id**: (optional) identifier of the node. It is normally inferred from the name (see the `enable_auto_id` config); set it explicitly if you need a stable identifier.
- **owners**: (optional) owners of the node, given as two nested fields:
  - **users**: (optional) a list of user ids
  - **groups**: (optional) a list of group ids
- **terms**: (optional) list of child **GlossaryTerm** objects
- **nodes**: (optional) list of child **GlossaryNode** objects
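
To show how the **GlossaryNode** fields above nest, here is a small sketch of a node that contains one term and one child node. The names, the `id` value, and the group id are hypothetical and only meant to illustrate the shape.

```yaml
# Hypothetical GlossaryNode fragment; names and ids are placeholders.
nodes:
  - name: Classification
    description: Terms related to data classification
    id: classification-root        # optional; set only if a stable id is needed
    owners:                        # optional
      groups:
        - governance
    terms:                         # optional child GlossaryTerm objects
      - name: Sensitive
        description: Data that must be handled with care
        term_source: INTERNAL
    nodes:                         # optional child GlossaryNode objects
      - name: PersonalInformation
        description: Terms describing personal data
```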

**GlossaryTerm**: a term in your business glossary

- **name**: name of the term
- **description**: description of the term
- **id**: (optional) identifier of the term. It is normally inferred from the name (see the `enable_auto_id` config); set it explicitly if you need a stable identifier.
- **owners**: (optional) owners of the term, given as two nested fields:
  - **users**: (optional) a list of user ids
  - **groups**: (optional) a list of group ids
- **term_source**: one of `EXTERNAL` or `INTERNAL`, indicating whether the term comes from an external glossary or is defined in your organization
- **source_ref**: (optional) if external, the name of the source the glossary term comes from
- **source_url**: (optional) if external, the URL of the source definition
- **inherits**: (optional) list of **GlossaryTerm**s that this term inherits from
- **contains**: (optional) list of **GlossaryTerm**s that this term contains
- **custom_properties**: a map of key/value pairs of arbitrary custom properties
- **domain**: (optional) domain name or domain URN
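
The **GlossaryTerm** fields above can be combined as in the following sketch of an externally sourced term. Every value is a hypothetical placeholder. In particular, the way other terms are referenced under `inherits` and `contains` (shown here as dotted paths) and the value types under `custom_properties` are assumptions, so check them against the example file in the repository.

```yaml
# Hypothetical GlossaryTerm fragment; all values are placeholders.
terms:
  - name: Email
    description: An individual's email address
    id: email-address                  # optional stable identifier
    owners:                            # optional
      users:
        - jdoe
    term_source: EXTERNAL
    source_ref: ExampleStandard        # name of the external source
    source_url: "https://example.com/standards/email"
    inherits:                          # reference format is an assumption
      - Classification.Sensitive
    contains:                          # reference format is an assumption
      - PersonalInformation.Address
    custom_properties:
      is_confidential: "true"
    domain: Marketing                  # domain name or domain URN
```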

You can also view an example business glossary file [here](https://github.com/datahub-project/datahub/blob/master/metadata-ingestion/examples/bootstrap_data/business_glossary.yml).

## Compatibility
Compatible with version 1 of the business glossary format.
The source will evolve as newer versions of this format are published.