import FeatureAvailability from '@site/src/components/FeatureAvailability';
# AI Documentation
<FeatureAvailability saasOnly />
:::info
This feature is currently in closed beta. Reach out to your DataHub Cloud representative to get access.
:::
With AI-powered documentation, you can automatically generate documentation for tables and columns.
<p align="center">
<iframe width="560" height="315" src="https://www.youtube.com/embed/_7DieZeZspY?si=Q5FkCA0gZPEFMj0Y" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</p>
## Configuring
Ensure you have the privilege to edit dataset descriptions.
Once you have that permission, no further configuration is required: just click "Generate" on any table or column in the UI.
## How it works
Generating good documentation requires a holistic understanding of the data. Information we take into account includes, but is not limited to:
- Dataset name and any existing documentation
- Column name, type, description, and sample values
- Lineage relationships to upstream and downstream assets
- Metadata about other related assets
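
As a rough illustration of the kind of context listed above, the sketch below gathers those inputs into one structure and flattens it to text. It is not DataHub's implementation: `ColumnContext`, `DatasetContext`, and `render_context` are hypothetical names invented for this example, and the real feature may collect and format its inputs quite differently.

```python
from dataclasses import dataclass, field


@dataclass
class ColumnContext:
    # Hypothetical container for per-column inputs (name, type, description, samples).
    name: str
    type: str
    description: str = ""
    sample_values: list[str] = field(default_factory=list)


@dataclass
class DatasetContext:
    # Hypothetical container for dataset-level inputs and lineage neighbors.
    name: str
    existing_documentation: str = ""
    columns: list[ColumnContext] = field(default_factory=list)
    upstreams: list[str] = field(default_factory=list)
    downstreams: list[str] = field(default_factory=list)


def render_context(ctx: DatasetContext) -> str:
    """Flatten the context into plain text, one fact per line."""
    lines = [f"Dataset: {ctx.name}"]
    if ctx.existing_documentation:
        lines.append(f"Existing documentation: {ctx.existing_documentation}")
    for col in ctx.columns:
        line = f"Column: {col.name} ({col.type})"
        if col.description:
            line += f" - {col.description}"
        if col.sample_values:
            line += f" e.g. {', '.join(col.sample_values)}"
        lines.append(line)
    if ctx.upstreams:
        lines.append(f"Upstream: {', '.join(ctx.upstreams)}")
    if ctx.downstreams:
        lines.append(f"Downstream: {', '.join(ctx.downstreams)}")
    return "\n".join(lines)
```

The point of the sketch is only that richer surrounding metadata (existing descriptions, sample values, lineage) gives the generator more to work with than a bare table name.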
Data privacy: Your metadata is not sent to any third-party LLMs. We use AWS Bedrock internally, which means all metadata remains within the DataHub Cloud AWS account. We do not fine-tune on customer data.
## Limitations
- This feature is not available for tables with more than 100 columns. We are working on raising this limit.
- This feature is powered by an LLM, which can produce inaccurate results. While we've taken steps to reduce the likelihood of hallucinations, they can still occur.
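
If you want to know ahead of time which tables fall within the column limit, a small check like the one below may help. It is a minimal sketch that assumes you already have column counts per table from your own inventory; `MAX_COLUMNS` and `split_by_column_limit` are hypothetical names, and the limit value simply mirrors the one stated above and may change.

```python
MAX_COLUMNS = 100  # mirrors the limit stated in this section; assumed, may change


def split_by_column_limit(
    column_counts: dict[str, int], limit: int = MAX_COLUMNS
) -> tuple[list[str], list[str]]:
    """Return (eligible, too_wide) table names, each sorted alphabetically.

    A table is eligible when its column count does not exceed the limit.
    """
    eligible = sorted(name for name, n in column_counts.items() if n <= limit)
    too_wide = sorted(name for name, n in column_counts.items() if n > limit)
    return eligible, too_wide
```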