SageMaker
For context on getting started with ingestion, check out our metadata ingestion guide.
Setup
To install this plugin, run pip install 'acryl-datahub[sagemaker]'.
Capabilities
This plugin extracts the following:
- Feature groups
- Models, jobs, and lineage between the two (e.g. when jobs output a model or a model is used by a job)
Quickstart recipe
Check out the following recipe to get started with ingestion! See below for full configuration options.
For general pointers on writing and running a recipe, see our main recipe guide.
source:
  type: sagemaker
  config:
    # Coordinates
    aws_region: "my-aws-region"

sink:
  # sink configs
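The same recipe can also be expressed as a Python dictionary and run in-process. The following is a minimal sketch, not a definitive implementation: `build_recipe` and `run_recipe` are hypothetical helper names, and the `datahub-rest` sink pointing at `http://localhost:8080` is an assumption standing in for whatever sink you actually use.

```python
def build_recipe(aws_region: str, gms_server: str = "http://localhost:8080") -> dict:
    """Return the quickstart recipe as a plain dict with the same shape as the YAML."""
    return {
        "source": {
            "type": "sagemaker",
            "config": {
                # Coordinates
                "aws_region": aws_region,
            },
        },
        # Assumption: a DataHub REST sink; replace with your own sink config.
        "sink": {
            "type": "datahub-rest",
            "config": {"server": gms_server},
        },
    }


def run_recipe(recipe: dict) -> None:
    """Run the recipe in-process; needs acryl-datahub[sagemaker] installed."""
    # Imported lazily so the recipe dict can be built without DataHub installed.
    from datahub.ingestion.run.pipeline import Pipeline

    pipeline = Pipeline.create(recipe)
    pipeline.run()
    # Raise an error if the source or sink reported failures.
    pipeline.raise_from_status()
```

Calling `run_recipe(build_recipe("my-aws-region"))` would start an ingestion run, with AWS credentials resolved as described in the config details.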
Config details
Note that a . is used to denote nested fields in the YAML recipe.
| Field | Required | Default | Description |
| --- | --- | --- | --- |
| aws_region | ✅ | | AWS region code. |
| env | | "PROD" | Environment to use in namespace when constructing URNs. |
| aws_access_key_id | | Autodetected | See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html |
| aws_secret_access_key | | Autodetected | See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html |
| aws_session_token | | Autodetected | See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html |
| aws_role | | Autodetected | See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html |
| extract_feature_groups | | True | Whether to extract feature groups. |
| extract_models | | True | Whether to extract models. |
| extract_jobs.auto_ml | | True | Whether to extract AutoML jobs. |
| extract_jobs.compilation | | True | Whether to extract compilation jobs. |
| extract_jobs.edge_packaging | | True | Whether to extract edge packaging jobs. |
| extract_jobs.hyper_parameter_tuning | | True | Whether to extract hyperparameter tuning jobs. |
| extract_jobs.labeling | | True | Whether to extract labeling jobs. |
| extract_jobs.processing | | True | Whether to extract processing jobs. |
| extract_jobs.training | | True | Whether to extract training jobs. |
| extract_jobs.transform | | True | Whether to extract transform jobs. |
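To illustrate the dotted notation, the sketch below expands field names such as extract_jobs.training into the nested structure a recipe's config block would contain. `expand_dotted` is a hypothetical helper written for this example, not part of DataHub.

```python
def expand_dotted(flat: dict) -> dict:
    """Expand keys like 'extract_jobs.training' into nested dictionaries."""
    nested: dict = {}
    for dotted_key, value in flat.items():
        parts = dotted_key.split(".")
        node = nested
        # Walk (or create) one level of nesting per dotted segment.
        for part in parts[:-1]:
            node = node.setdefault(part, {})
        node[parts[-1]] = value
    return nested


# Field names as written in the table, with illustrative values.
config = expand_dotted(
    {
        "aws_region": "my-aws-region",
        "extract_jobs.training": True,
        "extract_jobs.labeling": False,
    }
)
```

Here `config["extract_jobs"]` ends up as a nested mapping with `training` and `labeling` keys, which corresponds to an indented `extract_jobs:` block under `config:` in the YAML recipe.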
Compatibility
Coming soon!
Questions
If you've got any questions on configuring this source, feel free to ping us on our Slack!