3.7 KiB

SageMaker

For context on getting started with ingestion, check out our metadata ingestion guide.

Setup

To install this plugin, run pip install 'acryl-datahub[sagemaker]'.

Capabilities

This plugin extracts the following:

  • Feature groups
  • Models, jobs, and lineage between the two (e.g. when jobs output a model or a model is used by a job)

Quickstart recipe

Check out the following recipe to get started with ingestion! See below for full configuration options.

For general pointers on writing and running a recipe, see our main recipe guide.

source:
  type: sagemaker
  config:
    # Coordinates
    aws_region: "my-aws-region"

sink:
  # sink configs

Config details

Note that a . is used to denote nested fields in the YAML recipe.

Field Required Default Description
aws_region AWS region code.
env "PROD" Environment to use in namespace when constructing URNs.
aws_access_key_id Autodetected See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html
aws_secret_access_key Autodetected See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html
aws_session_token Autodetected See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html
aws_role Autodetected See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html
extract_feature_groups True Whether to extract feature groups.
extract_models True Whether to extract models.
extract_jobs.auto_ml True Whether to extract AutoML jobs.
extract_jobs.compilation True Whether to extract compilation jobs.
extract_jobs.edge_packaging True Whether to extract edge packaging jobs.
extract_jobs.hyper_parameter_tuning True Whether to extract hyperparameter tuning jobs.
extract_jobs.labeling True Whether to extract labeling jobs.
extract_jobs.processing True Whether to extract processing jobs.
extract_jobs.training True Whether to extract training jobs.
extract_jobs.transform True Whether to extract transform jobs.

Compatibility

Coming soon!

Questions

If you've got any questions on configuring this source, feel free to ping us on our Slack!