mirror of
				https://github.com/datahub-project/datahub.git
				synced 2025-11-03 20:27:50 +00:00 
			
		
		
		
	
		
			
				
	
	
	
		
			3.7 KiB
		
	
	
	
	
	
	
	
			
		
		
	
	
			3.7 KiB
		
	
	
	
	
	
	
	
SageMaker
For context on getting started with ingestion, check out our metadata ingestion guide.
Setup
To install this plugin, run pip install 'acryl-datahub[sagemaker]'.
Capabilities
This plugin extracts the following:
- Feature groups
 - Models, jobs, and lineage between the two (e.g. when jobs output a model or a model is used by a job)
 
Quickstart recipe
Check out the following recipe to get started with ingestion! See below for full configuration options.
For general pointers on writing and running a recipe, see our main recipe guide.
source:
  type: sagemaker
  config:
    # Coordinates
    aws_region: "my-aws-region"
sink:
  # sink configs
Config details
Note that a . is used to denote nested fields in the YAML recipe.
| Field | Required | Default | Description | 
|---|---|---|---|
aws_region | 
✅ | AWS region code. | |
env | 
"PROD" | 
Environment to use in namespace when constructing URNs. | |
aws_access_key_id | 
Autodetected | See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html | |
aws_secret_access_key | 
Autodetected | See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html | |
aws_session_token | 
Autodetected | See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html | |
aws_role | 
Autodetected | See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html | |
extract_feature_groups | 
True | 
Whether to extract feature groups. | |
extract_models | 
True | 
Whether to extract models. | |
extract_jobs.auto_ml | 
True | 
Whether to extract AutoML jobs. | |
extract_jobs.compilation | 
True | 
Whether to extract compilation jobs. | |
extract_jobs.edge_packaging | 
True | 
Whether to extract edge packaging jobs. | |
extract_jobs.hyper_parameter_tuning | 
True | 
Whether to extract hyperparameter tuning jobs. | |
extract_jobs.labeling | 
True | 
Whether to extract labeling jobs. | |
extract_jobs.processing | 
True | 
Whether to extract processing jobs. | |
extract_jobs.training | 
True | 
Whether to extract training jobs. | |
extract_jobs.transform | 
True | 
Whether to extract transform jobs. | 
Compatibility
Coming soon!
Questions
If you've got any questions on configuring this source, feel free to ping us on our Slack!