mirror of
https://github.com/open-metadata/OpenMetadata.git
synced 2025-07-19 15:31:59 +00:00

* GitBook: [#70] Update to Roadmap * GitBook: [#72] Modified Looker and Oracle Connector * GitBook: [#71] Change Events * GitBook: [#74] API Component * GitBook: [#73] MlFlow Connector * GitBook: [#75] PUT Diagram * GitBook: [#76] Change Events * GitBook: [#77] Snowflake Single Sign-on changes Co-authored-by: OpenMetadata <github@harsha.io> Co-authored-by: Ayush Shah <ayush.shah@deuexsolutions.com> Co-authored-by: pmbrull <peremiquelbrull@gmail.com>
1.7 KiB
1.7 KiB
description |
---|
This guide will help install the MlFlow connector and run it manually |
MlFlow
{% hint style="info" %} Prerequisites
OpenMetadata is built using Java, DropWizard, Jetty, and MySQL.
- Python 3.7 or above {% endhint %}
Install from PyPI
{% tabs %} {% tab title="Install Using PyPI" %}
pip install 'openmetadata-ingestion[mlflow]'
{% endtab %} {% endtabs %}
Run Manually
metadata ingest -c ./examples/workflows/mlflow.json
Configuration
{% code title="mlflow.json" %}
{
"source": {
"type": "mlflow",
"config": {
"tracking_uri": "http://localhost:5000",
"registry_uri": "mysql+pymysql://mlflow:password@localhost:3307/experiments"
}
...
{% endcode %}
- tracking_uri - MlFlow server containing the tracking information of runs and experiments (docs).
- registry_uri - Backend store where the Tracking Server stores experiment and run metadata (docs).
Publish to OpenMetadata
Below is the configuration to publish MlFlow data into the OpenMetadata service.
Add optionally pii
processor and metadata-rest
sink along with metadata-server
config
{% code title="mlflow.json" %}
{
"source": {
"type": "mlflow",
"config": {
"tracking_uri": "http://localhost:5000",
"registry_uri": "mysql+pymysql://mlflow:password@localhost:3307/experiments"
}
},
"sink": {
"type": "metadata-rest",
"config": {}
},
"metadata_server": {
"type": "metadata-server",
"config": {
"api_endpoint": "http://localhost:8585/api",
"auth_provider_type": "no-auth"
}
}
}
{% endcode %}