mirror of
				https://github.com/open-metadata/OpenMetadata.git
				synced 2025-10-26 16:22:09 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			271 lines
		
	
	
		
			8.0 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			271 lines
		
	
	
		
			8.0 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| ---
 | |
| title: Kubernetes On Premises Deployment
 | |
| slug: /deployment/kubernetes/on-prem
 | |
| collate: false
 | |
| ---
 | |
| 
 | |
| # On Premises Kubernetes Deployment
 | |
| 
 | |
| OpenMetadata supports the Installation and Running of application on OnPremises Kubernetes through Helm Charts.
 | |
| However, there are some additional configurations which needs to be done as prerequisites for the same.
 | |
| 
 | |
| {%note noteType="Warning"%}
 | |
| 
 | |
| This guide presumes you have an on premises Kubernetes cluster setup, and you are installing OpenMetadata in `default` namespace.
 | |
| 
 | |
| {%/note%}
 | |
| 
 | |
| ## Prerequisites
 | |
| 
 | |
| ### External Database and Search Engine as ElasticSearch / OpenSearch
 | |
| 
 | |
| We support 
 | |
| 
 | |
| - MySQL engine version 8 or higher
 | |
| - PostgreSQL engine version 12 or higher
 | |
| - ElasticSearch version 8.X (upto 8.11.4) or OpenSearch Version 2.X (upto 2.7)
 | |
| 
 | |
| Once you have the External Database and Search Engine configured, you can update the environment variables below for OpenMetadata kubernetes deployments to connect with Database and ElasticSearch.
 | |
| 
 | |
| ```yaml
 | |
| # openmetadata-values.prod.yaml
 | |
| ...
 | |
| openmetadata:
 | |
|   config:
 | |
|     elasticsearch:
 | |
|       host: <SEARCH_ENGINE_ENDPOINT_WITHOUT_HTTPS>
 | |
|       searchType: elasticsearch # or `opensearch` if Search Engine is OpenSearch
 | |
|       port: 443
 | |
|       scheme: https
 | |
|       connectionTimeoutSecs: 5
 | |
|       socketTimeoutSecs: 60
 | |
|       keepAliveTimeoutSecs: 600
 | |
|       batchSize: 10
 | |
|       auth:
 | |
|         enabled: true
 | |
|         username: <SEARCH_ENGINE_CLOUD_USERNAME>
 | |
|         password:
 | |
|           secretRef: elasticsearch-secrets
 | |
|           secretKey: openmetadata-elasticsearch-password
 | |
|     database:
 | |
|       host: <DATABSE_SQL_ENDPOINT>
 | |
|       port: 3306
 | |
|       driverClass: com.mysql.cj.jdbc.Driver
 | |
|       dbScheme: mysql
 | |
|       dbUseSSL: true
 | |
|       databaseName: <DATABASE_SQL_DATABASE_NAME>
 | |
|       auth:
 | |
|         username: <DATABASE_SQL_DATABASE_USERNAME>
 | |
|         password:
 | |
|           secretRef: mysql-secrets
 | |
|           secretKey: openmetadata-mysql-password
 | |
|   ...
 | |
| ```
 | |
| 
 | |
| Make sure to create database and search engine credentials as Kubernetes Secrets mentioned [here](/quick-start/local-kubernetes-deployment#2.-create-kubernetes-secrets-required-for-helm-charts).
 | |
| 
 | |
| Also, disable MySQL and ElasticSearch from OpenMetadata Dependencies Helm Charts as mentioned in the FAQs [here](#how-to-disable-mysql-and-elasticsearch-from-openmetadata-dependencies-helm-charts).
 | |
| 
 | |
| ### Persistent Volumes with ReadWriteMany Access Modes
 | |
| 
 | |
| OpenMetadata helm chart depends on Airflow and Airflow expects a persistent disk that support ReadWriteMany (the volume can be mounted as read-write by many nodes).
 | |
| 
 | |
| The workaround is to create nfs-share and use that as the persistent claim to deploy OpenMetadata by implementing the following steps in order.
 | |
| 
 | |
| {%note%}
 | |
| 
 | |
| This guide assumes you have NFS Server already setup with Hostname or IP Address which is reachable from your on premises Kubernetes cluster, and you have configured a path to be used for OpenMetadata Airflow Helm Dependency.
 | |
| 
 | |
| {%/note%}
 | |
| 
 | |
| ### Dynamic Provisioning using StorageClass
 | |
| 
 | |
| To provision PersistentVolume dynamically using the StorageClass, you need to install the NFS provisioner.
 | |
| It is recommended to use [nfs-subdir-external-provisioner](https://github.com/kubernetes-sigs/nfs-subdir-external-provisioner) helm charts for this case.
 | |
| 
 | |
| ```commandline
 | |
| helm repo add nfs-subdir-external-provisioner https://kubernetes-sigs.github.io/nfs-subdir-external-provisioner
 | |
| 
 | |
| helm install nfs-subdir-external-provisioner nfs-subdir-external-provisioner/nfs-subdir-external-provisioner \
 | |
|   --create-namespace \
 | |
|   --namespace nfs-provisioner \
 | |
|   --set nfs.server=<NFS_HOSTNAME_OR_IP> \
 | |
|   --set nfs.path=/airflow
 | |
| ```
 | |
| 
 | |
| 
 | |
| Replace the `NFS_HOSTNAME_OR_IP` with your NFS Server value and run the commands.
 | |
| 
 | |
| 
 | |
| 
 | |
| This will create a new StorageClass with `nfs-subdir-external-provisioner`. You can view the same using the kubectl command `kubectl get storageclass -n nfs-provisioner`.
 | |
| 
 | |
| 
 | |
| ## Provision NFS backed PVC for Airflow DAGs and Airflow Logs
 | |
| 
 | |
| ### Code Samples for PVC for Airflow DAGs
 | |
| 
 | |
| ```yaml
 | |
| # dags_pvc.yml
 | |
| apiVersion: v1
 | |
| kind: PersistentVolumeClaim
 | |
| metadata:
 | |
|   namespace: default
 | |
|   name: openmetadata-dependencies-dags
 | |
|   labels:
 | |
|     storage.k8s.io/name: nfs
 | |
|     app: airflow
 | |
| spec:
 | |
|   accessModes:
 | |
|     - ReadWriteMany
 | |
|   storageClassName: nfs-client
 | |
|   resources:
 | |
|     requests:
 | |
|       storage: 1Gi
 | |
| ```
 | |
| 
 | |
| Create Persistent Volumes and Persistent Volume claims with the below command.
 | |
| 
 | |
| ```commandline
 | |
| kubectl create -f dags_pvc.yml
 | |
| ```
 | |
| 
 | |
| ### Code Samples for PVC for Airflow Logs
 | |
| 
 | |
| ```yaml
 | |
| # logs_pvc.yml
 | |
| apiVersion: v1
 | |
| kind: PersistentVolumeClaim
 | |
| metadata:
 | |
|   namespace: default
 | |
|   name: openmetadata-dependencies-logs
 | |
|   labels:
 | |
|     storage.k8s.io/name: nfs
 | |
|     app: airflow
 | |
| spec:
 | |
|   accessModes:
 | |
|     - ReadWriteMany
 | |
|   storageClassName: nfs-client
 | |
|   resources:
 | |
|     requests:
 | |
|       storage: 10Gi
 | |
| ```
 | |
| 
 | |
| Create Persistent Volumes and Persistent Volume claims with the below command.
 | |
| 
 | |
| ```commandline
 | |
| kubectl create -f logs_pvc.yml
 | |
| ```
 | |
| 
 | |
| ## Change owner and permission manually on disks
 | |
| 
 | |
| Since airflow pods run as non-root users, they would not have write access on the nfs server volumes. In order to fix the permission here, spin up a pod with persistent volumes attached and run it once.
 | |
| 
 | |
| ```yaml
 | |
| # permissions_pod.yml
 | |
| apiVersion: v1
 | |
| kind: Pod
 | |
| metadata:
 | |
|   creationTimestamp: null
 | |
|   labels:
 | |
|     run: my-permission-pod
 | |
|   name: my-permission-pod
 | |
| spec:
 | |
|   containers:
 | |
|   - image: busybox
 | |
|     name: my-permission-pod
 | |
|     volumeMounts:
 | |
|     - name: airflow-dags
 | |
|       mountPath: /airflow-dags
 | |
|     - name: airflow-logs
 | |
|       mountPath: /airflow-logs
 | |
|     command:
 | |
|     - "chown -R 50000 /airflow-dags /airflow-logs"
 | |
|     # if needed
 | |
|     - "chmod -R a+rwx /airflow-dags"
 | |
|   volumes:
 | |
|   - name: airflow-logs
 | |
|     persistentVolumeClaim:
 | |
|       claimName: openmetadata-dependencies-logs
 | |
|   - name: airflow-dags
 | |
|     persistentVolumeClaim:
 | |
|       claimName: openmetadata-dependencies-dags
 | |
|   dnsPolicy: ClusterFirst
 | |
|   restartPolicy: Always
 | |
| ```
 | |
| 
 | |
| {%note%}
 | |
| 
 | |
| Airflow runs the pods with linux username as airflow and linux user id as 50000.
 | |
| 
 | |
| {%/note%}
 | |
| 
 | |
| Run the below command to create the pod and fix the permissions
 | |
| 
 | |
| ```commandline
 | |
| kubectl create -f permissions_pod.yml
 | |
| ```
 | |
| 
 | |
| ## Create OpenMetadata dependencies Values
 | |
| 
 | |
| Override openmetadata dependencies airflow helm values to bind the nfs persistent volumes for DAGs and logs.
 | |
| 
 | |
| ```yaml
 | |
| # values-dependencies.yml
 | |
| airflow:
 | |
|   airflow:
 | |
|     extraVolumeMounts:
 | |
|       - mountPath: /airflow-logs
 | |
|         name: nfs-airflow-logs
 | |
|       - mountPath: /airflow-dags/dags
 | |
|         name: nfs-airflow-dags
 | |
|     extraVolumes:
 | |
|       - name: nfs-airflow-logs
 | |
|         persistentVolumeClaim:
 | |
|           claimName: openmetadata-dependencies-logs
 | |
|       - name: nfs-airflow-dags
 | |
|         persistentVolumeClaim:
 | |
|           claimName: openmetadata-dependencies-dags
 | |
|     config:
 | |
|       AIRFLOW__OPENMETADATA_AIRFLOW_APIS__DAG_GENERATED_CONFIGS: "/airflow-dags/dags"
 | |
|   dags:
 | |
|     path: /airflow-dags/dags
 | |
|     persistence:
 | |
|       enabled: false
 | |
|   logs:
 | |
|     path: /airflow-logs
 | |
|     persistence:
 | |
|       enabled: false
 | |
| ```
 | |
| 
 | |
| For more information on airflow helm chart values, please refer to [airflow-helm](https://artifacthub.io/packages/helm/airflow-helm/airflow/8.8.0).
 | |
| 
 | |
| When deploying openmetadata dependencies helm chart, use the below command -
 | |
| 
 | |
| ```commandline
 | |
| helm install openmetadata-dependencies open-metadata/openmetadata-dependencies --values values-dependencies.yaml
 | |
| ```
 | |
| 
 | |
| {%note%}
 | |
| 
 | |
| The above command uses configurations defined [here](https://raw.githubusercontent.com/open-metadata/openmetadata-helm-charts/main/charts/deps/values.yaml). 
 | |
| You can modify any configuration and deploy by passing your own `values.yaml`
 | |
| 
 | |
| ```commandline
 | |
| helm install openmetadata-dependencies open-metadata/openmetadata-dependencies --values <path-to-values-file>
 | |
| ```
 | |
| 
 | |
| {%/note%}
 | |
| 
 | |
| Once the openmetadata dependencies helm chart deployed, you can then run the below command to install the openmetadata helm chart - 
 | |
| 
 | |
| ```commandline
 | |
| helm install openmetadata open-metadata/openmetadata
 | |
| ```
 | |
| 
 | |
| Again, this uses the values defined [here](https://github.com/open-metadata/openmetadata-helm-charts/blob/main/charts/openmetadata/values.yaml).
 | |
| Use the `--values` flag to point to your own YAML configuration if needed.
 | |
| 
 | |
| # FAQs
 | |
| 
 | |
| {% partial file="/v1.5/deployment/faqs.md" /%} | 
