2021-03-01 17:36:38 -08:00
|
|
|
# DataHub Metadata Ingestion Docker Image
|
2025-04-16 16:55:51 -07:00
|
|
|
|
2022-03-18 22:12:19 +01:00
|
|
|
[](https://github.com/datahub-project/datahub/actions/workflows/docker-ingestion.yml)
|
2021-03-01 17:36:38 -08:00
|
|
|
|
|
|
|
Refer to the [metadata ingestion framework](../../metadata-ingestion) to understand the architecture and responsibilities of this service.
|
2023-11-13 16:26:53 -06:00
|
|
|
|
|
|
|
## Slim vs Full Image Build
|
|
|
|
|
|
|
|
There are two versions of this image. One includes pyspark and Oracle dependencies and is larger due to the java dependencies.
|
|
|
|
|
|
|
|
Running the standard build results in the `slim` image without pyspark being generated by default. In order to build the full
|
|
|
|
image with pyspark use the following project property `-PdockerTarget=full`.
|