haystack/docker/README.md

<p align="center">
  <a href="https://haystack.deepset.ai/"><img src="https://raw.githubusercontent.com/deepset-ai/.github/main/haystack-logo-colored.png" alt="Haystack by deepset"></a>
</p>

[Haystack](https://github.com/deepset-ai/haystack) is an end-to-end LLM framework that allows you to build applications powered by LLMs, Transformer models, vector search and more. Whether you want to perform retrieval-augmented generation (RAG), document search, question answering or answer generation, Haystack can orchestrate state-of-the-art embedding models and LLMs into pipelines to build end-to-end NLP applications and solve your use case.

## Haystack 2.0

For the latest version of Haystack there's only one image available:

- `haystack:base-<version>` contains a working Python environment with Haystack preinstalled. This image is expected to
  be derived `FROM`.

## Haystack 1.x image variants

The Docker image for Haystack 1.x comes in six variants:
- `haystack:gpu-<version>` contains Haystack dependencies as well as what's needed to run the REST API and UI. It comes with the CUDA runtime and is capable of running on GPUs.
- `haystack:cpu-remote-inference-<version>` is a slimmed down version of the CPU image with the REST API and UI. It is specifically designed for PromptNode inferencing using remotely hosted models, such as Hugging Face Inference, OpenAI, Cohere, Anthropic, and similar.
- `haystack:cpu-<version>` contains Haystack dependencies as well as what's needed to run the REST API and UI. It has no support for GPU so must be run on CPU.
- `haystack:base-gpu-<version>` only contains the Haystack dependencies. It comes with the CUDA runtime and can run on GPUs.
- `haystack:base-cpu-remote-inference-<version>` is a slimmed down version of the CPU image, specifically designed for PromptNode inferencing using remotely hosted models, such as Hugging Face Inference, OpenAI, Cohere, Anthropic, and similar.
- `haystack:base-cpu-<version>` only contains the Haystack dependencies. It has no support for GPU so must be run on CPU.

## Image Development

Images are built with BuildKit and we use `bake` to orchestrate the process.
You can build a specific image by running:
```sh
docker buildx bake gpu
```

You can override any `variable` defined in the `docker-bake.hcl` file and build custom
images, for example if you want to use a branch from the Haystack repo, run:
```sh
HAYSTACK_VERSION=mybranch_or_tag BASE_IMAGE_TAG_SUFFIX=latest docker buildx bake gpu --no-cache
```

### Multi-Platform Builds

Haystack images support multiple architectures. But depending on your operating system and Docker
environment, you might not be able to build all of them locally.

You may encounter the following error when trying to build the image:

```
multiple platforms feature is currently not supported for docker driver. Please switch to a different driver
(eg. “docker buildx create --use”)
```

To get around this, you need to override the `platform` option and limit local builds to the same architecture as
your computer's. For example, on an Apple M1 you can limit the builds to ARM only by invoking `bake` like this:

```sh
docker buildx bake base-cpu --set "*.platform=linux/arm64"
```

# License

View [license information](https://github.com/deepset-ai/haystack/blob/main/LICENSE) for
the software contained in this image.

As with all Docker images, these likely also contain other software which may be under
other licenses (such as Bash, etc from the base distribution, along with any direct or
indirect dependencies of the primary software being contained).

As for any pre-built image usage, it is the image user's responsibility to ensure that any
use of this image complies with any relevant licenses for all software contained within.
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			`<p align="center">`
Update Docker README.md (#7369) * Update Docker README.md * mention 1.x/2.0 --------- Co-authored-by: Massimiliano Pippi <mpippi@gmail.com> 2024-04-05 17:20:56 +03:00			`<a href="https://haystack.deepset.ai/"><img src="https://raw.githubusercontent.com/deepset-ai/.github/main/haystack-logo-colored.png" alt="Haystack by deepset"></a>`
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			`</p>`
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00
Update Docker README.md (#7369) * Update Docker README.md * mention 1.x/2.0 --------- Co-authored-by: Massimiliano Pippi <mpippi@gmail.com> 2024-04-05 17:20:56 +03:00			`[Haystack](https://github.com/deepset-ai/haystack) is an end-to-end LLM framework that allows you to build applications powered by LLMs, Transformer models, vector search and more. Whether you want to perform retrieval-augmented generation (RAG), document search, question answering or answer generation, Haystack can orchestrate state-of-the-art embedding models and LLMs into pipelines to build end-to-end NLP applications and solve your use case.`
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00
Update Docker README.md (#7369) * Update Docker README.md * mention 1.x/2.0 --------- Co-authored-by: Massimiliano Pippi <mpippi@gmail.com> 2024-04-05 17:20:56 +03:00			`## Haystack 2.0`
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00
Update Docker README.md (#7369) * Update Docker README.md * mention 1.x/2.0 --------- Co-authored-by: Massimiliano Pippi <mpippi@gmail.com> 2024-04-05 17:20:56 +03:00			`For the latest version of Haystack there's only one image available:`
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00
Update Docker README.md (#7369) * Update Docker README.md * mention 1.x/2.0 --------- Co-authored-by: Massimiliano Pippi <mpippi@gmail.com> 2024-04-05 17:20:56 +03:00			- `haystack:base-<version>` contains a working Python environment with Haystack preinstalled. This image is expected to
			be derived `FROM`.
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00
Update Docker README.md (#7369) * Update Docker README.md * mention 1.x/2.0 --------- Co-authored-by: Massimiliano Pippi <mpippi@gmail.com> 2024-04-05 17:20:56 +03:00			`## Haystack 1.x image variants`

			`The Docker image for Haystack 1.x comes in six variants:`
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			- `haystack:gpu-<version>` contains Haystack dependencies as well as what's needed to run the REST API and UI. It comes with the CUDA runtime and is capable of running on GPUs.
feat: Update Docker readme (#5536) * Update Docker readme * Update wording --------- Co-authored-by: agnieszka-m <amarzec13@gmail.com> 2023-08-11 14:06:12 +02:00			- `haystack:cpu-remote-inference-<version>` is a slimmed down version of the CPU image with the REST API and UI. It is specifically designed for PromptNode inferencing using remotely hosted models, such as Hugging Face Inference, OpenAI, Cohere, Anthropic, and similar.
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			- `haystack:cpu-<version>` contains Haystack dependencies as well as what's needed to run the REST API and UI. It has no support for GPU so must be run on CPU.
feat: Update Docker readme (#5536) * Update Docker readme * Update wording --------- Co-authored-by: agnieszka-m <amarzec13@gmail.com> 2023-08-11 14:06:12 +02:00			- `haystack:base-gpu-<version>` only contains the Haystack dependencies. It comes with the CUDA runtime and can run on GPUs.
			- `haystack:base-cpu-remote-inference-<version>` is a slimmed down version of the CPU image, specifically designed for PromptNode inferencing using remotely hosted models, such as Hugging Face Inference, OpenAI, Cohere, Anthropic, and similar.
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			- `haystack:base-cpu-<version>` only contains the Haystack dependencies. It has no support for GPU so must be run on CPU.
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			`## Image Development`
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00
			Images are built with BuildKit and we use `bake` to orchestrate the process.
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			`You can build a specific image by running:`
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00			```sh
			`docker buildx bake gpu`
			```

			You can override any `variable` defined in the `docker-bake.hcl` file and build custom
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			`images, for example if you want to use a branch from the Haystack repo, run:`
refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00			```sh
			`HAYSTACK_VERSION=mybranch_or_tag BASE_IMAGE_TAG_SUFFIX=latest docker buildx bake gpu --no-cache`
			```

docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			`### Multi-Platform Builds`

			`Haystack images support multiple architectures. But depending on your operating system and Docker`
chore: fix all EOF (#3852) * fix all eof * fix test * fix test * fix test * typo * fix sample * fix sample * add logs * fix page_dynamic_result.txt 2023-01-16 12:34:50 +01:00			`environment, you might not be able to build all of them locally.`
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00
			`You may encounter the following error when trying to build the image:`
feat: add multi-platform Docker images (#3354) * add arm platform to the build * add a note about multi-platforms build * test on current branch * setup qemu on Github actions * better naming * Revert "test on current branch" This reverts commit b0e5ea77b46e3e0bafd579c95e434c6a3c8ef84f. 2022-10-11 12:29:33 +02:00
			```
			`multiple platforms feature is currently not supported for docker driver. Please switch to a different driver`
			`(eg. “docker buildx create --use”)`
			```

docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00			To get around this, you need to override the `platform` option and limit local builds to the same architecture as
feat: add multi-platform Docker images (#3354) * add arm platform to the build * add a note about multi-platforms build * test on current branch * setup qemu on Github actions * better naming * Revert "test on current branch" This reverts commit b0e5ea77b46e3e0bafd579c95e434c6a3c8ef84f. 2022-10-11 12:29:33 +02:00			your computer's. For example, on an Apple M1 you can limit the builds to ARM only by invoking `bake` like this:
docs: Update docker readme (#3531) * Update docker readme * Make language changes 2022-11-08 09:06:18 +01:00
feat: add multi-platform Docker images (#3354) * add arm platform to the build * add a note about multi-platforms build * test on current branch * setup qemu on Github actions * better naming * Revert "test on current branch" This reverts commit b0e5ea77b46e3e0bafd579c95e434c6a3c8ef84f. 2022-10-11 12:29:33 +02:00			```sh
			`docker buildx bake base-cpu --set "*.platform=linux/arm64"`
			```

refactoring: reimplement Docker strategy (#3162) * setup base images * add cpu flavor * use the same Dockerfile for cpu and gpu * better naming, add docs * add docker workflow * add missing image input * change cwd for bake * also push api images * try conditional tagging for releases * revert testing code * update docker readme * document variable override * use Python 3.10 * allow empty HAYSTACK_EXTRAS * Apply suggestions from code review Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * remove repo description step, can't make it work so far * add docs to the last step as it's tricky * manage tags for the newest images * tests are passing, checking in the last bit Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> 2022-09-12 16:33:56 +02:00			`# License`

			`View [license information](https://github.com/deepset-ai/haystack/blob/main/LICENSE) for`
			`the software contained in this image.`

			`As with all Docker images, these likely also contain other software which may be under`
			`other licenses (such as Bash, etc from the base distribution, along with any direct or`
			`indirect dependencies of the primary software being contained).`

			`As for any pre-built image usage, it is the image user's responsibility to ensure that any`
chore: fix all EOF (#3852) * fix all eof * fix test * fix test * fix test * typo * fix sample * fix sample * add logs * fix page_dynamic_result.txt 2023-01-16 12:34:50 +01:00			`use of this image complies with any relevant licenses for all software contained within.`