40 Commits

Author SHA1 Message Date
Massimiliano Pippi
722919c452
use the latest Python image (#7540) 2024-04-11 19:05:03 +02:00
Bilge Yücel
b8fc86eb6a
Update Docker README.md (#7369)
* Update Docker README.md

* mention 1.x/2.0

---------

Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
2024-04-05 16:20:56 +02:00
Tobias Wochinger
79b32cd755
build: fix CVE-2022-40897 (#7080) 2024-02-23 16:58:39 +01:00
Massimiliano Pippi
1887dac667
ci: push Docker image for 2.x (#6479)
* cleanup

* use a smaller runner

* main branch
2023-12-04 12:52:58 +01:00
Stefano Fiorucci
4782bc3e93
unpin trio (#6239) 2023-11-06 13:26:26 +01:00
Massimiliano Pippi
03015877f3
chore: pin trio to <0.23 (#6227)
* chore: pin trio to <0.23

* Update Dockerfile.base

---------

Co-authored-by: Stefano Fiorucci <44616784+anakin87@users.noreply.github.com>
2023-11-03 14:46:18 +01:00
Christian Clauss
6dd52d91b2
ci: Fix typos discovered by codespell (#5778)
* Fix typos discovered by codespell

* pylint: max-args = 38
2023-09-13 16:14:45 +02:00
Vladimir Blagojevic
094d8578bd
feat: Update Docker readme (#5536)
* Update Docker readme

* Update wording

---------

Co-authored-by: agnieszka-m <amarzec13@gmail.com>
2023-08-11 14:06:12 +02:00
Vladimir Blagojevic
395854d823
Add cpu-remote-inference Docker image (#5225)
* Add cpu-remote-inference Docker image

* Add web lfqa pipeline as an example for cpu-remote-inference Docker image

* WebRetriever must have document_store attribute

* Add cpu-remote-inference-latest

* Add image testing in CI

---------

Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>
2023-07-07 10:23:14 +02:00
Vladimir Blagojevic
ad6072728d
Add dependencies to build lxml successfully (#5288) 2023-07-06 12:53:28 +02:00
Stefano Fiorucci
cbc9dcfdad
add inference dependency to docker images (#5215) 2023-06-27 11:47:40 +02:00
ZanSara
d8ac30fa47
refactor!: extract preprocessing and file conversion deps (#4605)
* isolate file-conversion deps

* pylint

* add to all extra

* chain was missing

* move langdetect into preprocessing and fix tika

* add file-conversion extra
2023-04-14 11:34:16 +02:00
ZanSara
ba11d1c2a8
refactor!: extract evaluation and statistical dependencies (#4457)
* try-catch sklearn and scipy

* haystack imports

* linting

* mypy

* try to import baseretriever

* remove typing

* unused import

* remove more typing

* pylint

* isolate sql imports for postgres, which we don't use anyway

* remove stats

* replace expit

* als inmemory

* mypy

* feedback

* docker

* expit

* re-add njit
2023-04-12 15:38:56 +02:00
Massimiliano Pippi
52fb935936
build xpdf on bionic (#4606) 2023-04-05 15:52:44 +02:00
Massimiliano Pippi
322652c306
fix: provide a fallback for PyMuPDF (#4564)
* add a fallback xpdf alternative to PyMuPDF

* add xpdpf to the base images

* to be reverted

* silence mypy on conditional error

* do not install pdf extras in base images

* bring back the xpdf build strategy

* remove leftovers from old build

* fix indentation

* Apply suggestions from code review

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>

* revert test workflow

---------

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
2023-03-31 14:37:05 +02:00
bogdankostic
e3503a92c9
build: Use uvicorn instead of gunicorn as server in REST API's Dockerfile (#4304)
* Use uvicorn instead of gunicorn as server

* Added comments and changed service names

* comments improvised

---------

Co-authored-by: Mayank Jobanputra <mayankjobanputra@gmail.com>
2023-03-09 01:46:07 +05:30
bogdankostic
f33829fabf
Remove xpdf dependencies (#4314) 2023-03-02 11:12:03 +01:00
Silvano Cerza
87a02d9372
Fix Dockerfile.base failing cause of missing dependencies (#4215) 2023-02-21 16:37:33 +01:00
Mayank Jobanputra
c4b98fcccc
allowing file-upload api to work with write permission (#4221) 2023-02-21 16:48:02 +05:30
Silvano Cerza
f5b8835e2c
ci: Fix Dockerfile.base failing cause of missing git (#4210) 2023-02-20 18:40:30 +01:00
Silvano Cerza
e6af353530
ci: Add ca-certificates installation to xpdf container (#4206) 2023-02-20 17:47:10 +01:00
Silvano Cerza
30cdb81f19
ci: Move xpdf build into separate container (#4199)
* Create Dockerfile and hcl config to build Xpdf

* Create workflow to build Xpdf Docker image

* Update Dockerfile.base to not build Xpdf

* Fix CWD removal and arg casing

* Fix ARG setting
2023-02-20 14:58:11 +01:00
Mayank Jobanputra
d27f372b67
build: cache nltk models into the docker image (#4118)
* separated nltk cache

* separated nltk caching

* fixed pylint lazy log error

* using model name as default value
2023-02-16 16:56:16 +05:30
Mayank Jobanputra
93962c09fc
fix: fix torchaudio version (#4102)
* fix torchaudio version

* added comment for keeping torchaudio last

* removed torchaudio from base
2023-02-09 15:14:10 +05:30
Mayank Jobanputra
fa17f0973e
chore: increased timeout for loading pipelines through API (#3977)
* increased timeout

* Added comment for users to increase timeout while using docker compose file

* changed the comment with appropriate msg

* changed the comment indent

* changed the indent again
2023-01-30 11:30:47 +01:00
Silvano Cerza
a05836589b
ci: Add Docker images testing (#3943)
* Fix typo in Dockerfile.base ARG

* Add workflow to test Docker images

* Fix base image name

* Simplified Docker images testing

* Fix wrong command to retrieve current version

Co-authored-by: Mayank Jobanputra <mayankjobanputra@gmail.com>
2023-01-27 09:48:05 +01:00
ZanSara
3ffdb0a9a3
chore: fix all EOF (#3852)
* fix all eof

* fix test

* fix test

* fix test

* typo

* fix sample

* fix sample

* add logs

* fix page_dynamic_result.txt
2023-01-16 12:34:50 +01:00
Fabian
e53cc2bc3f
fix(docker): Use IMAGE_NAME in api image (#3786)
If you set the IMAGE_NAME variable, then the base image will use that name,
but the api image would previously use a hardcoded `deepset/haystack` image name.
2023-01-03 12:26:26 +01:00
Massimiliano Pippi
19c7725319
feat: utility function to explicitly invoke JSON schema generation (#3798)
* explicitly cache the JSON schema

* fix import path

* move to final
2023-01-02 17:06:24 +01:00
Vladimir Blagojevic
19e9b06b4e
feat: Bump python to 3.10 for gpu docker image, use nvidia/cuda (#3701)
* Update pytorch base image

* Small corrections

* Revert back to load_schema() call

* reverted to import haystack for schema generation

Co-authored-by: Mayank Jobanputra <mayankjobanputra@gmail.com>
2022-12-30 16:04:27 +05:30
Vladimir Blagojevic
890e2bf0f5
feat: Run commands inside docker container as a non root user (#3702) 2022-12-27 21:36:42 +01:00
Massimiliano Pippi
450c3d4484
fix: build pdftotext from sources (#3746)
* build pdftotext from sources

* trigger the build on my own PR - to be reverted

* trigger the build on my own PR - to be reverted

* Update docker_release.yml
2022-12-22 18:37:36 +01:00
Sebastian
4afdbc33b2
fix: Removed overlooked torch scatter references (#3719)
* Removed torch scatter references

* Add back /
2022-12-16 10:36:19 +01:00
Sebastian
54bf7ad343
Remove && \ from end of line (#3710) 2022-12-13 21:29:18 +05:30
Sebastian
d0f786af9f
feat: Bump transformers version to remove torch scatter dependency (#3703)
* Bump transformers version so we can remove torch scatter dependency

* manual re-merge

Co-authored-by: Mayank Jobanputra <mayankjobanputra@gmail.com>
2022-12-13 18:33:07 +05:30
Sara Zan
f24cbdbb5d
remove beir from the base GPU image (#3692) 2022-12-13 11:11:58 +01:00
Branden Chan
e6b7109164
docs: Update docker readme (#3531)
* Update docker readme

* Make language changes
2022-11-08 09:06:18 +01:00
Massimiliano Pippi
8ddb6d7821
feat: add multi-platform Docker images (#3354)
* add arm platform to the build

* add a note about multi-platforms build

* test on current branch

* setup qemu on Github actions

* better naming

* Revert "test on current branch"

This reverts commit b0e5ea77b46e3e0bafd579c95e434c6a3c8ef84f.
2022-10-11 12:29:33 +02:00
Massimiliano Pippi
859c303c16
include fontconfig in the final image and fix tagging (#3230) 2022-09-16 15:33:24 +02:00
Massimiliano Pippi
64b0c43885
refactoring: reimplement Docker strategy (#3162)
* setup base images

* add cpu flavor

* use the same Dockerfile for cpu and gpu

* better naming, add docs

* add docker workflow

* add missing image input

* change cwd for bake

* also push api images

* try conditional tagging for releases

* revert testing code

* update docker readme

* document variable override

* use Python 3.10

* allow empty HAYSTACK_EXTRAS

* Apply suggestions from code review

Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai>

* remove repo description step, can't make it work so far

* add docs to the last step as it's tricky

* manage tags for the newest images

* tests are passing, checking in the last bit

Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai>
2022-09-12 16:33:56 +02:00