18 Commits

Author SHA1 Message Date
Julian Risch
f9d2f786ca
Replace FARM import statements; add dependencies (#1492)
* Replace FARM import statements; add dependencies

* Add InferenceProc., TextCl.Proc., TextPairCl.Proc.

* Remove FARMRanker, add type annotations, rename max_sample

* Add sample_to_features_text for InferenceProc.

* Fix type annotations: model_name_or_path is str not Path

* Fix mypy errors: implement _create_dataset in TextCl.Proc.

* Add task_type "embeddings" in Inferencer

* Allow loading AdaptiveModel for embedding task

* Add SQuAD eval metrics; enable InferenceProc for embedding task

* Add baskets as param to log_samples and handle empty basket list in log_samples

* Remove unused dependencies

* Remove FARMClassifier (doc classificer) due to ref to TextClassificationHead

* Remove FARMRanker and Classifier from doc generation scripts

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2021-09-28 16:34:24 +02:00
Malte Pietsch
a0921f0c35
Remove Finder (#1326)
* deprecate finder

* remove import

* add doc section for moving from finder to pipelines
2021-08-09 13:41:40 +02:00
oryx1729
3eaf9dfbca
Suppress FAISS logs & apex warnings (#1315) 2021-07-29 14:32:50 +02:00
Ikram Ali
d835a9cdc5
[setup] version tag added to Haystack fix #1175 (#1216) 2021-06-22 09:43:26 +02:00
Tanay Soni
8a5dc8f826
Load Pipeline with YAML config file (#785) 2021-02-02 17:32:17 +01:00
Tanay Soni
e3a68aedaf
Add support for building custom Search Pipelines (#596) 2020-11-20 17:41:08 +01:00
Malte Pietsch
9727829cc6
Rename and restructure modules (database, indexing, schemas) (#379)
* rename database to documentstore

* move document, label, multilabel to haystack/schema.py

* rename documentstore -> document_store

* split indexing modules -> file_converter + preprocessor

* fix order of imports

* Update tutorial notebooks

* fix torch version in tutorial 4
2020-09-16 18:33:23 +02:00
Malte Pietsch
c9d3146fae
Fix multi-gpu training via DataParallel (#234) 2020-07-15 18:34:55 +02:00
Malte Pietsch
2164e8550f
Add gpu dockerfile, improve logging, fix minor bug with filtering (#36)
* add gpu dockerfile. improve logging. fix minor bug with filtering

* fix path
2020-03-12 18:30:42 +01:00
Malte Pietsch
e3db47ff2f
Make Elasticsearch configuration more flexible (#29) 2020-02-27 15:11:55 +01:00
Timo Moeller
c0910c82c5
Merge pull request #26 from deepset-ai/no_answer
Add no answer handling
2020-02-24 16:27:59 +01:00
timoeller
0f5b61d20a Fix typo 2020-02-24 12:28:49 +01:00
Malte Pietsch
637d1b38da
Make eval more verbose (#28) 2020-02-24 09:04:11 +01:00
Tanay Soni
f83a164095
Add Elasticsearch Document Store (#13) 2020-01-24 18:24:07 +01:00
Malte Pietsch
1718ea55b8
Add method to train a reader on custom data (#5)
* initial version of training a reader WIP

* update for latest changes in FARM inferencer. Update tutorial. Add basic docs
2020-01-23 14:49:17 +01:00
Tanay Soni
b5b62c569e Refactor database layer 2020-01-22 15:33:18 +01:00
Malte Pietsch
cab0932fab
Refactor pipeline for better generalizability & Add TransformersReader (#1)
* add flag to skip writing docs to non-empty db

* change finder pipeline structure for better generalizability

* add basic TransformersReader

* update tutorials and requirements
2020-01-13 18:56:22 +01:00
Malte Pietsch
3367b46348 switch name from farm_haystack to haystack 2019-11-27 13:56:03 +01:00