510 Commits

Author SHA1 Message Date
Tanay Soni
d90435efd6 Add wait for Elasticsearch update call 2020-07-31 12:06:27 +02:00
Malte Pietsch
29a15c0d59
Add eval for Dense Passage Retriever & Refactor handling of labels/feedback (#243) 2020-07-31 11:34:06 +02:00
Tanay Soni
52370c7bd4
Update README.rst 2020-07-30 08:59:56 +02:00
Timo Moeller
5541a53f2d
Add export answers to CSV function (#266)
2020-07-29 17:10:44 +02:00
Malte Pietsch
abec1be722
Add num_processes to reader.train() to configure multiprocessing (#271) 2020-07-29 16:28:23 +02:00
Malte Pietsch
52a805be86
Update README.rst 2020-07-24 21:11:28 +02:00
Malte Pietsch
02ae0ccad1 Resize sketch concepts 2020-07-24 21:09:31 +02:00
Malte Pietsch
e1962a4e4b update sketch concepts 2020-07-24 21:05:24 +02:00
Malte Pietsch
6283348096 add concept sketch 2020-07-24 21:01:37 +02:00
antoniolanza1996
b55de6f70a
Added support for unanswerable questions in TransformersReader (#258)
Co-authored-by: Antonio Lanza <anotniolanza1996@gmail.com>
2020-07-23 10:45:58 +02:00
Timo Moeller
f0d901a374 Simplify farmreader predict 2020-07-23 10:27:43 +02:00
Malte Pietsch
ce50718103
Update README.rst 2020-07-20 14:58:12 +02:00
antoniolanza1996
cdaa6f0c66
Fix type of query_emb in DPR.retrieve() (#247) 2020-07-18 22:13:52 +02:00
Malte Pietsch
5b1be233d0 Update Tutorial 4 2020-07-17 19:31:00 +02:00
Malte Pietsch
355be293b6
Fix return type of EmbeddingRetriever to numpy array (#245) 2020-07-17 19:03:31 +02:00
Malte Pietsch
4da480aa15 Fix dockerfiles 2020-07-16 15:58:49 +02:00
Tanay Soni
5210c8c2ab
Add method to update meta fields for documents in Elasticsearch (#242) 2020-07-16 15:34:55 +02:00
Malte Pietsch
a6ec430931 Fix readme rst syntax 0.3.0 2020-07-16 13:27:44 +02:00
Malte Pietsch
d2d048c9fa Upgrade version number to 0.3.0 2020-07-16 13:21:00 +02:00
Malte Pietsch
1289cc6fbb
Fix format of /export-doc-qa-feedback to comply with SQuAD (#241) 2020-07-16 13:17:45 +02:00
Tanay Soni
292b599cdd
Remove meta field when indexing in Elasticsearch (#240) 2020-07-16 13:11:04 +02:00
Malte Pietsch
cec6a0e821
Update README.rst 2020-07-16 11:05:25 +02:00
Malte Pietsch
6bed2f509f
Refactor DPR for latest transformers version & change init arg gpu -> use_gpu for DPR and EmbeddingRetriever (#239)
* fix tokenizer warning in latest transformers

* change dpr arg from gpu to use_gpu

* change gpu arg for EmbeddingRetriever
2020-07-16 10:45:01 +02:00
Malte Pietsch
e5b6546112 Change default reader for REST API 2020-07-16 10:02:06 +02:00
Malte Pietsch
337680baf5
Update README.rst 2020-07-16 09:38:39 +02:00
Anirban Saha
7e24620159
Update readme (#229) 2020-07-15 19:14:25 +02:00
Malte Pietsch
c9d3146fae
Fix multi-gpu training via DataParallel (#234) 2020-07-15 18:34:55 +02:00
Tanay Soni
5c1a5fe61d
Add dummy retriever for benchmarking / reader-only settings (#235) 2020-07-15 17:22:17 +02:00
Malte Pietsch
eb658d308e Upgrade version to 0.2.2 2020-07-15 17:07:29 +02:00
Branden Chan
36867dabac change from top_n_recall to accuracy 2020-07-15 17:05:08 +02:00
Branden Chan
ec795314dc
Merge pull request #233 from deepset-ai/update_eval_data
Fix Evaluation Dataset
2020-07-15 16:45:29 +02:00
Branden Chan
64721d3196 One more update 2020-07-15 16:24:10 +02:00
Branden Chan
c55477e0ce update eval dataset 2020-07-15 16:14:52 +02:00
Tanay Soni
912e98cd40
Fix id for documents returned by the TfidfRetriever (#232) 2020-07-15 14:55:07 +02:00
Tanay Soni
4e10a1520d
Remove mutation of documents in write_documents() (#231) 2020-07-15 13:10:52 +02:00
Tanay Soni
e1d64c2c68
Fix print_answers to not delete keys from passed results object (#230) 2020-07-15 12:49:14 +02:00
Malte Pietsch
99a6a34047
Upgrade to new FARM / Transformers / PyTorch versions (#212) 2020-07-14 18:53:15 +02:00
Malte Pietsch
17c1b84c21
Fix API links in readme 2020-07-14 17:42:06 +02:00
Anirban Saha
6b217732f5
Add basic support for Docx Files (#225) 2020-07-14 12:28:19 +02:00
Tanay Soni
b886e054a3
Move document_name attribute to meta (#217) 2020-07-14 09:53:31 +02:00
Tanay Soni
4c21556a79
Fix embedding method for Retriever (#220) 2020-07-13 12:38:01 +02:00
Malte Pietsch
5023fde2be Update issue templates 2020-07-13 10:45:58 +02:00
Malte Pietsch
d2b26a99ff
Add more tests (#213) 2020-07-10 10:54:56 +02:00
Malte Pietsch
549f3a1285
Harmonize meta data handling across doc stores (#214) 2020-07-09 15:25:48 +02:00
Guillim
c45d54959f
Fix Dockerfile to build successfully without models directory (#210) 2020-07-08 17:12:20 +02:00
antoniolanza1996
4b9954c4a6
Update docstring for embedding_field and embedding_dim (#208)
'embedding_field' and 'embedding_dim' are useful not only for EmbeddingRetriever but also for DensePassageRetriever, so it would be good to generalise the docstring to "dense retriever".
2020-07-08 13:36:49 +02:00
Malte Pietsch
bda333dca4 Update code snippet for readme 2020-07-08 11:43:43 +02:00
Guillim
9733e0bd1b
Fix naming of offset in answers of TransformersReader (for consistency with FARMReader) (#204) 2020-07-07 19:29:12 +02:00
Guillim
b4f4388e09
gitignore fix (#203) 2020-07-07 19:24:48 +02:00
Guillim
8a616dae75
Adjust Docker and REST API to allow TransformersReader Class (#180) 2020-07-07 16:25:36 +02:00