3803 Commits

Author SHA1 Message Date
Malte Pietsch
e1962a4e4b update sketch concepts 2020-07-24 21:05:24 +02:00
Malte Pietsch
6283348096 add concept sketch 2020-07-24 21:01:37 +02:00
antoniolanza1996
b55de6f70a
Added support for unanswerable questions in TransformersReader (#258)
* Added support for unanswerable questions in TransformersReader

Co-authored-by: Antonio Lanza <anotniolanza1996@gmail.com>
2020-07-23 10:45:58 +02:00
Timo Moeller
f0d901a374 Simplify farmreader predict 2020-07-23 10:27:43 +02:00
Malte Pietsch
ce50718103
Update README.rst 2020-07-20 14:58:12 +02:00
antoniolanza1996
cdaa6f0c66
Fix type of query_emb in DPR.retrieve() (#247) 2020-07-18 22:13:52 +02:00
Malte Pietsch
5b1be233d0 Update Tutorial 4 2020-07-17 19:31:00 +02:00
Malte Pietsch
355be293b6
Fix return type of EmbeddingRetriever to numpy array (#245) 2020-07-17 19:03:31 +02:00
Malte Pietsch
4da480aa15 Fix dockerfiles 2020-07-16 15:58:49 +02:00
Tanay Soni
5210c8c2ab
Add method to update meta fields for documents in Elasticsearch (#242) 2020-07-16 15:34:55 +02:00
Malte Pietsch
a6ec430931 Fix readme rst syntax 0.3.0 2020-07-16 13:27:44 +02:00
Malte Pietsch
d2d048c9fa Upgrade version number to 0.3.0 2020-07-16 13:21:00 +02:00
Malte Pietsch
1289cc6fbb
Fix format of /export-doc-qa-feedback to comply with SQuAD (#241) 2020-07-16 13:17:45 +02:00
Tanay Soni
292b599cdd
Remove meta field when indexing in Elasticsearch (#240) 2020-07-16 13:11:04 +02:00
Malte Pietsch
cec6a0e821
Update README.rst 2020-07-16 11:05:25 +02:00
Malte Pietsch
6bed2f509f
Refactor DPR for latest transformers version & change init arg gpu -> use_gpu for DPR and EmbeddingRetriever (#239)
* fix tokenizer warning in latest transformers

* change dpr arg from gpu to use_gpu

* change gpu arg for EmbeddingRetriever
2020-07-16 10:45:01 +02:00
Malte Pietsch
e5b6546112 Change default reader for REST API 2020-07-16 10:02:06 +02:00
Malte Pietsch
337680baf5
Update README.rst 2020-07-16 09:38:39 +02:00
Anirban Saha
7e24620159
Update readme (#229) 2020-07-15 19:14:25 +02:00
Malte Pietsch
c9d3146fae
Fix multi-gpu training via DataParallel (#234) 2020-07-15 18:34:55 +02:00
Tanay Soni
5c1a5fe61d
Add dummy retriever for benchmarking / reader-only settings (#235) 2020-07-15 17:22:17 +02:00
Malte Pietsch
eb658d308e Upgrade version to 0.2.2 2020-07-15 17:07:29 +02:00
Branden Chan
36867dabac change from top_n_recall to accuracy 2020-07-15 17:05:08 +02:00
Branden Chan
ec795314dc
Merge pull request #233 from deepset-ai/update_eval_data
Fix Evaluation Dataset
2020-07-15 16:45:29 +02:00
Branden Chan
64721d3196 One more update 2020-07-15 16:24:10 +02:00
Branden Chan
c55477e0ce update eval dataset 2020-07-15 16:14:52 +02:00
Tanay Soni
912e98cd40
Fix id for documents returned by the TfidfRetriever (#232) 2020-07-15 14:55:07 +02:00
Tanay Soni
4e10a1520d
Remove mutation of documents in write_documents() (#231) 2020-07-15 13:10:52 +02:00
Tanay Soni
e1d64c2c68
Fix print_answers to not delete keys from passed results object (#230) 2020-07-15 12:49:14 +02:00
Malte Pietsch
99a6a34047
Upgrade to new FARM / Transformers / PyTorch versions (#212) 2020-07-14 18:53:15 +02:00
Malte Pietsch
17c1b84c21
Fix API links in readme 2020-07-14 17:42:06 +02:00
Anirban Saha
6b217732f5
Add basic support for Docx Files (#225) 2020-07-14 12:28:19 +02:00
Tanay Soni
b886e054a3
Move document_name attribute to meta (#217) 2020-07-14 09:53:31 +02:00
Tanay Soni
4c21556a79
Fix embedding method for Retriever (#220) 2020-07-13 12:38:01 +02:00
Malte Pietsch
5023fde2be Update issue templates 2020-07-13 10:45:58 +02:00
Malte Pietsch
d2b26a99ff
Add more tests (#213) 2020-07-10 10:54:56 +02:00
Malte Pietsch
549f3a1285
Harmonize meta data handling across doc stores (#214) 2020-07-09 15:25:48 +02:00
Guillim
c45d54959f
Fix Dockerfile to build successfully without models directory (#210) 2020-07-08 17:12:20 +02:00
antoniolanza1996
4b9954c4a6
Update docstring for embedding_field and embedding_dim (#208)
'embedding_field' and 'embedding_dim' are useful not only for EmbeddingRetriever but also for DensePassageRetriever. Hence, could be good if you generalise it as "dense retriever".
2020-07-08 13:36:49 +02:00
Malte Pietsch
bda333dca4 Update code snippet for readme 2020-07-08 11:43:43 +02:00
Guillim
9733e0bd1b
Fix naming of offset in answers of TransformersReader (for consistency with FARMReader) (#204) 2020-07-07 19:29:12 +02:00
Guillim
b4f4388e09
gitignore fix (#203) 2020-07-07 19:24:48 +02:00
Guillim
8a616dae75
Adjust Docker and REST API to allow TransformsReader Class (#180) 2020-07-07 16:25:36 +02:00
Malte Pietsch
fe33a481ad
Update tutorials (#200)
* fix link in readme. update installation in tutorials

* update haystack version to latest master

* add basic documentation for input to write_documents()

* add docstring for sqldocumentstore

* comment out docker in notebook
2020-07-07 14:59:01 +02:00
Tanay Soni
ff7e35581b
Add response time in logs (#201) 2020-07-07 12:28:41 +02:00
Tanay Soni
68d604d82b
Add response for successful file upload (#195) 2020-07-06 17:35:47 +02:00
Tanay Soni
0eb8a29da9
Adjust PDF conversion subprocess for Python v3.6 (#194) 2020-07-06 16:56:10 +02:00
Malte Pietsch
7ada5a91cd
Update README.rst 2020-07-03 16:08:26 +02:00
Malte Pietsch
c36f8c991e Update Tutorial 6 2020-07-03 16:06:46 +02:00
Malte Pietsch
8a9f97fad3
Tutorial for Dense Passage Retriever (#186) 2020-07-03 15:53:58 +02:00