haystack/test/benchmarks/reader_results.csv
Branden Chan f3a3b73d9b
Choose correct similarity fns during benchmark runs & re-run benchmarks (#773)
* Adapt to new dataset_from_dicts return signature

* rename fn

* Align similarity fn in benchmark doc store

* Better choice of similarity fn

* Increase postgres wait time

* Add more expected returned variables

* update benchmark results

* Fix typo

* update all benchmark runs

* multiply stats by 100

* Specify similarity fns for website

Co-authored-by: Malte Pietsch <malte.pietsch@deepset.ai>
2021-02-03 11:45:18 +01:00

1.0 KiB

1EMf1top_n_accuracytop_nreader_timeseconds_per_querypassages_per_secondreadererror
200.78392044496881850.82588605752996580.9742120343839542598.163581737000640.008272676701247315125.81040525892847deepset/roberta-base-squad2
310.74380583178830270.78878584910070420.9719366256531266547.382580534998850.003993138423647299260.6443097981493deepset/minilm-uncased-squad2
420.69475813247935280.74311824004432860.95575594134501945101.998117793002170.008595829916821352121.08066567525722deepset/bert-base-cased-squad2
530.78973537839204460.83263067747343080.9769088151019725292.518864082005170.02465185100977626642.21949937744112deepset/bert-large-uncased-whole-word-masking-squad2
640.80212371481543910.84504226992074680.9740434855890785293.530387416001760.02473709652924336442.07400844838984deepset/xlm-roberta-large-squad2
750.37299848306084610.42319258447235740.9539019046013821555.4030112809996350.004669055391960192222.91207128366705distilbert-base-uncased-distilled-squad