
Benchmarks

Run the benchmarks with the following command:

python run.py [--reader] [--retriever_index] [--retriever_query] [--ci] [--update-json]
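
For example, to run only the reader benchmarks on the reduced CI configuration (both flags are described below):

    python run.py --reader --ci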

You can specify which components and processes to benchmark with the following flags.

--reader will trigger the speed and accuracy benchmarks for the reader. These use the SQuAD dev set.

--retriever_index will trigger the indexing benchmarks.

--retriever_query will trigger the querying benchmarks (embeddings are loaded from file instead of being computed on the fly).

--ci will cause the benchmarks to run on a smaller slice of each dataset and a smaller subset of Retriever / Reader / DocStore options.

--update-json will cause the script to update the JSON files in docs/_src/benchmarks so that the benchmarks shown on the website are updated.
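
To make the flag semantics concrete, here is a minimal sketch of how this interface could be wired up with Python's argparse. It is illustrative only: the benchmark_* helper functions are hypothetical placeholders, not names taken from the actual run.py.

    import argparse

    # Hypothetical helpers for illustration; the real run.py may be structured differently.
    def benchmark_reader(ci, update_json): ...
    def benchmark_indexing(ci, update_json): ...
    def benchmark_querying(ci, update_json): ...

    parser = argparse.ArgumentParser(description="Run benchmarks")
    parser.add_argument("--reader", action="store_true",
                        help="reader speed and accuracy benchmarks (SQuAD dev set)")
    parser.add_argument("--retriever_index", action="store_true",
                        help="indexing benchmarks")
    parser.add_argument("--retriever_query", action="store_true",
                        help="querying benchmarks (embeddings loaded from file)")
    parser.add_argument("--ci", action="store_true",
                        help="run on smaller dataset slices and fewer components")
    parser.add_argument("--update-json", action="store_true",
                        help="update the JSON files in docs/_src/benchmarks")
    args = parser.parse_args()

    if args.reader:
        benchmark_reader(ci=args.ci, update_json=args.update_json)
    if args.retriever_index:
        benchmark_indexing(ci=args.ci, update_json=args.update_json)
    if args.retriever_query:
        benchmark_querying(ci=args.ci, update_json=args.update_json)

Note that argparse converts the dash in --update-json to an underscore, so the flag is read as args.update_json.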