haystack/test/benchmarks/retriever_simplified.py

"""
This script performs the same query benchmarking as `retriever.py`, but with fewer of the loops that iterate
over all parameter combinations, so that it is easier to inspect what is happening.
"""
import datetime
from pprint import pprint

from milvus import IndexType

from haystack.document_stores import MilvusDocumentStore, FAISSDocumentStore
from haystack.nodes import DensePassageRetriever

from retriever import prepare_data
from utils import get_document_store


def benchmark_querying(index_type, n_docs=100_000, similarity="dot_product"):
    doc_index = "document"
    label_index = "label"
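    # Load the evaluation documents and labels (with precomputed passage embeddings attached)
    # using the prepare_data helper shared with retriever.py.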
    docs, labels = prepare_data(
        data_dir="data/",
        filename_gold="nq2squad-dev.json",
        filename_negative="psgs_w100_minus_gold_100k.tsv",
        remote_url="https://ext-haystack-retriever-eval.s3-eu-west-1.amazonaws.com/",
        embeddings_filenames=["wikipedia_passages_100k.pkl"],
        embeddings_dir="embeddings/",
        n_docs=n_docs,
        add_precomputed=True,
    )
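    # Create a document store for the requested index type via the shared utils helper.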
    doc_store = get_document_store(document_store_type=index_type, similarity=similarity)
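    # The manual Milvus document store construction below is kept commented out for reference;
    # get_document_store above is used instead.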
    # if index_type == "milvus_flat":
    #     doc_store = MilvusDocumentStore(index=doc_index, similarity=similarity)
    # elif index_type == "milvus_hnsw":
    #     index_param = {"M": 64, "efConstruction": 80}
    #     search_param = {"ef": 20}
    #     doc_store = MilvusDocumentStore(
    #         index=doc_index,
    #         index_type=IndexType.HNSW,
    #         index_param=index_param,
    #         search_param=search_param,
    #         similarity=similarity
    #     )
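    # Write the documents and their gold labels into separate indices.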
    doc_store.write_documents(documents=docs, index=doc_index)
    doc_store.write_labels(labels=labels, index=label_index)
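    # DPR retriever with separate question and passage encoders (the pretrained Natural Questions checkpoints).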
    retriever = DensePassageRetriever(
        document_store=doc_store,
        query_embedding_model="facebook/dpr-question_encoder-single-nq-base",
        passage_embedding_model="facebook/dpr-ctx_encoder-single-nq-base",
        use_gpu=True,
        use_fast_tokenizers=True,
    )
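    # Evaluate retrieval against the gold labels; eval() reports query counts, retrieval time, recall and mAP.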
    raw_results = retriever.eval(label_index=label_index, doc_index=doc_index)
    results = {
        "n_queries": raw_results["n_questions"],
        "retrieve_time": raw_results["retrieve_time"],
        "queries_per_second": raw_results["n_questions"] / raw_results["retrieve_time"],
        "seconds_per_query": raw_results["retrieve_time"] / raw_results["n_questions"],
        "recall": raw_results["recall"] * 100,
        "map": raw_results["map"] * 100,
        "top_k": raw_results["top_k"],
        "date_time": datetime.datetime.now(),
        "error": None,
    }
    pprint(results)
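    # Remove the indexed documents and labels so the next run starts from a clean store.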
    doc_store.delete_all_documents(index=doc_index)
    doc_store.delete_all_documents(index=label_index)
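

# Benchmark both Milvus and FAISS, with flat and HNSW index types, on a small document set.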
if __name__ == "__main__":
    similarity = "l2"
    n_docs = 1000
    benchmark_querying(index_type="milvus_flat", similarity=similarity, n_docs=n_docs)
    benchmark_querying(index_type="milvus_hnsw", similarity=similarity, n_docs=n_docs)
    benchmark_querying(index_type="faiss_flat", similarity=similarity, n_docs=n_docs)
    benchmark_querying(index_type="faiss_hnsw", similarity=similarity, n_docs=n_docs)