haystack/nodes at 28724e2e25a003cf941061758361d2ae87f0abf1 - haystack - Gitea: Git with a cup of tea

yujunjun/haystack

mirror of https://github.com/deepset-ai/haystack.git synced 2026-01-07 04:27:15 +00:00

History

Daniel Bichuetti 28724e2e25

feat: add automatic OCR detection mechanism and improve performance (#4329 )

* feat: add automatic OCR detection mechanism and improve performance

* refactor: add error message

* refactor: ignore pdftoppm bad typing

* refactor: add Tesseract install. docstrings

* fix: check if OCR var. assigned on mp

* tests: add path to windows/linux tests

* tests: add tessdata path

* tests: include matrix ref.

* tests: custom Tesseract matrix install

* refactor: improve user guide

* tests: fix macos path

* tests: remove brew formulae version

* fix: macos paths

* tests: fix macos path

* tests: add Tesseract to Windows Path

* tests: pytesseract path

* tests: macos path

* refactor: fix path message and remove extra path from tests

* refactor: raise exception when path not found

* refactor: expression simplification

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>

* refactor: check ocr parameter

* tests: mark as integration

* tests: mock deprecation warning

* refactor: simplify code

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>

* refactor: change deprecation test

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>

* refactor: add unit patch

* refactor: black formatting

---------

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
Co-authored-by: Mayank Jobanputra <mayankjobanputra@gmail.com>

2023-03-13 20:19:22 +05:30

..

__init__.py

[CI refactoring] Categorize tests into folders (#2554 )

2022-05-17 09:55:53 +01:00

test_audio.py

refactor: Remove the pin from the espnet module and fix the audio node tests. (#4128 )

2023-02-16 22:12:17 +05:30

test_connector.py

refact: mark unit tests under the test/nodes/** path (#4235 )

2023-02-27 15:00:19 +01:00

test_doc_language_classifier.py

feat: LanguageClassifier (#2994 )

2023-03-13 10:30:03 +01:00

test_document_classifier.py

style: Update black (#4101 )

2023-02-08 15:34:43 +01:00

test_document_merger.py

refact: mark unit tests under the test/nodes/** path (#4235 )

2023-02-27 15:00:19 +01:00

test_extractor.py

test: Added integration test for using EntityExtractor in query pipeline (#4117 )

2023-02-28 09:20:44 +01:00

test_file_converter.py

feat: add automatic OCR detection mechanism and improve performance (#4329 )

2023-03-13 20:19:22 +05:30

test_filetype_classifier.py

refact: mark unit tests under the test/nodes/** path (#4235 )

2023-02-27 15:00:19 +01:00

test_generator.py

feat: Add Azure as OpenAI endpoint (#4170 )

2023-03-02 09:55:09 +01:00

test_image_to_text.py

docs: TransformersImageToText- inform about supported models, better exception handling (#4310 )

2023-03-09 15:35:17 +01:00

test_join_answers.py

refact: move the first batch of unit tests into the proper job (#4216 )

2023-02-21 17:00:02 +01:00

test_join_documents.py

refact: move the first batch of unit tests into the proper job (#4216 )

2023-02-21 17:00:02 +01:00

test_label_generator.py

[CI Refactoring] Refactor Document fixtures in tests (#2577 )

2022-06-10 18:22:48 +02:00

test_preprocessor.py

refact: mark unit tests under the test/nodes/** path (#4235 )

2023-02-27 15:00:19 +01:00

test_prompt_node.py

refactor: simplify registration of PromptModelInvocationLayer (#4339 )

2023-03-07 20:53:48 +01:00

test_query_classifier.py

feat: Extend TransformersQueryClassifier: clean version (#2965 )

2022-08-09 09:43:33 +02:00

test_question_generator.py

feat: add document_store to all BaseRetriever.retrieve() and BaseRetriever.retrieve_batch() implementations (#3379 )

2022-10-26 15:47:06 +02:00

test_ranker.py

Update document scores based on ranker node (#2048 )

2022-06-27 12:17:18 +02:00

test_reader.py

fix: hf-tiny-roberta model loading from disk and mypy errors (#4363 )

2023-03-09 18:06:09 +05:30

test_retriever.py

feat: Add Azure OpenAI embeddings support (#4332 )

2023-03-06 13:37:20 +01:00

test_route_documents.py

refact: move the first batch of unit tests into the proper job (#4216 )

2023-02-21 17:00:02 +01:00

test_shaper.py

refact: mark unit tests under the test/nodes/** path (#4235 )

2023-02-27 15:00:19 +01:00

test_summarizer.py

test: mock all Summarizer tests and move a few into e2e (#4299 )

2023-03-01 17:30:55 +01:00

test_table_reader.py

refactor: Use TableQuestionAnsweringPipeline from transformers (#4303 )

2023-03-07 11:46:50 +01:00

test_translator.py

test: mock all Translator tests and move one to e2e (#4290 )

2023-03-01 14:52:05 +01:00