* move embedding backends
* use token in Sentence Transformers embeddings
* more compact token handling
* token parameter in reader
* add token to ranker
* release note
* add test for reader
* add telemetry to pipelines 2.0
* only collect data if telemetry is on
* reno
* add downsampling
* typing
* manual tests
* pylint
* simplify code
* Update haystack/preview/telemetry/__init__.py
* look for _telemetry_data
* rather index by component type
* black
* mypy
* error handling
* comment
* review feedback & small improvements
* defaultdict
* stray changes
* try-catch
* method instead of attribute
* fixes
* remove print statements
* lint
* invert condition
* always send the first event of the day
* collect specs
* track 2nd and 3rd events too
* send first event and then max 1 event a minute
* rename constant
* black
* add telemetry for embedders
* add test
* remove boolean values
* add telemetry to pipelines 2.0
* only collect data if telemetry is on
* reno
* add downsampling
* typing
* manual tests
* pylint
* simplify code
* Update haystack/preview/telemetry/__init__.py
* look for _telemetry_data
* rather index by component type
* black
* mypy
* error handling
* comment
* review feedback & small improvements
* defaultdict
* stray changes
* try-catch
* method instead of attribute
* fixes
* remove print statements
* lint
* invert condition
* always send the first event of the day
* collect specs
* track 2nd and 3rd events too
* send first event and then max 1 event a minute
* rename constant
* black
* add telemetry details to HuggingFaceLocalGenerator
* add test
* check if the model is a string
---------
Co-authored-by: Stefano Fiorucci <44616784+anakin87@users.noreply.github.com>
* add telemetry to pipelines 2.0
* only collect data if telemetry is on
* reno
* add downsampling
* typing
* manual tests
* pylint
* simplify code
* Update haystack/preview/telemetry/__init__.py
* look for _telemetry_data
* rather index by component type
* black
* mypy
* error handling
* comment
* review feedback & small improvements
* defaultdict
* stray changes
* try-catch
* method instead of attribute
* fixes
* remove print statements
* lint
* invert condition
* always send the first event of the day
* collect specs
* track 2nd and 3rd events too
* send first event and then max 1 event a minute
* rename constant
* black
* add test
* add telemetry for rankers readers and retrievers
* get only the type of docstore, not the whole object
* add telemetry to pipelines 2.0
* only collect data if telemetry is on
* reno
* add downsampling
* typing
* manual tests
* pylint
* simplify code
* Update haystack/preview/telemetry/__init__.py
* look for _telemetry_data
* rather index by component type
* black
* mypy
* error handling
* comment
* add telemetry_data to gptgenerator
* review feedback & small improvements
* defaultdict
* stray changes
* try-catch
* method instead of attribute
* change attribute to method
* fixes
* remove print statements
* lint
* invert condition
* always send the first event of the day
* collect specs
* track 2nd and 3rd events too
* send first event and then max 1 event a minute
* rename constant
* black
* add telemetry to pipelines 2.0
* only collect data if telemetry is on
* reno
* add downsampling
* typing
* manual tests
* pylint
* simplify code
* Update haystack/preview/telemetry/__init__.py
* look for _telemetry_data
* rather index by component type
* black
* mypy
* error handling
* comment
* review feedback & small improvements
* defaultdict
* stray changes
* try-catch
* method instead of attribute
* fixes
* remove print statements
* lint
* invert condition
* always send the first event of the day
* collect specs
* track 2nd and 3rd events too
* send first event and then max 1 event a minute
* rename constant
* black
* add test
* Fix import in link_content.py
* Fix another import
* Move __version__ to separate file to fix circular import
* Fix mypy complaining about redefinition of __version__
* added hybrid search example
Added an example about hybrid search for faq pipeline on covid dataset
* formatted with back formatter
* renamed document
* fixed
* fixed typos
* added test
added test for hybrid search
* fixed withespaces
* removed test for hybrid search
* fixed pylint
* commented logging
* fixed bug in join_docs.py _concatenate_results
* Update join_docs.py
updated comment
* format with black
* added releasenote on PR
* updated release notes
* updated test_join_documents
* updated test
* updated test
* Update test_join_documents.py
* formatted with black
* fixed test
* fixed
---------
Co-authored-by: Stefano Fiorucci <44616784+anakin87@users.noreply.github.com>
* Remove old cohere models
* Add aliases for the existing models according to Cohere documentation
* Add release note
* put cohere embdding models in a constant
* update doc strings
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
* draft
* still a raw draft
* still a raw draft
* improvements
* minimal impl ok
* tests
* reno
* better language
* examples of generation_kwargs
* incorporate feedback
* lg and format updates
* don't save valid str tokens
* fix style
---------
Co-authored-by: Darja Fokina <daria.f93@gmail.com>
* draft TextLanguageClassifier
* implement language detection with langdetect
* add unit test for logging message
* reno
* pylint
* change input from List[str] to str
* remove empty output connections
* add from_dict/to_dict tests
* mark example usage as python code
* add telemetry to pipelines 2.0
* only collect data if telemetry is on
* reno
* add downsampling
* typing
* manual tests
* pylint
* simplify code
* Update haystack/preview/telemetry/__init__.py
* rather index by component type
* black
* mypy
* review feedback & small improvements
* defaultdict
* stray changes
* lint
* invert condition
* always send the first event of the day
* collect specs
* track 2nd and 3rd events too
* send first event and then max 1 event a minute
* rename constant
* invert condition
* linting
* Addition to the text in ValueError when creating a prompt node to inform users to double check they have authorisation for the loaded model and have logged into the huggingface cli
* Update haystack/nodes/prompt/prompt_model.py
Accepted the suggested changes to the value error text
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
Co-authored-by: Stefano Fiorucci <44616784+anakin87@users.noreply.github.com>
* added hybrid search example
Added an example about hybrid search for faq pipeline on covid dataset
* formatted with back formatter
* renamed document
* fixed
* fixed typos
* added test
added test for hybrid search
* fixed withespaces
* removed test for hybrid search
* fixed pylint
* commented logging
* updated hybrid search example
* release notes
* Update hybrid_search_faq_pipeline.py-815df846dca7e872.yaml
* Update hybrid_search_faq_pipeline.py
* mention hybrid search example in release notes
* reduce installed dependencies in examples test workflow
* do not install cuda dependencies
* skip models if API key not set; delete document indices
* skip models if API key not set; delete document indices
* skip models if API key not set; delete document indices
* keep roberta-base model and inference extra
* pylint
* disable pylint no-logging-basicconfig rule
---------
Co-authored-by: Julian Risch <julian.risch@deepset.ai>