David S. Batista
b81abc0c85
feat: SentenceTransformersDocumentEmbedder supports config_kwargs
( #8433 )
...
* initial import
* adding release notes
2024-10-14 17:43:04 +02:00
Sebastian Husch Lee
06dd5c2f37
feat (v2): Update so model_max_length
updates max_seq_length
for Sentence Transformers ( #8334 )
...
* Update so model_max_length does what is expected
* Add release notes
* Some fixes
* Another test
2024-09-06 11:37:56 +02:00
Nicola Procopio
4c798470b2
added precision
parameter to sentence transformers embeddings ( #8179 )
...
* added `precision` parameter to sentence transformers embeddings
* fixed test
* Update haystack/components/embedders/sentence_transformers_document_embedder.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* Update test/components/embedders/test_sentence_transformers_text_embedder.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* Update test/components/embedders/test_sentence_transformers_text_embedder.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* fix format
* Update sentence_transformers_text_embedder.py
---------
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
2024-08-09 11:38:47 +02:00
Sebastian Husch Lee
c90495c2e8
feat: Add model and tokenizer kwargs to TransformersSimilarityRanker
, SentenceTransformersDocumentEmbedder
, SentenceTransformersTextEmbedder
( #8145 )
...
* Start adding model and tokenizer kwargs support
* Add model and tokenizer kwargs to doc embedder
* Some updates and fixes in tests
* Fix more tests
* Fix tests
* Add release note
* Fix test
* Add from_dict tests
2024-08-02 10:37:10 +02:00
Nicola Procopio
47f4db8698
added truncate_dim to sentence transformers embedder ( #8077 )
...
* added truncate_dim to sentence transformers embedder
* Update haystack/components/embedders/sentence_transformers_document_embedder.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* Update releasenotes/notes/release-note-2b603a123cd36214.yaml
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* fixed parameter description
* added test for truncation to text embedder
* fix format
---------
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
2024-07-26 10:39:48 +02:00
Sebastian Husch Lee
c121c86c4c
fix: Fix from_dict methods of components using HF models to work with default values ( #8003 )
...
* Fix from_dict to work if device isn't provided in init params
* Minor refactoring of from_dict for components that load HF models
* Add tests
* Update tests to test loading with all default parameters
* Add more tests
* Add release notes
* Add unit test for whisper local
* Update reno
* Add fix for ExtractiveReader
* Fix NamedEntityExtractor
2024-07-10 12:18:05 +02:00
Vladimir Blagojevic
535a281eec
feat: Add option to use HF_TOKEN
as env var for authentication across all HF components ( #7942 )
...
* Read both HF_API_TOKEN and HF_TOKEN env vars in all HF related components
* Add reno note
* Test fixes
* More test updates
* More test updates
2024-06-27 10:31:58 +02:00
Sebastian Husch Lee
a2be90b95a
fix: Update device deserialization for components that use local models ( #7686 )
...
* fix: Update device deserializtion for SentenceTransformersTextEmbedder
* Add unit test
* Fix unit test
* Make same change to doc embedder
* Add release notes
* Add same change to Diversity Ranker and Named Entity Extractor
* Add unit test
* Add the same for whisper local
* Update release notes
2024-05-14 08:36:14 +02:00
Massimiliano Pippi
10c675d534
chore: add license header to all modules ( #7675 )
...
* add license header to modules
* check license header at linting time
2024-05-09 13:40:36 +00:00
Vladimir Blagojevic
2aae8472e7
feat: Add trust_remote_code init param to SentenceTransformer embedders ( #7356 )
...
* Add trust_remote_code init param to SentenceTransformer embedders
* Add release note
* Go with no kwargs solution
* Update haystack/components/embedders/sentence_transformers_document_embedder.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* Pydoc fix
---------
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
2024-03-14 11:14:04 +01:00
Stefano Fiorucci
d00f171f8b
refactor!: Sentence Transformers Embedders - new devices mgmt ( #7033 )
...
* new device mgmt for Sentence Transformers embedders
* reno
2024-02-19 14:52:44 +01:00
Madeesh Kannan
27d1af3068
feat!: Use Secret
for passing authentication secrets to components ( #6887 )
...
* feat!: Use `Secret` for passing authentication secrets to components
* Add comment to clarify type ignore
2024-02-05 13:17:01 +01:00
ZanSara
288ed150c9
feat!: Rename model_name
or model_name_or_path
to model
in all Embedder classes ( #6733 )
...
* rename model parameter in the openai doc embedder
* fix tests for openai doc embedder
* rename model parameter in the openai text embedder
* fix tests for openai text embedder
* rename model parameter in the st doc embedder
* fix tests for st doc embedder
* rename model parameter in the st backend
* fix tests for st backend
* rename model parameter in the st text embedder
* fix tests for st text embedder
* fix docstring
* fix pipeline utils
* fix e2e
* reno
* fix the indexing pipeline _create_embedder function
* fix e2e eval rag pipeline
* pytest
2024-01-12 15:30:17 +01:00
Stefano Fiorucci
c773c30c66
refactor!: rename all remaining metadata
to meta
( #6650 )
...
* change metadata to meta
* release note
2023-12-28 12:18:15 +01:00
Massimiliano Pippi
7c05f37a53
remove unit marker ( #6450 )
2023-11-29 19:24:25 +01:00
Silvano Cerza
e6637f5ec2
Fix all tests
2023-11-24 14:48:43 +01:00
Massimiliano Pippi
8adb8bbab8
Remove preview folder in test/
...
---------
Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>
2023-11-24 11:52:55 +01:00