ZanSara
|
dfc1d452bb
|
feat: upgrade canals to 0.10.1 (#6309)
* upgrade canals
* reno
* trigger preview e2e
* bump canals
* fix decorator
* fix test
* test factory
* tests inmemory
* tests writer
* test audio
* tests builders
* tests caching
* tests embedders
* tests converters
* tests generators
* tests rankers
* tests retrievers
* fix pipeline and telemetry tests
* remove trigger
|
2023-11-17 14:46:23 +01:00 |
|
Stefano Fiorucci
|
f708cf6056
|
refactor!: set scale_score default value to False (#6276)
* set default scale_score to False
* release note
|
2023-11-13 11:59:18 +01:00 |
|
Agnieszka Marzec
|
1046bebbe0
|
Docs: Update docstrings lg (#6260)
* Update docstrings lg
* Update test_in_memory_bm25_retriever.py
* Update test_in_memory_embedding_retriever.py
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
|
2023-11-09 17:34:52 +01:00 |
|
Silvano Cerza
|
7287657f0e
|
refactor: Rename Document 's text field to content (#6181)
* Rework Document serialisation
Make Document backward compatible
Fix InMemoryDocumentStore filters
Fix InMemoryDocumentStore.bm25_retrieval
Add release notes
Fix pylint failures
Enhance Document kwargs handling and docstrings
Rename Document's text field to content
Fix e2e tests
Fix SimilarityRanker tests
Fix typo in release notes
Rename Document's metadata field to meta (#6183)
* fix bugs
* make linters happy
* fix
* more fix
* match regex
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
|
2023-10-31 12:44:04 +01:00 |
|
Silvano Cerza
|
c8d162ced9
|
refactor: Change Document.embedding type to list of floats (#6135)
* Change Document.embedding type
* Add release notes
* Fix document_store testing
* Fix pylint
* Fix tests
|
2023-10-23 12:26:05 +02:00 |
|
Silvano Cerza
|
3f98bd9137
|
refactor: Rework Document.id generation (#6122)
* Rework Document id generation
* Fix tests
* Add release notes
* Fix failing integration test
* Remove score from Document id generation
* Enhance tests
* Update release notes
---------
Co-authored-by: Julian Risch <julian.risch@deepset.ai>
|
2023-10-20 10:34:28 +02:00 |
|
Stefano Fiorucci
|
4e4af99a5e
|
refactor!: rename MemoryDocumentStore and related Retrievers (#6076)
* rename doc store and retrievers
* release note
* fix patch
|
2023-10-17 16:15:16 +02:00 |
|
Stefano Fiorucci
|
de84a95970
|
separate classes and tests (#5819)
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
|
2023-09-19 19:21:49 +02:00 |
|
Stefano Fiorucci
|
1c69070db6
|
make MemoryEmbeddingRetriever act in non-batch mode (#5809)
|
2023-09-14 15:37:20 +02:00 |
|
ZanSara
|
5888fb7052
|
make MemoryBM25Retriever non match (#5768)
|
2023-09-13 15:11:47 +02:00 |
|
ZanSara
|
6e70d403f8
|
feat: Improve Document for Haystack 2.0 (#5738)
* initial draft
* tests
* add proposal
* proposal number
* reno
* fix tests and usage of content and content_type
* update branch & fix more tests
* mypy
* add docstring
* fix more tests
* review feedback
* improve __str__
* Apply suggestions from code review
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/preview/dataclasses/document.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* improve __str__
* fix tests
* fix more tests
* Update haystack/preview/document_stores/memory/document_store.py
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
|
2023-09-11 17:40:00 +02:00 |
|
Stefano Fiorucci
|
2edf85f739
|
MemoryEmbeddingRetriever (2.0) (#5726)
* MemoryDocumentStore - Embedding retrieval draft
* add release notes
* fix mypy
* better comment
* improve return_embeddings handling
* MemoryEmbeddingRetriever - first draft
* address PR comments
* release note
* update docstrings
* update docstrings
* incorporated feeback
* add return_embedding to __init__
* rm leftover docstring
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
|
2023-09-08 15:52:48 +02:00 |
|
ZanSara
|
b1daa7c647
|
chore: migrate to canals==0.7.0 (#5647)
* add default_to_dict and default_from_dict placeholders to ease migration to canals 0.7.0
* canals==0.7.0
* whisper components
* add to_dict/from_dict stubs
* import serialization methods in init to hide canals imports
* reno
* export deserializationerror too
* Update haystack/preview/__init__.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* serialization methods for LocalWhisperTranscriber (#5648)
* chore: serialization methods for `FileExtensionClassifier` (#5651)
* serialization methods for FileExtensionClassifier
* Update test_file_classifier.py
* chore: serialization methods for `SentenceTransformersDocumentEmbedder` (#5652)
* serialization methods for SentenceTransformersDocumentEmbedder
* fix device management
* serialization methods for SentenceTransformersTextEmbedder (#5653)
* serialization methods for TextFileToDocument (#5654)
* chore: serialization methods for `RemoteWhisperTranscriber` (#5650)
* serialization methods for RemoteWhisperTranscriber
* remove patches
* Add default to_dict and from_dict in document stores built with factory (#5674)
* fix tests (#5671)
* chore: simplify serialization methods for `MemoryDocumentStore` (#5667)
* simplify serialization for MemoryDocumentStore
* remove redundant tests
* pylint
* chore: serialization methods for `MemoryRetriever` (#5663)
* serialization method for MemoryRetriever
* more tests
* remove hash from default_document_store_to_dict
* remove diff in factory.py
* chore: serialization methods for `DocumentWriter` (#5661)
* serialization methods for DocumentWriter
* more tests
* use factory
* black
---------
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
|
2023-08-29 18:15:07 +02:00 |
|
Silvano Cerza
|
66f615a3a4
|
Remove BaseTestComponent (#5613)
* Remove BaseTestComponent
* Add release notes
|
2023-08-23 17:03:37 +02:00 |
|
Silvano Cerza
|
4bc68cbc2f
|
Rework MemoryRetriever (#5582)
* Remove DocumentStoreAwareMixin from MemoryRetriever
* Add release notes
* Update an article
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
|
2023-08-18 16:33:35 +02:00 |
|
Massimiliano Pippi
|
714b944dc2
|
chore: rename store to document_store for clarity (#5547)
* store -> document_store
* fix leftovers
* fix import name
* moar leftovers
* rebase on main, update MemoryDocumentStore to the new protocol
* Update haystack/preview/pipeline.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
---------
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
|
2023-08-12 08:44:36 +02:00 |
|
ZanSara
|
5ca4874df9
|
Migrate existing v2 components to Canals 0.4.0 (#5532)
* pin canals==0.4.0
* update audio components
* allow audio components to receive whisper_params in init too
* migrating memoryretriever
* migrate memoryretriever
* migrate TextFileToDocument
* fix TextFileToDocument tests
* fix pipeline tests
* fix defaults management
* reno
* inverted assignments
* Simplify release notes
---------
Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>
|
2023-08-09 15:51:32 +02:00 |
|
Silvano Cerza
|
7940ec0482
|
Add @store decorator (#5438)
|
2023-07-26 09:32:23 +02:00 |
|
ZanSara
|
8f3fe85878
|
feat: extend pipeline.add_component to support stores (#5261)
* add protocol and adapt pipeline
* change API in pipeline.add_component
* adapt pipeline tests
* adapt memoryretriever
* additional checks
* separate protocol and mixin
* review feedback & update tests
* pylint
* Update haystack/preview/document_stores/protocols.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* Update haystack/preview/document_stores/memory/document_store.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* docstring of Store
* adapt memorydocumentstore
* fix tests
* remove direct inheritance
* pylint
* Update haystack/preview/document_stores/mixins.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* Update test/preview/components/retrievers/test_memory_retriever.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* Update test/preview/components/retrievers/test_memory_retriever.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* Update test/preview/components/retrievers/test_memory_retriever.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* Update test/preview/components/retrievers/test_memory_retriever.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* Update test/preview/components/retrievers/test_memory_retriever.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* test names
* revert suggestion
* private self._stores
* move asserts out
* remove protocols
* review feedback
* review feedback
* fix tests
* mypy
* review feedback
* fix tests & other details
* naming
* mypy
* fix tests
* typing
* partial review feedback
* move .store to input dataclass
* Revert "move .store to input dataclass"
This reverts commit 53f624b99f3414c89d5134711725b31bd94ef77a.
* disable reusing components with stores
* disable sharing components with docstores
* Update mixins.py
* black
* upgrade canals & fix tests
---------
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
|
2023-07-17 15:06:19 +02:00 |
|
ZanSara
|
13bed30504
|
feat: batch mode for MemoryRetriever (v2) (#5287)
* memoryretriever batch mode
* typing of output
|
2023-07-07 12:10:35 +02:00 |
|
Vladimir Blagojevic
|
bc86f57715
|
feat: BM25 retrieval for MemoryDocumentStore (#5151)
|
2023-06-27 17:42:23 +02:00 |
|