Stefano Fiorucci
|
b7bea3ae9c
|
MemoryDocumentStore - Embedding retrieval (2.0) (#5715)
* MemoryDocumentStore - Embedding retrieval draft
* add release notes
* fix mypy
* better comment
* improve return_embeddings handling
* address PR comments
* update docstrings
* incorporated feeback
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
|
2023-09-07 15:44:07 +02:00 |
|
ZanSara
|
b1daa7c647
|
chore: migrate to canals==0.7.0 (#5647)
* add default_to_dict and default_from_dict placeholders to ease migration to canals 0.7.0
* canals==0.7.0
* whisper components
* add to_dict/from_dict stubs
* import serialization methods in init to hide canals imports
* reno
* export deserializationerror too
* Update haystack/preview/__init__.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* serialization methods for LocalWhisperTranscriber (#5648)
* chore: serialization methods for `FileExtensionClassifier` (#5651)
* serialization methods for FileExtensionClassifier
* Update test_file_classifier.py
* chore: serialization methods for `SentenceTransformersDocumentEmbedder` (#5652)
* serialization methods for SentenceTransformersDocumentEmbedder
* fix device management
* serialization methods for SentenceTransformersTextEmbedder (#5653)
* serialization methods for TextFileToDocument (#5654)
* chore: serialization methods for `RemoteWhisperTranscriber` (#5650)
* serialization methods for RemoteWhisperTranscriber
* remove patches
* Add default to_dict and from_dict in document stores built with factory (#5674)
* fix tests (#5671)
* chore: simplify serialization methods for `MemoryDocumentStore` (#5667)
* simplify serialization for MemoryDocumentStore
* remove redundant tests
* pylint
* chore: serialization methods for `MemoryRetriever` (#5663)
* serialization method for MemoryRetriever
* more tests
* remove hash from default_document_store_to_dict
* remove diff in factory.py
* chore: serialization methods for `DocumentWriter` (#5661)
* serialization methods for DocumentWriter
* more tests
* use factory
* black
---------
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
|
2023-08-29 18:15:07 +02:00 |
|
Massimiliano Pippi
|
f9bd64ba9e
|
make code layout consistent (#5561)
|
2023-08-14 16:35:34 +02:00 |
|
Massimiliano Pippi
|
714b944dc2
|
chore: rename store to document_store for clarity (#5547)
* store -> document_store
* fix leftovers
* fix import name
* moar leftovers
* rebase on main, update MemoryDocumentStore to the new protocol
* Update haystack/preview/pipeline.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
---------
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
|
2023-08-12 08:44:36 +02:00 |
|
Silvano Cerza
|
a7416bcf89
|
Add to_dict and from_dict methods for Stores (#5541)
* Add to_dict and from_dict methods for Stores
* Add release notes
* Add tests with custom init parameters
|
2023-08-11 14:45:56 +02:00 |
|
Massimiliano Pippi
|
c079576a87
|
chore: move base test class into haystack core (#5509)
* move base test class into haystack core
* fix linter
* do not compute coverage of testing code
|
2023-08-04 12:42:13 +02:00 |
|
ZanSara
|
f49bd3a12f
|
feat: introduce Store protocol (v2) (#5259)
* add protocol and adapt pipeline
* review feedback & update tests
* pylint
* Update haystack/preview/document_stores/protocols.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* Update haystack/preview/document_stores/memory/document_store.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* docstring of Store
* adapt memorydocumentstore
* fix tests
---------
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
|
2023-07-07 12:10:08 +02:00 |
|
Vladimir Blagojevic
|
bc86f57715
|
feat: BM25 retrieval for MemoryDocumentStore (#5151)
|
2023-06-27 17:42:23 +02:00 |
|
ZanSara
|
3a6db68408
|
feat: allow filtering documents on all fields (v2) (#4773)
* extend tests
* remove stray test
* pylint
* mypy
* review feedback
* fix tests
* fix last tests
* remove comment
* remove print statement
* pylint
* add flatten test
* remove direct acces/ direct write in docstore tests
* fix tests
|
2023-05-10 16:33:47 +02:00 |
|
ZanSara
|
f2106ab37b
|
feat: initial implementation of MemoryDocumentStore for new Pipelines (#4447)
* add stub implementation
* reimplementation
* test files
* docstore tests
* tests for document
* better testing
* remove mmh3
* readme
* only store, no retrieval yet
* linting
* review feedback
* initial filters implementation
* working on filters
* linters
* filtering works and is isolated by document store
* simplify filters
* comments
* improve filters matching code
* review feedback
* pylint
* move logic into_create_id
* mypy
|
2023-04-13 09:36:23 +02:00 |
|