Stefano Fiorucci
e3f9da13d0
test: fix test incorrectly marked as async ( #9327 )
...
* test: fix test incorrectly marked as async
* fix inmemory async tests
2025-04-30 14:07:30 +00:00
David S. Batista
672ab09477
fix: cleaning up InMemoryDocumentStore executor when created inside the class ( #8994 )
...
* cleaning up executor when created inside the class
* adding missed tests
2025-03-07 11:01:29 +01:00
David S. Batista
9581fea3bc
feat: adding async version of InMemoryDocumentStore and associated retrievers ( #8963 )
...
* adding classes from experimental
* adding release notes
* adding tests
* merging all into a single class
* adding async retriever methods
* Update haystack/document_stores/in_memory/document_store.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* adding missed tests
---------
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
2025-03-05 11:36:24 +01:00
Stefano Fiorucci
f3c44be904
refactor!: remove dataframe field from Document and ExtractedTableAnswer; make pandas optional ( #8906 )
...
* remove dataframe
* release note
* small fix
* group imports
* Update pyproject.toml
Co-authored-by: Julian Risch <julian.risch@deepset.ai>
* Update pyproject.toml
Co-authored-by: Julian Risch <julian.risch@deepset.ai>
* address feedback
---------
Co-authored-by: Julian Risch <julian.risch@deepset.ai>
2025-03-04 11:06:07 +00:00
Stefano Fiorucci
bc30105fbc
test: reorganize docstore test suite to isolate dataframe tests ( #8684 )
...
* reorganize docstore test suite to isolate dataframe tests
* improve docstring
* include FilterDocumentsTestWithDataframe in InMemoryDocumentStore tests
2025-01-08 14:58:52 +00:00
Vladimir Blagojevic
7e9f153e78
chore: Remove all references to old filter syntax ( #8342 )
...
* Remove all references to old filter syntax
* More removals
* Lint
* Do not remove test_filter_retriever.py
* Add reno note
* Update ValueError text to match text in haystack-core-integrations
2024-09-12 16:28:31 +02:00
David Berenstein
08104e0042
feat: InMemoryDocumentStore serialization ( #7888 )
...
* Add: InMemoryDocumentStore serialization
* Add: additional chek to test if path exists
* Fix: failing test
2024-06-21 16:45:25 +02:00
Silvano Cerza
854c4173f2
feat: Add memory sharing between different instances of InMemoryDocumentStore ( #7781 )
...
* Add memory sharing between different instances of InMemoryDocumentStore
* Fix FilterRetriever tests
* Fix InMemoryBM25Retriever tests
2024-05-31 16:44:14 +02:00
Massimiliano Pippi
10c675d534
chore: add license header to all modules ( #7675 )
...
* add license header to modules
* check license header at linting time
2024-05-09 13:40:36 +00:00
Guest400123064
cd66a80ba2
perf: enhanced InMemoryDocumentStore BM25 query efficiency with incremental indexing ( #7549 )
...
* incorporating better bm25 impl without breaking interface
* all three bm25 algos
* 1. setting algo post-init not allowed; 2. remove extra underscore for naming consistency; 3. remove unused import
* 1. rename attribute name for IDF computation 2. organize document statistics as a dataclass instead of tuple to improve readability
* fix score type initialization (int -> float) to pass mypy check
* release note included
* fixing linting issues and mypy
* fixing tests
* removing heapq import and cleaning up logging
* changing indexing order
* adding more tests
* increasing tests
* removing rank_bm25 from pyproject.toml
---------
Co-authored-by: David S. Batista <dsbatista@gmail.com>
2024-05-03 12:10:15 +00:00
ZanSara
1182c08daf
fix: Dont filter negative scores when using BM25Okapi and scale_score=False ( #6889 )
...
* dont filter negatives for unscaled Okapi
* change BM25 algorithm default to BM25L
* Update haystack/document_stores/in_memory/document_store.py
* improve comment
2024-02-06 11:07:27 +01:00
Madeesh Kannan
a5189dd035
fix!: InMemoryBM25Retriever no longer returns documents that have a score of 0.0 ( #6717 )
...
* fix!: `InMemoryBM25Retriever` no longer returns documents that have a score of 0.0
Also update tests to accommodate the new behavior.
* Remove superfluous code
2024-01-12 17:50:55 +01:00
Massimiliano Pippi
e1ec4e5e4d
refact!: Remove symbols under the haystack.document_stores namespace ( #6714 )
...
* remove symbols under the haystack.document_stores namespace
* Update haystack/document_stores/types/protocol.py
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* fix
* same for retrievers
* leftovers
* more leftovers
* add relnote
* leftovers
* one more
* fix examples
---------
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
2024-01-10 21:20:42 +01:00
Massimiliano Pippi
7c05f37a53
remove unit marker ( #6450 )
2023-11-29 19:24:25 +01:00
Silvano Cerza
831d0611d9
feat: Change default DuplicatePolicy in DocumentStore.write_documents() ( #6438 )
...
* Change default DuplicatePolicy in DocumentStore.write_documents()
* Add release notes
2023-11-28 12:30:17 +01:00
Silvano Cerza
e6637f5ec2
Fix all tests
2023-11-24 14:48:43 +01:00
Massimiliano Pippi
8adb8bbab8
Remove preview folder in test/
...
---------
Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>
2023-11-24 11:52:55 +01:00