Vladimir Blagojevic
|
b4d8d1c904
|
feat: Add custom conversion callable to PyPDFToDocument - Haystack 2.x (#6258)
* Allow user specified converter hook
* Add a release note
* More unit tests
* PR review - Massi, use protocol as converter
|
2023-11-09 17:35:33 +01:00 |
|
Silvano Cerza
|
7287657f0e
|
refactor: Rename Document's text field to content (#6181)
* Rework Document serialisation
Make Document backward compatible
Fix InMemoryDocumentStore filters
Fix InMemoryDocumentStore.bm25_retrieval
Add release notes
Fix pylint failures
Enhance Document kwargs handling and docstrings
Rename Document's text field to content
Fix e2e tests
Fix SimilarityRanker tests
Fix typo in release notes
Rename Document's metadata field to meta (#6183)
* fix bugs
* make linters happy
* fix
* more fix
* match regex
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
|
2023-10-31 12:44:04 +01:00 |
|
Julian Risch
|
9f3b6512be
|
refactor: Remove reimplementations of default from_dict/to_dict and corresponding tests in 2.0 (#6108)
* whisper transcriber
* remove from/to_dict from builders
* remove from/to_dict from embedders
* remove from/to_dict from fetcher, file_converters
* remove from/to_dict from generators, preprocessors
* remove from/to_dict from ranker, reader
* remove from/to_dict from router, sampler, websearch
* pylint
* reno
* refactor import
* remove unused import
|
2023-10-19 11:17:02 +02:00 |
|
Vladimir Blagojevic
|
3803d23ff6
|
feat: Update PyPDFToDocument to process ByteStream inputs (#6021)
* Update PyPDF converter
* Add mixed source unit test
* Update haystack/preview/components/file_converters/pypdf.py
Co-authored-by: ZanSara <sara.zanzottera@deepset.ai>
---------
Co-authored-by: ZanSara <sara.zanzottera@deepset.ai>
|
2023-10-11 10:52:08 +02:00 |
|
Vladimir Blagojevic
|
92a6221927
|
feat: Add PyPDFToDocument component (2.0) (#5850)
* Initial PyPDFToDocument implementation
* Remove progress bar
* Add release note
* Minor fix
* import check and dependency
---------
Co-authored-by: ZanSara <sara.zanzottera@deepset.ai>
|
2023-09-21 11:52:26 +02:00 |
|