Madeesh Kannan
|
5d66d040cc
|
feat: Add serde methods to HTMLToDocument (#6758)
|
2024-01-18 10:02:01 +01:00 |
|
Sebastian Husch Lee
|
c0b67432e4
|
feat: Add page breaks to default PDF to Document converter (#6755)
* Speedup tests for PyPDFToDocument
* Added unit test and removed skipping of empty pages
* add release note
* Add back some integration marks
|
2024-01-18 08:54:59 +01:00 |
|
ZanSara
|
abd16ab796
|
feat: support single metadata dictionary in MarkdownToDocument (#6629)
* support single metadata dict in markdown2document
* reno
* unwrap list
* direct key access
* typing
* add explicit test
|
2024-01-09 14:44:39 +01:00 |
|
ZanSara
|
175b5baf45
|
feat: support single metadata dictionary in AzureOCRDocumentConverter (#6635)
* support single metadata dict in azureconverter
* reno
* tests
* Update releasenotes/notes/single-meta-in-azureconverter-ce1cc196a9b161f3.yaml
|
2024-01-09 10:49:37 +01:00 |
|
ZanSara
|
974d65f30a
|
feat: support single metadata dictionary in TikaDocumentConverter (#6698)
* reno
* converter
* test
* comment
|
2024-01-09 09:49:47 +01:00 |
|
Stefano Fiorucci
|
bb2b1a20f8
|
refactor: optimize API keys reading (#6655)
* centralize API keys handling
* fix mypy and pylint
* rm utility function, be more explicit
|
2024-01-05 10:40:03 +01:00 |
|
ZanSara
|
c0f1dab454
|
feat: support single metadata dictionary in PyPDFToDocument (#6615)
* support single metadata dict in pypdf2document
* improve tests
* tests
* remove line
|
2023-12-22 14:13:11 +01:00 |
|
ZanSara
|
ff55985e2d
|
feat: support single metadata dictionary in HTMLToDocument (#6613)
* support single metadata in HTMLToDocument
* reno
* docstring
|
2023-12-21 16:45:31 +01:00 |
|
ZanSara
|
cf79aa1485
|
feat: add support for single meta dict in TextFileToDocument (#6606)
* add support for single meta dict
* reno
* reno
* mypy
* extract to function
* docstring
* mypy
|
2023-12-21 14:21:17 +01:00 |
|
sahusiddharth
|
3d17e6ff76
|
changed metadata to meta (#6605)
|
2023-12-21 12:39:58 +01:00 |
|
Vladimir Blagojevic
|
2dd5a94b04
|
feat: Add RAG based OpenAPI service integration (#6555)
* Add OpenAPIServiceConnector and OpenAPIServiceToFunctions
* Add release note
* Add test deps
* Better docs on OpenAPI spec reqs, improve tests
* Silvano PR feedback
|
2023-12-19 13:27:41 +01:00 |
|
Stefano Fiorucci
|
94cfe5d9ae
|
feat!: HTMLToDocument - allow choosing the boilerpy3 extractor (#6582)
* allow extractor customizability
* release note
* typo
|
2023-12-19 10:52:12 +01:00 |
|
Stefano Fiorucci
|
2f034d3c97
|
refactor!: Converters - standardize inputs (#6540)
* standardize converters inputs: first draft
* fix precommit
* fix precommit 2
* fix precommit 3
* add default for optional param
* rm leftover
* install boilerpy in linting workflow
* add boilerpy3 to the core dependencies
* add reno
* remove boilerpy3 installation from test workflow
* fix pylint: import order and unused import
* fix import order
* add release note
* better Tika docstring
* rm boilerpy from linting
* leftover
* md link brackets
* feat: Converters - allow passing `meta` in the `run` method (#6554)
* first impl for html
* progressing on other components
* fix test
* add tests - run with meta
* release note
* reintroduce patches wrongly deleted
* add patch in test
* fix tika test
* Update haystack/components/converters/azure.py
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
* Update releasenotes/notes/converters-standardize-inputs-ed2ba9c97b762974.yaml
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* simplify test
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
Co-authored-by: Julian Risch <julian.risch@deepset.ai>
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
|
2023-12-15 16:41:35 +01:00 |
|
Massimiliano Pippi
|
7c05f37a53
|
remove unit marker (#6450)
|
2023-11-29 19:24:25 +01:00 |
|
Silvano Cerza
|
e6637f5ec2
|
Fix all tests
|
2023-11-24 14:48:43 +01:00 |
|
Massimiliano Pippi
|
8adb8bbab8
|
Remove preview folder in test/
---------
Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>
|
2023-11-24 11:52:55 +01:00 |
|