haystack/releasenotes/notes/pypdf-refactoring-de869c91b42ce5b6.yaml
Stefano Fiorucci 6925e3a2e1
refactor!: Improve PyPDFToDocument (#7362)
* first draft

* rm kwargs from protocol

* Simplify

* no breaking changes

* reno

* one more test of the deprecated registry
2024-03-26 10:09:29 +01:00


---
enhancements:
  - |
    Refactor `PyPDFToDocument` to simplify support for custom PDF converters.
    PDF converters are classes that implement the `PyPDFConverter` protocol and have 3 methods:
    `convert`, `to_dict` and `from_dict`.
    The `DefaultConverter` class is provided as a default implementation.
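
    As an illustration of the three-method shape described above, here is a minimal,
    self-contained sketch. It does not import Haystack or pypdf: `Document`,
    `FakeReader` and the local `PyPDFConverter` protocol are stand-ins, and the method
    signatures and the `to_dict` layout are assumptions, not the library's definitions.
    `UppercaseConverter` is a hypothetical custom converter.

    ```python
    from dataclasses import dataclass
    from typing import Any, Dict, List, Protocol


    @dataclass
    class Document:
        """Stand-in for Haystack's Document so the sketch runs on its own."""
        content: str


    class FakeReader:
        """Hypothetical reader holding page texts; not pypdf's reader class."""
        def __init__(self, pages: List[str]):
            self.pages = pages


    class PyPDFConverter(Protocol):
        """Local stand-in for the protocol; signatures are assumed."""
        def convert(self, reader: Any) -> Document: ...
        def to_dict(self) -> Dict[str, Any]: ...
        @classmethod
        def from_dict(cls, data: Dict[str, Any]) -> "PyPDFConverter": ...


    class UppercaseConverter:
        """Hypothetical custom converter: joins page texts and upper-cases them."""
        def __init__(self, separator: str = "\n"):
            self.separator = separator

        def convert(self, reader: FakeReader) -> Document:
            # Build one document from all pages.
            return Document(content=self.separator.join(reader.pages).upper())

        def to_dict(self) -> Dict[str, Any]:
            # Assumed serialization layout: a type name plus init parameters.
            return {"type": "UppercaseConverter",
                    "init_parameters": {"separator": self.separator}}

        @classmethod
        def from_dict(cls, data: Dict[str, Any]) -> "UppercaseConverter":
            # Rebuild the converter from the dictionary produced by to_dict.
            return cls(**data["init_parameters"])
    ```

    An instance of such a class would then be passed to `PyPDFToDocument` through its
    `converter` parameter; `to_dict` and `from_dict` let the converter be serialized
    and restored along with the component.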
deprecations:
  - |
    Using the `converter_name` parameter in the `PyPDFToDocument` component is deprecated.
    It will be removed in the 2.3.0 release.
    Use the `converter` parameter instead.