mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-07-27 19:10:33 +00:00

**Summary** File-types other than PDF need to use OCR on extracted images. Extract `OCRAgent.get_agent()` such that any file-type partitioner can use it without risking dependency on PDF-only extras.