unstructured

mirror of https://github.com/Unstructured-IO/unstructured.git synced 2025-07-27 19:10:33 +00:00

History

rfctr: extract OCRAgent.get_agent() out of PDF subtree (#2965 )

**Summary**
File-types other than PDF need to use OCR on extracted images. Extract
`OCRAgent.get_agent()` such that any file-type partitioner can use it
without risking dependency on PDF-only extras.

2024-05-03 19:39:22 +00:00

test_ocr_interface.py

rfctr: extract OCRAgent.get_agent() out of PDF subtree (#2965 )

2024-05-03 19:39:22 +00:00