unstructured

mirror of https://github.com/Unstructured-IO/unstructured.git synced 2025-11-24 06:10:20 +00:00

History

rfctr: extract OCRAgent.get_agent() out of PDF subtree (#2965 )

**Summary**
File-types other than PDF need to use OCR on extracted images. Extract
`OCRAgent.get_agent()` such that any file-type partitioner can use it
without risking dependency on PDF-only extras.

2024-05-03 19:39:22 +00:00

test_ocr_interface.py

rfctr: extract OCRAgent.get_agent() out of PDF subtree (#2965 )

2024-05-03 19:39:22 +00:00