OCRmyPDF

A collection of scripts that generate searchable PDF files from PDF files containing only images.

ATTENTION: The scripts are still in development. Please do not use them yet.

Install

TODO

Install Java (from the FreeBSD ports tree): cd /usr/ports/java/openjdk7/ && make install clean

Install JHOVE: download it from http://sourceforge.net/projects/jhove/files/jhove/ and extract the files to some directory "jhove". Then edit the file "jhove/conf/jhove.conf" and change something in "something" to the actual directory (ending in "/jhove").
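
The config edit above could be scripted. The following is only a minimal sketch: the function name point_conf_at_install_dir and its parameters are hypothetical, and because this README does not say which value in jhove.conf has to change, the caller must look it up in the file and pass it in as old_value.

```python
from pathlib import Path


def point_conf_at_install_dir(conf_path, old_value, jhove_dir):
    """Replace old_value in the config file with the absolute JHOVE directory.

    old_value is whatever path the shipped config currently contains; it is
    NOT known from this README and must be supplied by the caller.
    Returns the number of replacements made (0 leaves the file untouched).
    """
    if not old_value:
        raise ValueError("old_value must be a non-empty string")

    conf = Path(conf_path)
    # Resolve to an absolute path, since the config expects the actual directory.
    new_value = str(Path(jhove_dir).resolve())

    text = conf.read_text(encoding="utf-8")
    count = text.count(old_value)
    if count:
        conf.write_text(text.replace(old_value, new_value), encoding="utf-8")
    return count
```

A return value of 0 means old_value was not found, which is a hint to open jhove/conf/jhove.conf and check the value by hand.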
