James R. Barlow
e35526192c
More test cases
2015-07-28 03:02:35 -07:00
James R. Barlow
2a9da225e4
Minor tweaks to uncommon arguments
2015-07-28 02:25:50 -07:00
James R. Barlow
a3f37de9b5
Test cases for --tesseract-timeout
2015-07-28 01:47:30 -07:00
James R. Barlow
6064160953
Get rid of subprocess call on import of tesseract, unpaper -- bit nasty
2015-07-28 01:00:29 -07:00
James R. Barlow
587fa63c8e
--oversample: Default to 0
2015-07-27 20:42:16 -07:00
James R. Barlow
b40eec4cb0
Add --oversample test for hocr rendering
2015-07-27 17:18:02 -07:00
James R. Barlow
2e7cd52c0f
Improve argument handling, test cases
2015-07-27 15:39:54 -07:00
James R. Barlow
77d4cb367e
Put ghostscript in a module
2015-07-27 15:22:00 -07:00
James R. Barlow
2c45c5abc6
Implement tesseract timeout
2015-07-27 04:23:37 -07:00
James R. Barlow
a89afabd79
Implement tesseract PDF rendering as an alternative
...
It's much better a rendering text baselines than hocr and seems to
produce small file sizes, so it's progress. Not available for
Tesseract 3.02 obviously, so both modes need to remove available.
2015-07-27 04:20:49 -07:00
James R. Barlow
6c3cb6acba
Remove redundant *res_render
2015-07-26 12:56:10 -07:00
James R. Barlow
d3088829af
More packaging changes: move jhove, fix console script
2015-07-26 01:52:08 -07:00
James R. Barlow
9aaaba1714
Packaging stuff
2015-07-25 23:45:13 -07:00
Jim Barlow
9adb0d696f
Prepare for Python packaging - move to ocrmypdf folder
2015-07-25 18:22:04 -07:00