125 Commits

Author SHA1 Message Date
James R. Barlow
073ee52ce7 Use hocr and weave; eliminate old combine layers and merge pages 2018-05-01 14:21:53 -07:00
James R. Barlow
54150a14e9 Further elimination of tesseract renderer special casing
We don't need to keep a "skip page" around anymore since
skipping means just not grafting on the text layer.
2018-05-01 13:36:20 -07:00
James R. Barlow
88ff091cce Unify tesseract and sandwich renderer paths
Since the new weaving method copies the font and content
stream from the Tesseract PDF, it doesn't matter if Tesseract
happens to have an image or not.
If Tesseract is text-only capable we use that feature for efficiency,
but ignore the image either way.
2018-05-01 13:24:20 -07:00
James R. Barlow
775be3933c Cherrypick merge_pages unification 2018-04-20 23:08:15 -07:00
James R. Barlow
5fde214290 Update notes for v6.1.5 2018-04-17 15:23:35 -07:00
James R. Barlow
9d28879505 Update Ubuntu 14.04 instructions
Closes #252
2018-04-14 17:30:33 -07:00
James R. Barlow
9b731d63b8 Set Ghostscript -sColorConversionStrategy the way old/new versions expect 2018-04-12 16:28:48 -07:00
James R. Barlow
10aa59f674 v6.1.4 fix test suite regression with Ghostscript 9.23 2018-04-12 15:16:54 -07:00
James R. Barlow
1f7837e7b1 v6.1.4 release notes update 2018-04-12 00:55:45 -07:00
James R. Barlow
c95db246d4 v6.1.4 merge 2018-04-11 15:58:00 -07:00
James R. Barlow
1ba93371ce docs: Update installation to reflect qpdf 7.0.0 requirement 2018-04-11 15:40:50 -07:00
James R. Barlow
3d69b46fca Release notes 2018-04-10 15:53:02 -07:00
James R. Barlow
75d37eb103 docs: expand ocr of image usage 2018-04-09 13:06:09 -07:00
James R. Barlow
d1d4f1e198 Add issue links to release notes 2018-04-06 14:52:40 -07:00
James R. Barlow
be41ff6d54 Update flowchart
[ci skip]
2018-04-05 21:26:37 -07:00
James R. Barlow
1dbb6f1746 Notes on relevant envvars, repology 2018-04-05 02:15:01 -07:00
James R. Barlow
753e6274ab Tell unpaper to use --layout none so it won't blank out multi column text 2018-04-05 02:14:33 -07:00
James R. Barlow
7f462c618b v6.1.3 notes 2018-04-03 00:11:20 -07:00
James R. Barlow
2a43f73228 Update installation.rst, further info on fitz 2018-04-02 11:32:57 -07:00
James R. Barlow
4032570d97 Change docs for fitz/PyMuPDF 2018-04-01 13:19:57 -07:00
James R. Barlow
158f902c3b Fixed setup.py syntax error 2018-03-30 14:00:36 -07:00
James R. Barlow
7f6aaeaecf v6.1.2 2018-03-30 12:39:33 -07:00
James R. Barlow
fc299032a4 v6.1.1 release notes
Better get the last one out
2018-03-30 00:11:52 -07:00
James R. Barlow
610b769df9 Update release notes 2018-03-28 23:33:34 -07:00
James R. Barlow
40ef4f0bbe Add new argument --skip-repair to skip the repair step 2018-03-28 00:54:58 -07:00
James R. Barlow
d0271d5049 More debug messages on repair; update notes 2018-03-28 00:39:38 -07:00
James R. Barlow
d86e315c48 v6.0.1 start release notes 2018-03-26 01:44:01 -07:00
James R. Barlow
2d10fdcf0f Fix typo in release notes 2018-03-25 21:37:06 -07:00
James R. Barlow
355ec70a80 Note other web frontends 2018-03-25 21:36:39 -07:00
James R. Barlow
9fbc69df3f v6.0.0 release 2018-03-25 01:34:26 -07:00
James R. Barlow
656045610a Update release notes 2018-03-25 00:17:23 -07:00
James R. Barlow
11d74dea09 Remove the OCRMYPDF_program environment variables
Really, this was just replicating the functionality of the PATH
environment variable, and users probably do that anyway.
2018-03-24 15:07:02 -07:00
James R. Barlow
6f1a40b2ca v6.0.0 notes, build machinery changes 2018-03-24 02:52:56 -07:00
James R. Barlow
a2b1f54eb2 Update documentation license info 2018-03-24 02:33:24 -07:00
James R. Barlow
e88ec9822b Tweak release notes 2018-03-15 17:09:43 -07:00
James R. Barlow
11fdb4c5d8 Update release notes for v5.7.0 2018-03-15 17:06:04 -07:00
James R. Barlow
319aff6d09 Merge better-hocr 2018-03-15 16:59:59 -07:00
endolith
8d691391ac Fix typos in advanced.rst (#228) 2018-03-14 15:54:55 -04:00
James R. Barlow
0089a84c94 hocr: Make interword spaces default and non-optional for hocr
Update documentation to match.
2018-03-13 14:51:47 -07:00
James R. Barlow
fc0800ed5d v5.6.3 notes 2018-03-12 03:41:12 -07:00
James R. Barlow
f4e3a0e5b2 v5.6.2 notes 2018-03-09 15:37:08 -08:00
James R. Barlow
84d120e850 v5.6.1 notes 2018-03-09 08:00:42 -08:00
James R. Barlow
7ae6c5ae87 Trial merge interword-spaces 2018-03-02 23:47:06 -08:00
Charles Forcey
422e619978 Add a note to the documentation about interword-spaces 2018-03-01 13:15:03 -05:00
James R. Barlow
2d8aad1086 Improve docs 2018-03-01 00:24:38 -08:00
James R. Barlow
74ca736333 Issue #223: improve text of encrypted PDF error message 2018-02-27 15:08:22 -08:00
James R. Barlow
f248576994 Change instructions to point away from private tap 2018-02-19 17:33:58 -08:00
James R. Barlow
24435f11e0 We are now in homebrew 2018-02-15 17:42:16 -08:00
James R. Barlow
fa2c0296d6 v5.6.0 release notes, docs 2018-02-07 16:48:04 -08:00
James R. Barlow
882fc2257c Add --max-image-mpixels argument to support Pillow 5.0 2018-01-10 15:43:59 -08:00