394 Commits

Author SHA1 Message Date
James R. Barlow
11d74dea09 Remove the OCRMYPDF_program environment variables
Really, this was just replicating the functionality of the PATH
environment variable, and users probably do that anyway.
2018-03-24 15:07:02 -07:00
James R. Barlow
6f1a40b2ca v6.0.0 notes, build machinery changes 2018-03-24 02:52:56 -07:00
James R. Barlow
a2b1f54eb2 Update documentation license info 2018-03-24 02:33:24 -07:00
James R. Barlow
e88ec9822b Tweak release notes 2018-03-15 17:09:43 -07:00
James R. Barlow
11fdb4c5d8 Update release notes for v5.7.0 2018-03-15 17:06:04 -07:00
James R. Barlow
319aff6d09 Merge better-hocr 2018-03-15 16:59:59 -07:00
endolith
8d691391ac Fix typos in advanced.rst (#228) 2018-03-14 15:54:55 -04:00
James R. Barlow
0089a84c94 hocr: Make interword spaces default and non-optional for hocr
Update documentation to match.
2018-03-13 14:51:47 -07:00
James R. Barlow
fc0800ed5d v5.6.3 notes 2018-03-12 03:41:12 -07:00
James R. Barlow
f4e3a0e5b2 v5.6.2 notes 2018-03-09 15:37:08 -08:00
James R. Barlow
84d120e850 v5.6.1 notes 2018-03-09 08:00:42 -08:00
James R. Barlow
7ae6c5ae87 Trial merge interword-spaces 2018-03-02 23:47:06 -08:00
Charles Forcey
422e619978 Add a note to the documentation about interword-spaces 2018-03-01 13:15:03 -05:00
James R. Barlow
2d8aad1086 Improve docs 2018-03-01 00:24:38 -08:00
James R. Barlow
74ca736333 Issue #223: improve text of encrypted PDF error message 2018-02-27 15:08:22 -08:00
James R. Barlow
f248576994 Change instructions to point away from private tap 2018-02-19 17:33:58 -08:00
James R. Barlow
24435f11e0 We are now in homebrew 2018-02-15 17:42:16 -08:00
James R. Barlow
fa2c0296d6 v5.6.0 release notes, docs 2018-02-07 16:48:04 -08:00
James R. Barlow
882fc2257c Add --max-image-mpixels argument to support Pillow 5.0 2018-01-10 15:43:59 -08:00
James R. Barlow
41e83b52fc Document process for redoing OCR 2018-01-10 15:39:58 -08:00
James R. Barlow
44a45fc3fb Add "bad UTF8 output from Tesseract" test 2017-11-29 14:08:07 -08:00
James R. Barlow
67aa00e2c5 Remove py3.5 deprecation
It'll be in Debian stable for a long time, so maybe not such a good idea
2017-11-27 01:10:34 -08:00
James R. Barlow
a1bf161eb7 Update v5.4.4 release notes 2017-11-27 01:09:35 -08:00
James R. Barlow
64b9005030 v5.4.4 release notes; mention deprecation of 3.5 2017-11-26 23:08:55 -08:00
James R. Barlow
7eadd82048 Update docker script based on uid issue
issue #180, fix reported by @Atredis76
2017-11-26 15:11:17 -08:00
James R. Barlow
f45d012635 Update docs: security, pip install --user 2017-11-21 16:50:14 -08:00
James R. Barlow
5a777ee9bc v5.4.3 release notes 2017-11-17 02:28:02 -08:00
James R. Barlow
c7b8b6e18b Fix issue #194 - --sidecar creates blank txt file 2017-10-26 18:15:31 -07:00
James R. Barlow
2061092ec5 Describe how to stream with Docker
[ci skip]
2017-10-18 12:37:27 -07:00
James R. Barlow
57466a31ba v5.4.1 release notes 2017-10-12 14:04:45 -07:00
James R. Barlow
4b7135f0e5 Add option to produce PDF/A-1B 2017-10-11 14:32:58 -07:00
James R. Barlow
9498601a37 Add docs on adding to docker image 2017-10-10 12:13:20 -07:00
James R. Barlow
571de0e368 Update release notes 2017-10-08 12:41:03 -07:00
James R. Barlow
82cea2fd85 Update batch processing docs to include Synology script 2017-10-08 12:34:36 -07:00
James R. Barlow
6af7d61ee5 Fix CI failure due to spoofers not being updated to Tesseract 3.05 strings 2017-09-01 16:17:26 -07:00
James R. Barlow
bafd08391d Update release notes 2017-09-01 12:50:45 -07:00
James R. Barlow
4ed1aa4d23 Release notes: fix indentation 2017-09-01 12:47:22 -07:00
James R. Barlow
f6a4d8f1f8 Fix Ubuntu 14.04 install instructions to account for dropping Py3.4 support
[ci skip]
2017-08-27 13:53:36 -07:00
James R. Barlow
b3097a2384 Fix broken test case related to language packs 2017-08-24 13:01:02 -07:00
James R. Barlow
6d9ddbe98b v5.3.1 notes 2017-08-24 01:09:19 -07:00
James R. Barlow
5b413e3873 Cookbook: add "don't OCR" examples 2017-08-23 23:29:41 -07:00
James R. Barlow
c4831ac00c v5.3 release notes 2017-07-27 00:11:12 -07:00
James R. Barlow
098f5d4f0b docs: remove deprecated example of pdftotext 2017-07-20 16:20:17 -07:00
James R. Barlow
ffc685d536 docs: envvar markup 2017-07-20 16:19:57 -07:00
James R. Barlow
2c24f67deb Rename “tess4” renderer to “sandwich” and make it default in Tess 3.05.01
Tesseract 3.05.01 backported the textonly_pdf=1 which allows the use
of this superior PDF renderer prior to 4.00 alpha. This means that
the tess4 name is no longer accurate, so call it a sandwich because of
its merge-preserve characteristic. Preserve the tess4 name. Fix the
documentation and tests to reflect this.

Make it the default, because it’s better. It does not have the issues
the “tesseract” renderer does prior to Tess 3.05.00 with rendering
PDFs that Ghostscript corrupts, and it produces better output without
re-rastering.

Deprecate some old stuff to avoid the test suite growing obscenely
large.
2017-06-13 13:09:12 -07:00
James R. Barlow
f7ee9e90ce Document what is meant by the ocrmypdf “API” 2017-06-13 10:15:11 -07:00
James R. Barlow
12bfe20385 v5.1 release notes 2017-05-29 14:36:50 -07:00
James R. Barlow
4c09875890 docs: upload unpaper Dropbox link, .rst typo blocking macOS install
[ci skip]
2017-05-19 12:18:09 -07:00
James R. Barlow
0a2f732267 docs: Fix restructured text typos 2017-05-16 23:27:10 -07:00
James R. Barlow
4bade99f27 docs: Remark that someone got bash on Windows working 2017-05-16 23:24:34 -07:00