James R. Barlow
ea69883386
Tests: Speed up a slow test (cherry-picked from v7)
2018-07-12 02:47:15 -07:00
James R. Barlow
eb343b1e37
Tests: Add ability to disable use of cache (cherrypicked from v7)
2018-07-12 02:46:53 -07:00
James R. Barlow
9f02de55be
main: do better parameter validation
2018-07-12 02:46:52 -07:00
James R. Barlow
7394a4cf49
Cherrypick warning about --user-words not having any effect
...
Might be available in full release of Tess4
2018-07-12 02:46:34 -07:00
James R. Barlow
ed9fb110b1
Fix a comment about Tesseract behavior in certain versions
2018-07-12 02:46:34 -07:00
James R. Barlow
4650074428
Cherrypick Python 3.7 documentation updates from v7.0.0
...
From b0eacd6
2018-07-12 02:45:51 -07:00
James R. Barlow
70aa644c10
Backport Python 3.7 fix for ruffus 2.7.0 from ocrmypdf v7.0.0
2018-07-12 02:45:51 -07:00
James R. Barlow
2ccb3edc58
Ignore masks when deciding what color to rasterize at
2018-07-12 02:45:51 -07:00
James R. Barlow
1f40a70554
Use qpdf 8.0.2 backport, force old pytest-timeout to fix build
v6.2.1
2018-06-23 03:14:18 -07:00
James R. Barlow
e14ffbf03f
v6.2.1 release notes
2018-06-23 03:01:54 -07:00
James R. Barlow
25a1dde57c
Fix recent versions of tesseract not registering as textonly_pdf
...
This change happened sometime after the 4.0.0-beta1 release in
Ubuntu 18.04
2018-06-23 02:59:22 -07:00
jbarlow83
1d10eac764
Add wiki link to issue template
...
[ci skip]
2018-06-06 12:59:59 -07:00
jbarlow83
3f868118cd
Remove gpg
...
[ci skip]
2018-06-06 12:58:02 -07:00
Fabian Rodriguez
c7cf041e4a
Fixed language option example (French) ( #266 )
...
Replace fre to fra.
2018-05-10 00:10:27 -07:00
James R. Barlow
38ab03655b
Restore unpaper
...
It's a suggested/recommended dep not required in Deb/Ubu.
2018-05-06 21:36:12 -07:00
James R. Barlow
9226f8a5d1
Trap PDF/A-3 errors on old Ghostscript
v6.2.0
2018-05-04 15:29:43 -07:00
James R. Barlow
5c8a007f3e
Fix failure to prevent use of Ghostscript on /UserUnit files
2018-05-04 13:34:34 -07:00
James R. Barlow
b3ad3e297d
v6.2.0 fixes
2018-05-03 17:04:23 -07:00
James R. Barlow
d607553e48
v6.2.0 Release notes
2018-05-03 16:47:21 -07:00
James R. Barlow
7cf83c77ca
Merge branch 'feature/pdfa3'
2018-05-03 16:45:57 -07:00
James R. Barlow
8a9f174f63
Fix XMP validation issue with /CreationDate
...
Related to previous validation issue. If the /CreationDate had no
timezone, Ghostscript also creates invalid metadata. Work around this.
Also fix up PDF date decoding, and transcode dates to standardize them.
2018-05-03 16:30:20 -07:00
James R. Barlow
98a0786c32
Add 18.04 update procedure
2018-05-03 13:55:16 -07:00
James R. Barlow
df1129724c
Update Dockerfile for Ubuntu 18.04
2018-05-03 01:27:13 -07:00
James R. Barlow
383e726d65
Expand size growth reasons to other arguments that trigger transcoding
2018-04-27 19:34:57 -07:00
James R. Barlow
2131ad4670
Fix --remove-background error on PDFs with colormapped images
...
It's unclear how exactly a
colormapped image gets to this
spot given the tendency of other
image processing tools to flatten
such images, but someone made it happen, so now we make sure
the image is okay.
Closes #262
2018-04-27 17:21:01 -07:00
James R. Barlow
219fe2155b
test_pageinfo: remove duplicate import
2018-04-27 17:16:42 -07:00
James R. Barlow
4209034d20
Add gpg key to issue template
2018-04-27 15:51:26 -07:00
James R. Barlow
abcae0c2a4
Fix helpers.py again
2018-04-25 22:10:51 -07:00
James R. Barlow
0934905493
Don't suppress error message from config_notfound
...
Since it showed up in s390x bionic
2018-04-25 21:58:18 -07:00
James R. Barlow
11cd6201d9
helpers: fix missing call to complain()
...
In practice this is probably unreachable.
2018-04-25 21:57:50 -07:00
James R. Barlow
df87e21c85
Add support for PDF/A-3
...
No ability to attach files however
2018-04-20 00:06:55 -07:00
Hugo
d761d80750
Use more standard __version__ rather than PILLOW_VERSION ( #257 )
2018-04-19 23:35:32 -07:00
James R. Barlow
0b10db91be
Fix regression: Disable Ghostscript JPEG passthrough entirely
v6.1.5
2018-04-17 17:00:24 -07:00
James R. Barlow
1a516b2af9
Fix regression: time stamp test suite failures
2018-04-17 16:59:21 -07:00
James R. Barlow
076363d78e
Disable JPEG passthrough for Ghostscript 9.23
...
Seems to corrupt JPEGs involved in image masks?
2018-04-17 16:31:03 -07:00
James R. Barlow
5fde214290
Update notes for v6.1.5
2018-04-17 15:23:35 -07:00
James R. Barlow
a620724d6a
Fix PDF/A validation failure due to timezone being omitted from /ModDate
2018-04-17 15:16:48 -07:00
James R. Barlow
7368399f8b
Clarify license of two test files - https://github.com/jbarlow83/OCRmyPDF/issues/254
2018-04-17 11:56:36 -07:00
James R. Barlow
34c78a892a
Fix list table for tests/resources
...
[ci skip]
2018-04-15 23:52:19 -07:00
James R. Barlow
9d28879505
Update Ubuntu 14.04 instructions
...
Closes #252
2018-04-14 17:30:33 -07:00
James R. Barlow
2482296e2b
hocr: avoid division by zero
...
Issue #253 - PDF that produces the error is not available, but if font_width
is zero, chances are the text is nonprinting characters, so suppress it.
2018-04-14 17:24:21 -07:00
James R. Barlow
7fc897e6dc
Fix NameError 'ghostscript'
v6.1.4
2018-04-12 21:24:05 -07:00
James R. Barlow
9b731d63b8
Set Ghostscript -sColorConversionStrategy the way old/new versions expect
2018-04-12 16:28:48 -07:00
James R. Barlow
10aa59f674
v6.1.4 fix test suite regression with Ghostscript 9.23
2018-04-12 15:16:54 -07:00
James R. Barlow
1f7837e7b1
v6.1.4 release notes update
2018-04-12 00:55:45 -07:00
James R. Barlow
ba0535e3fb
Update test cache to account for unpaper --layout none change
2018-04-12 00:48:21 -07:00
James R. Barlow
49fa7f6b5c
tesseract_cache: don't reveal host system file paths in manifest file
2018-04-12 00:47:28 -07:00
James R. Barlow
c95db246d4
v6.1.4 merge
2018-04-11 15:58:00 -07:00
James R. Barlow
1ba93371ce
docs: Update installation to reflect qpdf 7.0.0 requirement
2018-04-11 15:40:50 -07:00
James R. Barlow
fedbbdb575
Travis: compile qpdf from source
...
The older version in Travis's Ubuntu 14.04 can't pass the test suite anymore.
2018-04-11 15:40:45 -07:00