3117 Commits

Author SHA1 Message Date
James R. Barlow
8423bd549b
helpers: don't trap exception on failure to unlink
If we can't unlink a file we expect to unlink, logging and moving on is
probably the wrong action.

Coverage never hits this line.
2021-04-07 23:16:10 -07:00
James R. Barlow
336d274a54 Drop remnants of support for Tesseract without has_textonly_pdf
Also improve Tesseract version checking so it can compare all of their
weird conventions.
2021-04-07 23:05:21 -07:00
James R. Barlow
906d77b389
tests: remove obsolete running_in_travis() 2021-04-07 02:25:10 -07:00
James R. Barlow
9416e850ff
Remove another instance of helpers_namespace 2021-04-07 02:23:04 -07:00
James R. Barlow
fd3248869c Merge branch 'master' of github.com:jbarlow83/OCRmyPDF 2021-04-07 02:18:14 -07:00
James R. Barlow
2a09a668f6
Delinting: unused args 2021-04-07 02:18:08 -07:00
James R. Barlow
e788dde607
tests: eliminate unnecessary mmap 2021-04-07 02:11:31 -07:00
James R. Barlow
173a80864d
Delinting 2021-04-07 02:09:45 -07:00
James R. Barlow
aa115a8be3
Remove pytest_helpers_namespace 2021-04-07 01:56:51 -07:00
Tomáš Hrnčiar
e0441c4aa1
Explicitly require setuptools, since pdfa.py imports pkg_resources (#755)
Co-authored-by: jbarlow83 <jbarlow83@users.noreply.github.com>
2021-04-06 02:23:44 -07:00
James R. Barlow
a861c58da2 Merge branch 'master' of github.com:jbarlow83/OCRmyPDF 2021-04-06 02:07:16 -07:00
James R. Barlow
b1306bd7a8
tests: skip test_bash on Windows 2021-04-06 01:15:00 -07:00
James R. Barlow
3a6eb383dc
Run all tests 2021-04-06 00:28:06 -07:00
James R. Barlow
6c942ecefd
Update release notes 2021-04-06 00:17:36 -07:00
James R. Barlow
a25f8ecc62
Remove Azure Pipelines 2021-04-06 00:17:29 -07:00
andkrause
e1f4813d94 Dockerfile: support arm64
Thanks to @andkrause for submitting the initial PR that made this change
possible, @0x326 for review comments on the same PR, and both for their
patience in waiting for the rest of OCRmyPDF to catch up.

Co-authored-by: James R. Barlow

Closes #564
2021-04-06 00:09:04 -07:00
James R. Barlow
af526f078d Activate codecov 2021-04-06 00:08:58 -07:00
James R. Barlow
16438c1312 Begin GitHub Actions migration 2021-04-06 00:08:13 -07:00
James R. Barlow
dd6cb7ce20
Preparing release notes 2021-04-02 01:10:53 -07:00
James R. Barlow
653e2e23df
Update deps for next release 2021-04-02 01:10:39 -07:00
James R. Barlow
a2033698fa graft: use newer pikepdf style 2021-04-02 00:11:13 -07:00
James R. Barlow
ec1d585d40
Merge branch 'feature/misc-breaking' 2021-04-01 16:51:04 -07:00
James R. Barlow
a4e1f8e1f3 Merge branch 'feature/lambda' 2021-04-01 16:36:22 -07:00
James R. Barlow
2e155c31bf
Ensure builtin module registration is deterministic 2021-04-01 16:30:42 -07:00
James R. Barlow
e09ae9c68a
Fix test suite failure if filter_pdf_page is missing 2021-04-01 16:25:40 -07:00
James R. Barlow
c5bf1dd90d
reqs: bump reqs to avoid reportlab issue 2021-04-01 16:01:23 -07:00
Timo Klerx
e96770c5e4
Updated documentation for windows and additional languages (#753)
Changed the documentation for how to add new languages on Windows.

Co-authored-by: jbarlow83 <jbarlow83@users.noreply.github.com>
2021-03-26 16:13:28 -07:00
James R. Barlow
f4f0f3c022
v11.7.3 release notes (tag: v11.7.3) 2021-03-20 23:30:42 -07:00
James R. Barlow
0a42934c08
Exclude Group 3 images from optimization 2021-03-20 23:28:21 -07:00
James R. Barlow
d8f47768f9
v11.7.2 release notes (tag: v11.7.2) 2021-03-19 00:31:38 -07:00
James R. Barlow
c9594a4a5f
Update pinned versions to avoid Pillow vulnerabilities
See https://github.com/python-pillow/Pillow/blob/master/CHANGES.rst
2021-03-19 00:31:27 -07:00
James R. Barlow
873f915212
docs: mention problems with Debian/Ubuntu and other tidying 2021-03-09 23:24:10 -08:00
James R. Barlow
079c162a96 Ensure sidecar is not input or output file 2021-03-05 00:29:42 -08:00
James R. Barlow
25c8c4656f
Fix error message change (tag: v11.7.1) 2021-03-03 01:02:59 -08:00
James R. Barlow
ffcae9a1a0
v11.7.1 release notes 2021-03-03 00:46:35 -08:00
James R. Barlow
6e71fe1186
Clarify --unpaper-args errors 2021-03-03 00:44:21 -08:00
James R. Barlow
0885799010 Update docs for conda
Closes #743
2021-03-03 00:43:45 -08:00
James R. Barlow
8ffc99f648 optimize: log errors more loudly 2021-03-03 00:43:40 -08:00
James R. Barlow
2261c51eff
Reactivate pngquant on windows (tag: v11.7.0) 2021-02-26 01:19:03 -08:00
James R. Barlow
5c470778a3
v11.7.0 release notes 2021-02-26 00:29:52 -08:00
James R. Barlow
4124889f36
Don't generate PDF/A-1b with object streams
Acrobat insists that PDF/A-1b should not have object streams.
Other programs like veraPDF disagree with this restriction, but
we can accommodate Acrobat so we will.

Also add more tests around this.
2021-02-26 00:23:57 -08:00
James R. Barlow
a23c22b0e8
helpers: tidy check_pdf 2021-02-25 22:51:53 -08:00
James R. Barlow
dd1f5f7215
pyproject: black doesn't like py39 yet 2021-02-25 16:10:20 -08:00
Dima Kuznetsov
5e2206bae7
Allow --sidecar along --pages (#735) 2021-02-19 16:55:35 -08:00
James R. Barlow
079ee86d43
pyproject: also target py39 2021-02-18 01:48:56 -08:00
James R. Barlow
3692868004
v11.6.2 release notes (tag: v11.6.2) 2021-02-15 01:48:14 -08:00
James R. Barlow
064f935699
Fix page rotation regression
Page size fixes in commit b26749 did account for a "kept" rotation,
but not a corrected rotation.

Fixes #730.
2021-02-15 01:47:09 -08:00
James R. Barlow
8770fff968
tests: remove unreliable/incomplete test 2021-02-15 01:05:08 -08:00
James R. Barlow
82de78b6b0
v11.6.1 release notes (tag: v11.6.1) 2021-02-14 01:51:26 -08:00
James R. Barlow
2a52c6dec2 optimize: skip images with unusually small dimensions
They're unlikely to be handled well by our recompressors. It seems
that JBIG2 cannot handle very small widths.

Fixes #732
2021-02-14 01:43:25 -08:00