2895 Commits

Author SHA1 Message Date
James R. Barlow
44149ad319 Disable test_error_trap for Leptonica < 1.79
Old error trap seems unreliable in the first place so difficult to set up
a test.
2020-07-20 21:12:00 -07:00
fcatus
d80d963cea
pdfinfo: Replace list comp with gen expr'n 2020-07-20 02:21:58 -07:00
James R. Barlow
4da33b8050 Update debian/copyright from Debian, with fixes 2020-07-19 03:50:43 -07:00
James R. Barlow
5cbbff8472 For Leptonica 1.79+ use leptSetStderrHandler
Lock free and considerably less dangerous to stderr messages.
2020-07-19 03:40:33 -07:00
James R. Barlow
fa6e47c277 Merge branch 'feature/optimize-cleanup' 2020-07-19 01:53:11 -07:00
James R. Barlow
4ea9cffebd Add locking to Leptonica error trap
To protect another thread from interfering with our redirection of
stderr.
2020-07-19 01:51:58 -07:00
James R. Barlow
ae68edefc5 pipelines: fix Python 3.7/3.8 on macOS 2020-07-19 01:51:58 -07:00
James R. Barlow
1558e068f1
docs: explain firstresult hook behavior 2020-07-16 00:01:59 -07:00
James R. Barlow
d2a9c413f8
docs: install notes for ARM64 2020-07-14 01:25:16 -07:00
James R. Barlow
a510b21b20
optimize: add typing for Xref, remove fspath()'s 2020-07-09 14:06:41 -07:00
James R. Barlow
e33ba07aa4
Update pre-commit settings 2020-07-08 23:45:53 -07:00
James R. Barlow
373f27832b
optimize: improve typing of xref_exts 2020-07-07 22:41:29 -07:00
James R. Barlow
b20a6e4c5d
optimize: add type hints 2020-07-07 22:18:50 -07:00
James R. Barlow
49734d5456
optimize: fix incorrect to prevent re-optimizing JBIG2s 2020-07-07 21:52:11 -07:00
James R. Barlow
26a415c5dd
docs: Note usage of OCR_JSON_SETTINGS for watcher 2020-07-07 21:26:57 -07:00
James R. Barlow
60be64a5f1
Fix debug.log missing pageno handler 2020-07-04 03:59:38 -07:00
James R. Barlow
190294634c
docs: edit plugins 2020-07-03 16:16:01 -07:00
James R. Barlow
1722cb579d
v10.2.1 release notes v10.2.1 2020-07-01 03:26:57 -07:00
James R. Barlow
dc42beb6a8
More typing improvements
Typing fixes bugs.
2020-06-30 15:02:30 -07:00
James R. Barlow
378f543619 TextPositionTracker: set boxes_flow=None
We don't care about the order of lines in our analysis, and this is an
expensive calculation in pdfminer.
2020-06-30 04:20:58 -07:00
James R. Barlow
62924ee280
Improve API documentation 2020-06-30 04:20:14 -07:00
James R. Barlow
86a73191b0
Plugin manager: accept Path(plugin) 2020-06-30 04:17:30 -07:00
James R. Barlow
86875997b8
Fix more mypy errors 2020-06-29 02:17:14 -07:00
James R. Barlow
b939584c7a
quality: fixing typing issues 2020-06-29 01:45:45 -07:00
James R. Barlow
bbd174071d
readme: markdown cleanup 2020-06-29 01:45:27 -07:00
James R. Barlow
e5b6fe1317
pyproject.toml: weird line wrapping? 2020-06-29 01:45:12 -07:00
James R. Barlow
f15d9049eb
install: add Mageia
Closes #586. Thanks to @yannick56
2020-06-26 23:28:26 -07:00
James R. Barlow
7630c93e5b
install: drop Ubuntu 14.04 steps
Bit rot must have set in.
2020-06-26 23:27:42 -07:00
James R. Barlow
638d68aa8a
docs: move Windows ahead of FreeBSD 2020-06-26 22:49:34 -07:00
James R. Barlow
a92dde058a
docs: promote one liner installs, reorg Windows 2020-06-26 22:47:44 -07:00
James R. Barlow
580f2ebb4b
Python 3.9beta is now known to work (Fedora) 2020-06-26 00:06:58 -07:00
James R. Barlow
01cae7a584
docs: Update Fedora versions 2020-06-23 02:08:24 -07:00
James R. Barlow
66337813e6
Spell runslow correctly v10.2.0 2020-06-22 23:32:09 -07:00
James R. Barlow
eb5a211e72
New hocrtransform test isn't platform stable - mark runslow 2020-06-22 16:59:59 -07:00
James R. Barlow
5142933120
v10.2.0 release notes
Closes #582, #584, #545
2020-06-22 16:37:51 -07:00
James R. Barlow
06ab114aa8
Update test cache 2020-06-22 16:31:34 -07:00
James R. Barlow
1257419465
test_hocrtransform: this test is worth not caching 2020-06-22 16:31:06 -07:00
James R. Barlow
30404f53f0
Add test to sanity check our pdf renderers 2020-06-22 16:18:38 -07:00
James R. Barlow
1ce8edbdfe
hocrtransform: some text not included in output after Tesseract changes 2020-06-22 15:48:23 -07:00
James R. Barlow
d4b704a0ae
hocrtransform: refactor colors 2020-06-22 15:22:48 -07:00
James R. Barlow
2d64e1536d
hocrtransform: refactor xpath manipulations 2020-06-22 14:44:34 -07:00
James R. Barlow
c8b581ac31
hoctransform: remove deprecated element.getchildren()
Breaks Python 3.9.
2020-06-22 14:28:18 -07:00
James R. Barlow
ad8dead7df
Document that API accepts streams now 2020-06-22 14:27:27 -07:00
James R. Barlow
c9bd87254e
A few minor typing issues 2020-06-22 02:31:53 -07:00
James R. Barlow
f4cb424451
Support input/output streams at API level 2020-06-22 02:02:18 -07:00
James R. Barlow
fef14778d5 Fix missing f-string in log message 2020-06-22 01:17:16 -07:00
James R. Barlow
86ec63f215
Decouple plugin manager forking from PdfContext/Pagecontext 2020-06-22 01:16:59 -07:00
James R. Barlow
5b10ec9d39
jobcontext.PdfContext: remove dead code, add annotations 2020-06-22 00:34:58 -07:00
James R. Barlow
800c75c4e5
Bump requirements (mainly for Docker's benefit) 2020-06-21 01:58:53 -07:00
James R. Barlow
24d64b04c3
Update Docker to Ubuntu 20.04 and jbig2-latest 2020-06-21 01:48:31 -07:00