James R. Barlow
c5edff2c2f
Sort imports
2019-12-19 15:31:18 -08:00
James R. Barlow
8c5f8b8ddd
Add isort to precommit
2019-12-19 15:29:38 -08:00
James R. Barlow
39da931a56
Look in Program Files for executables and liblept5.dll
2019-12-19 12:11:32 -08:00
James R. Barlow
9fe354359b
Generally update documentation about available platforms
2019-12-19 00:27:37 -08:00
James R. Barlow
facc4750bc
Remove command line qpdf from azure and travis
v9.2.0
2019-12-12 00:14:05 -08:00
James R. Barlow
437c235738
v9.2.0 release notes and docs
2019-12-11 13:13:51 -08:00
James R. Barlow
9559b0b186
Use pikepdf to perform qpdf.check()
2019-12-11 01:21:15 -08:00
James R. Barlow
91456e19a4
pdfa.py: Fix misleading comment
2019-12-11 01:05:47 -08:00
James R. Barlow
a2d89f67c4
Improve help messages for Windows
2019-12-10 01:44:00 -08:00
James R. Barlow
f34130d193
Fixed case where page image was not converted to JPEG
...
If a preprocessing option was used,
and all original images on the page were JPEGs,
and --output-type=pdf, then
images would be saved as Flate instead
of converted to JPEG.
2019-12-10 01:07:59 -08:00
James R. Barlow
c5571388e2
Improve test coverage of _sync.py
2019-12-10 01:06:27 -08:00
James R. Barlow
9af59c0d6d
docs: improvements for Windows
2019-12-09 21:39:01 -08:00
James R. Barlow
55ae838cb7
azure: fix extra build step
2019-12-09 21:11:50 -08:00
James R. Barlow
c434b97f55
docs: more install notes
2019-12-09 16:18:01 -08:00
James R. Barlow
607eee198d
tests: split out preprocessing tests
2019-12-09 16:18:01 -08:00
James R. Barlow
5e2a7f8a56
tests: speed up several slow tests
2019-12-09 16:17:57 -08:00
James R. Barlow
fd9550acda
Add Azure Pipelines CI/CD
2019-12-09 16:17:33 -08:00
James R. Barlow
b354511ac9
ghostscript: document need to write to stdout when using txtwrite
2019-12-09 16:17:10 -08:00
James R. Barlow
7be293f628
Address tests that fail on Windows with Python 3.7 or 3.6
2019-12-09 16:17:10 -08:00
James R. Barlow
65855dc14c
Fix close_fds=True on Windows Python 3.6
2019-12-09 16:17:10 -08:00
James R. Barlow
cac4a8b9b6
Suppress duplicate error messages from Ghostscript
2019-12-06 15:14:15 -08:00
James R. Barlow
17d97b354a
Ignore mypy cache
2019-12-06 15:11:10 -08:00
James R. Barlow
1c1b60fa9f
Add typing hints for ocr() function
2019-12-06 15:10:54 -08:00
James R. Barlow
6b745d892f
Merge branch 'windows'
2019-12-06 15:09:09 -08:00
James R. Barlow
fbf271a3ec
Remove Tesseract < 4.0 specific check
2019-12-06 15:06:02 -08:00
James R. Barlow
8077718804
Possible fix to loss of log adapter state
2019-12-06 15:05:58 -08:00
James R. Barlow
66bda3420a
docs: caution about using Windows in production
2019-12-06 15:03:20 -08:00
James R. Barlow
f6510e2b15
Document function of symlink shim
2019-12-06 15:00:12 -08:00
James R. Barlow
51abd79136
Tesseract no longer posts an error message if config file not found
2019-12-04 21:35:28 -08:00
James R. Barlow
5607429d9a
tests: error message from tesseract change
2019-12-04 21:31:01 -08:00
James R. Barlow
b8b7ecfe7f
Fix DecompressionBomb related errors due to Windows process differences
2019-12-04 21:10:27 -08:00
James R. Barlow
d4abe88452
docs: sketch Windows install procedure
2019-12-04 21:01:09 -08:00
James R. Barlow
cb3cfaa055
Add Windows install advice
2019-12-04 21:01:09 -08:00
James R. Barlow
9db01c7ff5
Remove test_bad_utf8
...
Removed due to difficulties getting this to work on Python 3.8 and Windows, and the
high probability that this behavior is now gone from Tesseract 4.0+.
Originally added in 2017.
2019-12-04 21:01:09 -08:00
James R. Barlow
d0301813cc
ghosttext: mention page number differences
2019-12-04 21:01:09 -08:00
James R. Barlow
cff37bf681
Make test_german more Windows-friendly
2019-12-04 21:01:09 -08:00
James R. Barlow
66d04dd6e3
Don't expect filenames to be replicated on NT
2019-12-04 21:01:09 -08:00
James R. Barlow
06a1f987d4
Use _OCRMYPDF_TEST_PATH for testing and .py stubs to simulate symlinks
2019-12-04 21:01:06 -08:00
James R. Barlow
e51e21c6b6
ghostscript: Refactor checking for executable name on Windows
2019-12-04 21:01:06 -08:00
James R. Barlow
c5fa72bd4e
ghostscript: use run(check=True) for more consistent error handling
2019-12-04 21:01:00 -08:00
James R. Barlow
bf99587aa1
ghostscript: use correct executable name on Windows
2019-12-04 17:37:38 -08:00
James R. Barlow
d249aef57d
ghostscript: don't use NamedTemporaryFile
...
Temporary files are more awkward for Windows.
2019-12-04 17:37:38 -08:00
James R. Barlow
43ab7c88d7
Remove os_environ() context manager
2019-12-04 17:37:38 -08:00
James R. Barlow
ca9669742d
Move gs tests to test_ghostscript
2019-12-04 17:14:27 -08:00
James R. Barlow
8a1dddc3ee
Don't worry about closed streams on Windows
2019-12-04 17:14:27 -08:00
James R. Barlow
0cd424ffcb
Enforce str-only environment for Windows since it's more strict
2019-12-04 17:14:27 -08:00
James R. Barlow
fde550f9a7
test: Replace many instances of run_ocrmypdf in subprocess with inline
2019-12-04 17:14:27 -08:00
James R. Barlow
a3726e4ce3
Fix test_metadata: use mmap in a Windows and POSIX compatible way
2019-12-04 17:13:52 -08:00
James R. Barlow
4ab0a8ff35
Fix test_single_page_inline_image - remove temp file
2019-12-04 17:13:51 -08:00
James R. Barlow
37f6f72df3
tests: a few Windows fixes
2019-12-04 17:13:51 -08:00