James R. Barlow
43a23e3695
Disable Travis
2020-02-25 22:22:57 -08:00
James R. Barlow
c4ca572afd
Merge branch 'pigmonkey-aur-manual'
2020-02-25 22:21:35 -08:00
Pig Monkey
e04e4565a9
Demonstrate installing the AUR package without a helper
...
This describes how to use the AUR package on a minimal install, as per
the discussion in #494 .
There may be formatting mistakes. I don't use RST myself, so I wrote the
instructions in Markdown, converted via Pandoc, and gave the output a
quick comparison against the rest of the installation docs.
2020-02-25 18:59:49 -08:00
knobix
2391fb0be0
Update installation instructions for FreeBSD ( #493 )
...
Python 3.7 is the new default version since 2020Q1 which is reflected in
the new prefix (= py37-).
Also update the current available FreeBSD versions:
* FreeBSD 11.2-RELEASE has reached its End-of-Life in 2019Q4
* FreeBSD 12.1-RELEASE was also introduced in 2019Q4
2020-02-24 23:40:25 -08:00
James R. Barlow
c16f79d51b
docs: add Docker compose configuration for watchdog
2020-02-18 02:50:57 -08:00
James R. Barlow
e3e888efde
Readme: Add another heise article
2020-02-18 02:41:28 -08:00
James R. Barlow
84f7e34ace
Merge branch 'master' of github.com:jbarlow83/OCRmyPDF
2020-02-18 02:11:40 -08:00
Ivan Kuchin
32e2175891
Docker image includes also French, Portuguese and Spanish ( #491 )
2020-02-18 02:10:01 -08:00
James R. Barlow
975abfde9a
docs: archlinux install - yaourt is gone
2020-02-18 02:08:58 -08:00
James R. Barlow
d929ed76c2
docs: typo
2020-02-12 00:07:24 -08:00
James R. Barlow
4a27124eab
Simplify metadata for invalid xml in output
...
Removes possibly non-free resource enron1.pdf.
2020-02-12 00:07:18 -08:00
James R. Barlow
683ffb84e8
Update reqs
v9.6.0
2020-02-10 01:20:33 -08:00
James R. Barlow
2f2602357b
v9.6.0 notes updated
2020-02-10 01:20:12 -08:00
James R. Barlow
a5f429f499
Merge branch 'pr479'
2020-02-10 01:10:30 -08:00
James R. Barlow
bdb7f92131
ifmain -> main()
2020-02-10 01:10:12 -08:00
James R. Barlow
09f15ac4c0
v9.6.0 notes
2020-02-10 01:01:49 -08:00
James R. Barlow
4fdbf55c11
setup: approve pdfminer.six 20200124
2020-02-09 23:50:56 -08:00
James R. Barlow
fe2b07652b
docs: simplify/fix Ubuntu 18.04 install instructions
2020-02-09 23:48:53 -08:00
James R. Barlow
f6d7aa6e33
Refactor page rotation and re-enable message at info level
2020-01-30 17:35:20 -08:00
James R. Barlow
a005d14f91
Merge branch 'master' of github.com:jbarlow83/OCRmyPDF
2020-01-30 16:24:16 -08:00
Matthias Braun
6f66232d44
Fix typos, add instructions for training data ( #477 )
2020-01-30 16:24:41 -08:00
James R. Barlow
b8a780d684
Wait for file based on pikepdf
2020-01-30 12:40:48 -08:00
James R. Barlow
82f393dd09
Order of events
2020-01-30 12:40:19 -08:00
James R. Barlow
4952af1604
watcher: some refactoring
2020-01-28 12:56:19 -08:00
James R. Barlow
bcf77375c0
Fix grammar in output message
2020-01-28 07:33:28 -08:00
Ian Alexander
3eab161771
Update logging and env var extensibility
2020-01-20 10:45:28 -08:00
Ian Alexander
b7f38e976b
Watched folder bug fixes, new flags, and docs updates.
2020-01-20 00:20:29 -08:00
James R. Barlow
a6567f2ae4
v9.5.0 release notes revised
v9.5.0
2020-01-18 01:48:33 -08:00
James R. Barlow
e860c56b75
Fix regression: metadata updates not taking effect
2020-01-17 23:01:37 -08:00
James R. Barlow
2e15d52895
v9.5.0 release notes
2020-01-17 03:11:33 -08:00
James R. Barlow
ce97af5a79
Add OCR quality measurement API
2020-01-17 03:10:27 -08:00
James R. Barlow
3831c4cd4d
Refactor metadata_fixup
2020-01-14 01:10:15 -08:00
James R. Barlow
61a2674317
Skip test that needs chmod when on Windows
v9.4.0
2020-01-06 02:36:04 -08:00
James R. Barlow
9ad8cbf1f6
Fix assert that depends on POSIX-y file handling
2020-01-06 02:02:05 -08:00
James R. Barlow
123fde174d
Don't use debug.log in pytest
...
pytest does not reset the state of logging if we install a file handler,
which will cause FileNotFoundError after the temporary folder is removed.
Semi-related:
https://github.com/pytest-dev/pytest/issues/5502
2020-01-06 01:46:19 -08:00
James R. Barlow
fd991a2380
Allow pdfminer.six 20200104 and update recommended versions
2020-01-05 21:37:28 -08:00
James R. Barlow
6f5d77d930
Also generate log file in temp folder on verbose mode
2020-01-05 21:33:32 -08:00
James R. Barlow
5169ac633b
docs: mention pdfgrep too
2020-01-05 21:32:36 -08:00
James R. Barlow
5b6ab1e003
lept: improve lib not found error message
...
Closes #471
2020-01-05 01:05:53 -08:00
James R. Barlow
8f984bf958
docs: add note on limitations of sidecar file
2020-01-04 16:43:13 -08:00
James R. Barlow
9c5f0d0ec6
Eliminate last use of PyPDF2 from test suite
2020-01-04 16:32:01 -08:00
James R. Barlow
32041c43e1
tests: improve tesseract coverage
2020-01-04 02:35:14 -08:00
James R. Barlow
599028bebb
tesseract: don't explicitly set lstm_use_matrix
...
Apparently tesseract does this own its own as needed.
2020-01-04 01:17:33 -08:00
James R. Barlow
6faa8f7221
logging: always log process arguments and stderr when at debug
...
Also remove ad-hoc logging of this information.
2020-01-01 16:48:48 -08:00
James R. Barlow
a4dc5e365f
logging: fix incorrect usage: logging.Logger()
2020-01-01 16:47:36 -08:00
James R. Barlow
e2a563cc76
logging: create a debug log when -k parameter is issued
2020-01-01 16:47:15 -08:00
James R. Barlow
1037d73efb
tests: use smaller files for ghostscript
2019-12-31 17:20:28 -08:00
James R. Barlow
aeb7b142a9
tests: skip tests not compatible with coverage
...
For reasons not entirely clear, stdout will get some data injected when
pytest-cov is running. Our tests that
check for clean stdout need to ignore this.
We check for an environment variable that is defined only when coverage is
running.
2019-12-31 17:10:51 -08:00
James R. Barlow
422ea9777e
Remove session scope from fixtures
...
pytest seems to prepare os.environ in complex ways, so we want to ensure
these fixtures are not reused.
2019-12-31 17:09:23 -08:00
James R. Barlow
2f1c743227
Rewrite main pool loop
...
pytest-cov documentation recommends using explicit
management of multiprocessing.Pool rather than the context manager.
This is supposed to work better for collecting coverage data, particularly
on Windows.
2019-12-31 16:23:41 -08:00