3979 Commits

Author SHA1 Message Date
HuaPai
172ba4cad1 chore: add .idea/ directory to .gitignore
Avoid committing IDE configuration files to version control
2025-03-24 13:05:36 +08:00
James R. Barlow
7b2dd892e5
v16.10.0 release notes v16.10.0 2025-02-26 15:16:18 -08:00
James R. Barlow
c05ed7297c
Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF 2025-02-26 15:16:07 -08:00
Odin Dahlström
c29f58a8b7
Process hOCR textangle attribute 2025-02-26 15:09:59 -08:00
jbarlow
eb303fef1a
Merge pull request #1441 from aliemjay/fix-prog-bar 2025-02-26 15:05:54 -08:00
James R. Barlow
2a55ceadd0
Merge branch 'pr/rugk/1489' 2025-02-26 14:59:06 -08:00
James R. Barlow
71991ad09b
Remove podman 2025-02-26 14:58:43 -08:00
James R. Barlow
bd60d6ccd9
Merge branch 'pr/rugk/1488' 2025-02-26 14:57:46 -08:00
James R. Barlow
83b4469ef1
Word wrap 2025-02-26 14:57:18 -08:00
James R. Barlow
d2a7caf496
Merge branch 'feature/remove-ttyd' 2025-02-26 14:54:40 -08:00
James R. Barlow
ff0ea45bf2
Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF 2025-02-26 14:53:48 -08:00
James R. Barlow
b5bc1d209c
Remove ttyd 2025-02-26 14:53:13 -08:00
rugk
53270b8eb1
Doc: Update docker.rst to use
I prefer to write the name in full, i.e. `jbarlow83/ocrmypdf-alpine`, and I'd also suggest documenting it this way because:
* `docker tag` AFAIK only tags the currently downloaded (i.e. pulled) version of that image
* when a new update comes out, it will not be pulled automatically, and one would have to pull and tag the image locally again
* The `docker tag` command is easily overlooked; if users just run `docker run ocrmypdf`, this may or may not work, depending on how the name is resolved.
   Also, AFAIK if someone got Docker to register https://hub.docker.com/ocrmypdf, that image would suddenly be used instead of yours (currently `podman pull docker.io/ocrmypdf` returns a 404 for me, though)
* It is more common to write at least the user namespace and the project there, to prevent such errors.

Also, by default [Docker has many shortcuts for this and e.g. assumes Docker Hub is always being used](https://stackoverflow.com/questions/37861791/how-are-docker-image-names-parsed). Podman usually does not, which is why I personally prefer the full and unambiguous `docker.io/jbarlow83/ocrmypdf-alpine:latest`, e.g. for alpine. This makes clear not only which version is used, but also where it is pulled from (should one have configured different Docker registries).
2025-02-26 02:43:46 +01:00
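The name-resolution shortcuts described in the commit message above can be illustrated with a small sketch. This is a simplified approximation of Docker's default behaviour, not code from this repository: the function name `normalize_image_name` is hypothetical, digests (`@sha256:...`) are ignored, and Podman may resolve short names differently depending on its registry configuration.

```python
DEFAULT_REGISTRY = "docker.io"   # Docker assumes Docker Hub when no registry is given
DEFAULT_NAMESPACE = "library"    # namespace of official images on Docker Hub
DEFAULT_TAG = "latest"


def normalize_image_name(name: str) -> str:
    """Expand a short image reference roughly the way Docker does by default.

    Hypothetical helper for illustration only; digests are not handled.
    """
    # A colon is a tag separator only if it appears after the last slash;
    # otherwise it belongs to a registry port such as localhost:5000.
    last_slash = name.rfind("/")
    colon = name.rfind(":")
    if colon > last_slash:
        repo, tag = name[:colon], name[colon + 1:]
    else:
        repo, tag = name, DEFAULT_TAG

    # The first path component is a registry host only if it looks like one.
    first, sep, rest = repo.partition("/")
    if sep and ("." in first or ":" in first or first == "localhost"):
        registry, path = first, rest
    else:
        registry, path = DEFAULT_REGISTRY, repo

    # Single-component names on Docker Hub refer to official images.
    if registry == DEFAULT_REGISTRY and "/" not in path:
        path = f"{DEFAULT_NAMESPACE}/{path}"

    return f"{registry}/{path}:{tag}"


if __name__ == "__main__":
    for short in ("ocrmypdf", "jbarlow83/ocrmypdf-alpine"):
        print(short, "->", normalize_image_name(short))
```

Under these assumptions, a bare `ocrmypdf` expands to an official-image path rather than to `jbarlow83/ocrmypdf-alpine`, which is the ambiguity the commit message warns about.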
rugk
3049a10757
doc: Update docker.rst to explain how to use with podman
I struggled with this myself, getting a permission error like this one:
```shell
OutputFileAccessError: Output file location (./output.pdf) is not a writable file.
```

I found https://github.com/containers/podlet?tab=readme-ov-file#in-a-container and loosely followed it, explaining the required flags in a similar way but adapted for this tool (which likely won't be used much on system files).

I've tested it and it works fine for me. The same issue may occur with rootless Docker, but I guess people will get that and I cannot test it here.
2025-02-26 02:30:25 +01:00
jbarlow
53002f65d9
Merge pull request #1485 from alexpdp7/main
Correct the installation instructions for Windows
2025-02-22 11:18:14 -08:00
alex
acea9529ea Correct the installation instructions for Windows 2025-02-22 10:46:22 +01:00
James R. Barlow
32322a9fe9
Fix broken test_hocrtransform_matches_sandwich
Expect word similarity rather than exact match. Difference appears to be due to quote styles.

Thanks @QuLogic for reporting.
2025-02-09 13:57:50 -08:00
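As an illustration of checking word similarity rather than an exact match, as the commit above describes, here is a minimal sketch. It is not the actual test code: the helper `word_similarity` and the sample strings are assumptions, and the standard-library `difflib.SequenceMatcher` stands in for whatever comparison the test uses.

```python
from difflib import SequenceMatcher


def word_similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio computed over whitespace-separated words.

    Hypothetical helper for illustration; not taken from the test suite.
    """
    return SequenceMatcher(None, a.split(), b.split()).ratio()


if __name__ == "__main__":
    # Two renderings of the same text that differ only in quote style.
    curly = "He said \u201chello\u201d to everyone"
    straight = 'He said "hello" to everyone'

    print(curly == straight)                  # exact match fails
    print(word_similarity(curly, straight))   # most words still agree
```

A test written this way would assert that the ratio exceeds some threshold instead of asserting equality, so differences in quote style between OCR outputs do not cause a failure.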
James R. Barlow
6b09129911
Fix github release yaml 2025-02-07 16:25:29 -08:00
James R. Barlow
e4274a956d
v16.9.0 release notes v16.9.0 2025-02-07 00:53:08 -08:00
James R. Barlow
19af116034
Tidy whitespace 2025-02-06 00:40:35 -08:00
jbarlow
a5896c45e8
Merge pull request #1466 from 0dinD/fix-hocr-caption 2025-02-06 00:38:59 -08:00
Odin Dahlström
b7d63f3dc1 Process ocr_caption lines 2025-01-30 17:49:06 +01:00
James R. Barlow
137b054f43
Adjust test again for older Ghostscript 2025-01-27 23:44:37 -08:00
James R. Barlow
e6daa28c6d
Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF 2025-01-25 16:55:53 -08:00
James R. Barlow
2512093076
Don't build PDF documentation anymore to fix RTD build issues 2025-01-15 16:17:12 -08:00
Quentin Fuxa
66bc4a3733
Improve docs for _progressbar.py (#1456) 2025-01-09 12:21:21 -08:00
James R. Barlow
65df44f670
Modify tests to deal with variety of Ghostscript versions 2025-01-09 02:14:29 -08:00
James R. Barlow
6edc749023
Fix error handling when PDF contains an invalid image with both ImageMask and ColorSpace set
Fixes #1453
2025-01-07 00:27:07 -08:00
James R. Barlow
cff98d258e
Upgrade to Alpine 3.21 2025-01-05 08:53:02 -08:00
James R. Barlow
d1fc77e1b6
docs: add imgconverter v16.8.0 2025-01-04 12:39:55 -08:00
James R. Barlow
17eed0529a
Update notes 2025-01-04 12:21:46 -08:00
James R. Barlow
f02353686d
s/input/output 2025-01-04 12:18:07 -08:00
James R. Barlow
32813a3c3d
Merge remote-tracking branch 'origin/dependabot/github_actions/astral-sh/setup-uv-5' 2025-01-04 12:10:18 -08:00
James R. Barlow
073a434ab3
Fix webservice interactions with Docker 2025-01-04 12:09:32 -08:00
James R. Barlow
f390e7f9d1
Add cache to Dockerfiles 2025-01-04 12:09:11 -08:00
James R. Barlow
bfbe571f12
docs: fix more rst formatting issues 2025-01-04 10:59:52 -08:00
James R. Barlow
368568b8ea
Change yaml strings in release script 2025-01-04 01:05:27 -08:00
James R. Barlow
55e7177dbe
Present similar interface in webservice.py 2025-01-04 01:04:58 -08:00
James R. Barlow
b486df7e2d
docs: auto update year 2025-01-04 01:04:29 -08:00
James R. Barlow
74a84b6ae9
Fix numerous documentation build problems 2025-01-03 12:23:42 -08:00
James R. Barlow
cfebf1dc8b
alpine: fix pyarrow name again 2025-01-01 20:30:12 -08:00
James R. Barlow
1aaff4af6f
alpine: use pyarrow package for webservice 2025-01-01 18:45:27 -08:00
James R. Barlow
36c82e0659
Add debugging helper scripts 2025-01-01 18:03:15 -08:00
James R. Barlow
522f9d5f56
Merge branch 'pr/pajowu/1448' 2025-01-01 18:00:52 -08:00
James R. Barlow
796e424ee5
graft: Handle stack underflow 2025-01-01 18:00:39 -08:00
James R. Barlow
d87db6cad0
Merge branch 'pr/pajowu/1446' 2025-01-01 17:50:33 -08:00
James R. Barlow
dd6ed4c5f8
Switch to streamlit based web app 2025-01-01 17:26:22 -08:00
James R. Barlow
206bab74bc
Improve diagnostics for unidentified image 2025-01-01 17:13:57 -08:00
dependabot[bot]
b333480749
Bump astral-sh/setup-uv from 4 to 5
Bumps [astral-sh/setup-uv](https://github.com/astral-sh/setup-uv) from 4 to 5.
- [Release notes](https://github.com/astral-sh/setup-uv/releases)
- [Commits](https://github.com/astral-sh/setup-uv/compare/v4...v5)

---
updated-dependencies:
- dependency-name: astral-sh/setup-uv
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-12-23 10:08:40 +00:00
James R. Barlow
f71a5ffd61
hocr: comment typo 2024-12-23 01:46:00 -08:00