2895 Commits

Author SHA1 Message Date
Jim Barlow
3a46ea1f36 Merge branch 'for-upstream/pdftoppm-error' into for-upstream/mono 2014-01-09 16:20:05 -08:00
Jim Barlow
d33779f301 Detect monochrome images and extract them as PBM (1 bpp) 2014-01-09 16:15:24 -08:00
Jim Barlow
d6ea0793b8 Fix ocrPage.sh pdftoppm error on OS X 10.9 2014-01-09 16:04:37 -08:00
fritz-hh
4e5e5bb925 version changed to v2.x 2014-01-08 20:57:55 +01:00
fritz-hh
3232ed8e38 link to releases updated 2014-01-08 20:56:34 +01:00
fritz-hh
29d6748af8 release_notes and readme updated for v2.0-rc1 v2.0-rc1 2014-01-07 23:13:42 +01:00
fritz-hh
828f195071 erroneous exit code corrected 2014-01-07 21:57:18 +01:00
fritz-hh
b0b7e32783 fixes #40 and code cleanup 2014-01-07 21:51:15 +01:00
fritz-hh
c1103c0248 check tesseract version
fixes #41
versions older than 3.02.02 are known to produce invalid hocr output (in
some cases)
2014-01-07 21:04:28 +01:00
fritz-hh
940a016e95 link to issue tracking system added 2014-01-06 23:12:15 +01:00
fritz-hh
c6cc098e47 create symbolic links and not copy
If deskew and/or cleanup is not requested, do not copy the files, but
just create a symbolic link.
This saves disk space and makes the script slightly quicker
2014-01-06 23:08:35 +01:00
fritz-hh
54f47ab89b Minor change 2014-01-06 22:41:43 +01:00
fritz-hh
fc3de64dce Changed debug page name
In order to have the debug page after the normal panel in the final PDF
file
2014-01-06 22:41:29 +01:00
fritz-hh
414c4e3f3c round dpi value correctly 2014-01-06 22:32:11 +01:00
fritz-hh
6a9f38d31e removed unused variables 2014-01-06 22:23:41 +01:00
fritz-hh
aa4256d35c fixes #44
The x/y resolutions are no longer computed separately.
We no longer check whether the x and y resolutions differ (no
measure could be taken anyway if they were not equal...)
2014-01-06 22:23:00 +01:00
fritz-hh
8a1241ba44 minor changes (indentation and fct name) 2014-01-06 22:05:49 +01:00
fritz-hh
7eab052e0f Improved consistency of tmp file names 2014-01-06 22:00:58 +01:00
fritz-hh
552d19e36b v1.1-stable added in release notes 2014-01-06 20:09:46 +01:00
fritz-hh
463b04e795 typo 2014-01-06 20:09:12 +01:00
fritz-hh
c0d8508264 minor change in log msg 2014-01-06 19:30:19 +01:00
fritz-hh
6ef4ba31e2 help and documentation improved 2014-01-05 22:02:12 +01:00
fritz-hh
10a3d26291 default DPI definition moved to cfg file 2014-01-05 22:01:45 +01:00
fritz-hh
ab994b32ee explanations added for no_ligature cfg file 2014-01-05 22:01:10 +01:00
fritz-hh
9352b71d78 Copyright added 2014-01-05 22:00:12 +01:00
fritz-hh
71593421ed minor change 2014-01-05 21:22:31 +01:00
fritz-hh
2754970f37 Echo arguments of script in debug mode 2014-01-04 21:43:41 +01:00
fritz-hh
5945454597 Support for -f option
Fixes #16
2014-01-04 21:24:33 +01:00
fritz-hh
884dbce712 copyright years updated 2014-01-04 21:20:29 +01:00
fritz-hh
8ee1bc6598 Minor change 2014-01-04 21:19:51 +01:00
fritz-hh
7d76c46731 Check if page already contains a font 2014-01-04 18:05:21 +01:00
fritz-hh
f8ccf42c06 path to tmp folder now defined in config.sh 2014-01-04 17:24:35 +01:00
fritz-hh
0abe0f1f10 minor change 2014-01-03 17:00:35 +01:00
fritz-hh
ee8a5d80ff echo also java version in debug mode 2014-01-03 16:27:11 +01:00
fritz-hh
f08893b5c8 Support section added 2014-01-03 16:13:17 +01:00
fritz-hh
41cd88506e Echo version of the used tools
Fixes #35
2014-01-03 15:59:51 +01:00
fritz-hh
081223b138 Delete 2013_09_LED_und_Energiesparlampen.pdf
file committed by mistake... So deleting it now
2013-12-31 23:38:38 +01:00
fritz-hh
4e60c9ba09 Warn user in case of low resolution 2013-12-30 23:55:26 +01:00
fritz-hh
95fe7cd3bc Oversampling + more than 1 img
- Oversampling resolution can now be set from the cmd line (-o option)
- If a page contains more than one image, warn the user but process the
page anyway with a default resolution
2013-12-30 23:44:38 +01:00
fritz-hh
79ec1d994e Automatic oversampling
- If resolution is too low (<250dpi) perform automatic oversampling of
the image
- comments improved
- log messages improved
2013-12-30 22:27:10 +01:00
fritz-hh
045362425f minor change 2013-12-30 19:16:29 +01:00
fritz-hh
2b2637fbc3 minor change 2013-12-30 18:21:03 +01:00
fritz-hh
bfc4f7a28d better resolution handling (fixes #38)
- dpi computation moved into a dedicated function
- do not exit in case of resolution mismatch (fixes #38)
- comments improved
2013-11-29 10:37:09 +01:00
fritz-hh
407670e1f3 Minor change 2013-11-29 10:34:05 +01:00
fritz-hh
d0671d81b5 New log level added (LOG_WARN) 2013-11-29 10:33:46 +01:00
fritz-hh
7a74ebbcc3 comments and log messages improved 2013-11-29 00:39:44 +01:00
fritz-hh
9e69800332 typo 2013-11-28 00:37:57 +01:00
fritz-hh
7542188592 Removed bashism
== does not exist in bourne shell
2013-11-27 23:44:45 +01:00
fritz-hh
b4a23c005d fixes #34
tell GNU parallel to protect against evaluation by the sub shell (-q
flag).
This is required in case the file name passed as argument contains
special characters like "#"
2013-11-27 23:15:54 +01:00
fritz-hh
5e0f8be4b1 Various improvements
- Constants moved to config.sh
- Use "python2" cmd instead of "python"
- few other minor changes
2013-11-27 22:34:21 +01:00