fritz-hh
a5c5353fbd
Create COPYRIGHT.md
2013-04-20 23:22:57 +03:00
fritz-hh
d5a3f76234
OCRmyPDF.sh: various improvements
...
- check if x_dpi = y_dpi
- separate options for image deskewing and cleaning
- exit codes defined as constants
2013-04-20 22:03:23 +02:00
fritz-hh
7c18203845
Update README.md
2013-04-20 00:15:14 +03:00
fritz-hh
d7c238723b
readme: new sections "features" & "Motivation"
2013-04-19 23:00:35 +02:00
fritz-hh
f3e581d162
OCRmyPDF.sh: minor changes
2013-04-19 23:00:00 +02:00
fritz-hh
4f65a31eba
OCRmyPDF.sh: check if utilities are installed
2013-04-19 22:23:28 +02:00
fritz-hh
35d8cffad4
OCRmyPDF.sh: fix error exit not exiting
...
Fixes an error that lead the script not to exit correctly in case more
than 1 image is detected on a page
2013-04-19 21:27:40 +02:00
fritz-hh
0c46a723bd
OCRmyPDF.sh: many improvements!
...
- automatic analysis of jhove validation report
- quiet generation of PDF/A with gs
- deletion of tmp files
- Corrected issue that lead to crash at page 8
- Improved log
2013-04-18 23:13:06 +02:00
fritz-hh
fcac99bc73
OCRmyPDF.sh: code clean-up
2013-04-18 11:16:40 +02:00
fritz-hh
42208aa5fe
Readme: Installation section started
2013-04-18 10:44:10 +02:00
fritz-hh
7c3abea232
OCRmyPDF.sh: page number now with leading zeros
2013-04-18 10:43:28 +02:00
fritz-hh
2c23bca913
OCRmyPDF.sh: conversion to PDF/A added
2013-04-18 10:31:36 +02:00
fritz-hh
4188d702ed
OCRmyPDF.sh: computation of resolution
...
Added compuation of resolution of each PDF page
Added extract of image of pgm if colorspace is Gray (to speed up
computation and save space)
2013-04-14 19:15:01 +02:00
fritz-hh
318c77b934
OCRmyPDF.sh: prepare intelligent image extraction
...
preparation of extraction of the image in the same resolution than the
original image inside the pdf file
2013-04-13 12:35:26 +02:00
fritz-hh
b041c0080b
OCRmyPDF.sh: new cmd line I/F of hocrTransform.py
...
Adapted to new new cmd line I/F of hocrTransform.py
2013-04-11 20:29:10 +02:00
fritz-hh
ed93878851
hocrTransform.py: cmd line interface improved
...
Command line interface improved in order to allow:
- show bounding boxes border
- set OCR resolution
- show text above image
2013-04-11 20:24:37 +02:00
fritz-hh
df56c134e4
hocrTranform.py: moved size computation to init
2013-04-10 16:33:03 +03:00
fritz-hh
c51babfd27
hocrTranform.py: A4 page size corrected
2013-04-10 16:22:15 +03:00
fritz-hh
8fdbfc3c95
hocrTranform.py: license added
2013-04-10 16:19:46 +03:00
fritz-hh
4d378c3b14
hocrTransform: code cleanup
2013-04-09 21:51:39 +02:00
fritz-hh
81d5b7b5e5
readme: warning that still in development
2013-04-09 21:35:47 +02:00
fritz-hh
accc082b91
hocrTransform: code cleanup
2013-04-09 21:35:22 +02:00
fritz-hh
4e4b5ddc58
initial version
2013-04-09 19:00:26 +02:00
fritz-hh
4202826dfa
gitignore, gitattributes and releaseNotes added
2013-04-09 18:54:14 +02:00
fritz-hh
b011ddd2d9
Update README.md
2013-04-09 19:53:17 +03:00
fritz-hh
7972a156fc
Initial commit
2013-04-09 09:44:46 -07:00