From ea5cfa40c181663ad80d38acef8cd2ba2d6cf6c4 Mon Sep 17 00:00:00 2001 From: fritz-hh Date: Fri, 3 Oct 2014 16:13:56 +0200 Subject: [PATCH] Update ROADMAP.md --- ROADMAP.md | 9 +++++++-- 1 file changed, 7 insertions(+), 2 deletions(-) diff --git a/ROADMAP.md b/ROADMAP.md index 64b457e7..0e20ddc6 100644 --- a/ROADMAP.md +++ b/ROADMAP.md @@ -41,15 +41,20 @@ Generate output for each page - For pdf (if output file has a "pdf" extension): - Generate pdf pages from hocr files (note: pdf pages can already exist if OCR has been skipped for them) +- For txt + - generate txt file for each page (containing txt located into hocr file) Build final output ------------------ - For pdf: - Concatenate pdf pages - - Converte to pdf/1-a + - Convert to pdf/1-a - Verify conformity to pdf/1-a - +- For txt + - Concatenate all txt files into the final output txt file + + Tmp folder structure =========================