12 Commits

Author SHA1 Message Date
Jean-Francois Dockes
5fcffb7654 tesseract ocr: use compressed tif temp pages if pdftocairo is available (10x smaller than ppm) 2021-12-04 09:35:10 +01:00
Jean-Francois Dockes
e121695a3c Python handlers: factorise tmp dir code 2021-12-03 11:03:23 +01:00
Jean-Francois Dockes
1593b1d87f Change the way rclpd executes rclocr to avoid the command being killed before it can clean up when a signal is raised (e.g. timeout or kbd interrupt) 2021-12-03 10:49:44 +01:00
Jean-Francois Dockes
174ad9fe22 rcl ocr with tesseract: fix stupid breakage in script 2021-06-13 07:14:51 +01:00
Jean-Francois Dockes
824e305bb0 Add option to limit tesseract threads 2020-12-17 11:08:31 +01:00
Jean-Francois Dockes
96104e7d67 fix rclocrtesseract fix 2020-09-28 11:05:12 +02:00
Jean-Francois Dockes
8accec9b88 rclocrtesseract: unquote tesseractcmd parameter and check existence. 2020-09-24 07:13:21 +02:00
Jean-Francois Dockes
0dd609cf1a python filters: replace misc message printing with single method in rclexecm 2020-09-23 18:38:22 +02:00
Jean-Francois Dockes
e520176a2a OCR: small adjustments for Windows. Works with Tesseract. 2020-02-27 14:10:55 +01:00
Jean-Francois Dockes
abb7ef8803 added ocr module for abbyy 2020-02-27 11:35:23 +01:00
Jean-Francois Dockes
747e37a980 rclocr ckpt: cache+tesseract indexing working 2020-02-26 17:30:12 +01:00
Jean-Francois Dockes
38dfa5f841 1st version of the cached ocr mechanism 2020-02-15 21:19:13 +01:00