This commit is contained in:
Jean-Francois Dockes 2020-02-28 14:37:19 +01:00
parent 5e39a0eb18
commit 0fa0c273ab
2 changed files with 14 additions and 7 deletions

View File

@ -708,7 +708,7 @@ with possibly meaning-altering missing words.</para></listitem></varlistentry>
<varlistentry id="RCL.INSTALL.CONFIG.RECOLLCONF.PDFOCR">
<term><varname>pdfocr</varname></term>
<listitem><para>Attempt OCR of PDF files with no text content. This can be defined in subdirectories. The default is off because
OCR is so very slow. Will only do anything if ocrprogs is defined.</para></listitem></varlistentry>
OCR is so very slow.</para></listitem></varlistentry>
<varlistentry id="RCL.INSTALL.CONFIG.RECOLLCONF.PDFATTACH">
<term><varname>pdfattach</varname></term>
<listitem><para>Enable PDF attachment extraction by executing pdftk (if
@ -739,8 +739,11 @@ duplicate values.</para></listitem></varlistentry>
<term><varname>ocrprogs</varname></term>
<listitem><para>OCR modules to try. The top OCR script will try to load the corresponding modules in
order and use the first which reports being capable of performing OCR on
the input file. Modules for tesseract and ABBYY FineReader are present in
the standard distribution.</para></listitem></varlistentry>
the input file. Modules for tesseract (tesseract) and ABBYY FineReader
(abbyy) are present in the standard distribution. For compatibility with
the previous version, if this is not defined at all, the default value is
"tesseract". Use an explicit empty value if needed. A value of "abbyy
tesseract" will try everything.</para></listitem></varlistentry>
<varlistentry id="RCL.INSTALL.CONFIG.RECOLLCONF.OCRCACHEDIR">
<term><varname>ocrcachedir</varname></term>
<listitem><para>Location for caching OCR data. The default if this is empty or undefined is to store the cached

View File

@ -9650,8 +9650,7 @@ for i in range(nres):
<dd>
<p>Attempt OCR of PDF files with no text content.
This can be defined in subdirectories. The
default is off because OCR is so very slow. Will
only do anything if ocrprogs is defined.</p>
default is off because OCR is so very slow.</p>
</dd>
<dt><a name=
"RCL.INSTALL.CONFIG.RECOLLCONF.PDFATTACH" id=
@ -9714,8 +9713,13 @@ for i in range(nres):
try to load the corresponding modules in order
and use the first which reports being capable of
performing OCR on the input file. Modules for
tesseract and ABBYY FineReader are present in the
standard distribution.</p>
tesseract (tesseract) and ABBYY FineReader
(abbyy) are present in the standard distribution.
For compatibility with the previous version, if
this is not defined at all, the default value is
"tesseract". Use an explicit empty value if
needed. A value of "abbyy tesseract" will try
everything.</p>
</dd>
<dt><a name=
"RCL.INSTALL.CONFIG.RECOLLCONF.OCRCACHEDIR" id=