doc
This commit is contained in:
parent
425fb717dd
commit
edb13a64ef
@ -20,8 +20,8 @@ alink="#0000FF">
|
|||||||
<div class="titlepage">
|
<div class="titlepage">
|
||||||
<div>
|
<div>
|
||||||
<div>
|
<div>
|
||||||
<h1 class="title"><a name="idp33265328" id=
|
<h1 class="title"><a name="idp45717744" id=
|
||||||
"idp33265328"></a>Recoll user manual</h1>
|
"idp45717744"></a>Recoll user manual</h1>
|
||||||
</div>
|
</div>
|
||||||
|
|
||||||
<div>
|
<div>
|
||||||
@ -109,13 +109,13 @@ alink="#0000FF">
|
|||||||
multiple indexes</a></span></dt>
|
multiple indexes</a></span></dt>
|
||||||
|
|
||||||
<dt><span class="sect2">2.1.3. <a href=
|
<dt><span class="sect2">2.1.3. <a href=
|
||||||
"#idp65268448">Document types</a></span></dt>
|
"#idp51287344">Document types</a></span></dt>
|
||||||
|
|
||||||
<dt><span class="sect2">2.1.4. <a href=
|
<dt><span class="sect2">2.1.4. <a href=
|
||||||
"#idp65288128">Indexing failures</a></span></dt>
|
"#idp51307024">Indexing failures</a></span></dt>
|
||||||
|
|
||||||
<dt><span class="sect2">2.1.5. <a href=
|
<dt><span class="sect2">2.1.5. <a href=
|
||||||
"#idp65295136">Recovery</a></span></dt>
|
"#idp51314032">Recovery</a></span></dt>
|
||||||
</dl>
|
</dl>
|
||||||
</dd>
|
</dd>
|
||||||
|
|
||||||
@ -488,6 +488,29 @@ alink="#0000FF">
|
|||||||
</div>
|
</div>
|
||||||
</div>
|
</div>
|
||||||
|
|
||||||
|
<p>This document introduces full text search notions and
|
||||||
|
describes the installation and use of the <span class=
|
||||||
|
"application">Recoll</span> application. This version
|
||||||
|
describes <span class="application">Recoll</span> 1.22.</p>
|
||||||
|
|
||||||
|
<p><span class="application">Recoll</span> was for a long
|
||||||
|
time dedicated to Unix-like systems. It was only lately
|
||||||
|
(2015) ported to <span class="application">MS-Windows</span>.
|
||||||
|
Many references in this manual, especially file locations,
|
||||||
|
are specific to Unix, and not valid on <span class=
|
||||||
|
"application">Windows</span>. Some described features are
|
||||||
|
also not available on <span class=
|
||||||
|
"application">Windows</span>.</p>
|
||||||
|
|
||||||
|
<p>The manual will be progressively updated for <span class=
|
||||||
|
"application">Windows</span>. Until this happens, most
|
||||||
|
references to files can be translated by looking under the
|
||||||
|
Recoll installation directory (esp. the <code class=
|
||||||
|
"filename">Share</code> subdirectory). The user configuration
|
||||||
|
is stored by default under <code class=
|
||||||
|
"filename">AppData/Local/Recoll</code> inside the user
|
||||||
|
directory, along with the index itself.</p>
|
||||||
|
|
||||||
<div class="sect1">
|
<div class="sect1">
|
||||||
<div class="titlepage">
|
<div class="titlepage">
|
||||||
<div>
|
<div>
|
||||||
@ -544,8 +567,8 @@ alink="#0000FF">
|
|||||||
</div>
|
</div>
|
||||||
|
|
||||||
<p><span class="application">Recoll</span> is a full text
|
<p><span class="application">Recoll</span> is a full text
|
||||||
search application. Full text search finds your data by
|
search application, which means that it finds your data by
|
||||||
content rather than by external attributes (like a file
|
content rather than by external attributes (like the file
|
||||||
name). You specify words (terms) which should or should not
|
name). You specify words (terms) which should or should not
|
||||||
appear in the text you are looking for, and receive in
|
appear in the text you are looking for, and receive in
|
||||||
return a list of matching documents, ordered so that the
|
return a list of matching documents, ordered so that the
|
||||||
@ -679,28 +702,25 @@ alink="#0000FF">
|
|||||||
it is much less cumbersome than separating your documents
|
it is much less cumbersome than separating your documents
|
||||||
according to what language they are written in.</p>
|
according to what language they are written in.</p>
|
||||||
|
|
||||||
<p>Before version 1.18, <span class=
|
<p>By default, <span class="application">Recoll</span>
|
||||||
"application">Recoll</span> stripped most accents and
|
strips most accents and diacritics from terms, and converts
|
||||||
diacritics from terms, and converted them to lower case
|
them to lower case before either storing them in the index
|
||||||
before either storing them in the index or searching for
|
or searching for them. As a consequence, it is impossible
|
||||||
them. As a consequence, it was impossible to search for a
|
to search for a particular capitalization of a term
|
||||||
particular capitalization of a term (<code class=
|
(<code class="literal">US</code> / <code class=
|
||||||
"literal">US</code> / <code class="literal">us</code>), or
|
"literal">us</code>), or to discriminate two terms based on
|
||||||
to discriminate two terms based on diacritics (<code class=
|
diacritics (<code class="literal">sake</code> /
|
||||||
"literal">sake</code> / <code class=
|
<code class="literal">saké</code>, <code class=
|
||||||
"literal">saké</code>, <code class=
|
|
||||||
"literal">mate</code> / <code class=
|
"literal">mate</code> / <code class=
|
||||||
"literal">maté</code>).</p>
|
"literal">maté</code>).</p>
|
||||||
|
|
||||||
<p>As of version 1.18, <span class=
|
<p><span class="application">Recoll</span> versions 1.18
|
||||||
"application">Recoll</span> can optionally store the raw
|
and newer can optionally store the raw terms, without
|
||||||
terms, without accent stripping or case conversion. In this
|
accent stripping or case conversion. In this configuration,
|
||||||
configuration, it is still possible (and most common) for a
|
default searches will behave as before, but it is possible
|
||||||
query to be insensitive to case and/or diacritics.
|
to perform searches sensitive to case and diacritics. This
|
||||||
Appropriate term expansions are performed before actually
|
is described in more detail in the <a class="link" href=
|
||||||
accessing the main index. This is described in more detail
|
"#RCL.INDEXING.CONFIG.SENS" title=
|
||||||
in the <a class="link" href="#RCL.INDEXING.CONFIG.SENS"
|
|
||||||
title=
|
|
||||||
"2.3.2. Index case and diacritics sensitivity">section
|
"2.3.2. Index case and diacritics sensitivity">section
|
||||||
about index case and diacritics sensitivity</a>.</p>
|
about index case and diacritics sensitivity</a>.</p>
|
||||||
|
|
||||||
@ -711,20 +731,19 @@ alink="#0000FF">
|
|||||||
"2.3. Index configuration">configuration files</a>. A
|
"2.3. Index configuration">configuration files</a>. A
|
||||||
default configuration is copied into a standard location
|
default configuration is copied into a standard location
|
||||||
(usually something like <code class=
|
(usually something like <code class=
|
||||||
"filename">/usr/[local/]share/recoll/examples</code>)
|
"filename">/usr/share/recoll/examples</code>) during
|
||||||
during installation. The default values set by the
|
installation. The default values set by the configuration
|
||||||
configuration files in this directory may be overridden by
|
files in this directory may be overridden by values set
|
||||||
values that you set inside your personal configuration,
|
inside your personal configuration, found by default in the
|
||||||
found by default in the <code class=
|
<code class="filename">.recoll</code> sub-directory of your
|
||||||
"filename">.recoll</code> sub-directory of your home
|
home directory. The default configuration will index your
|
||||||
directory. The default configuration will index your home
|
home directory with default parameters and should be
|
||||||
directory with default parameters and should be sufficient
|
sufficient for giving <span class=
|
||||||
for giving <span class="application">Recoll</span> a try,
|
"application">Recoll</span> a try, but you may want to
|
||||||
but you may want to adjust it later, which can be done
|
adjust it later, which can be done either by editing the
|
||||||
either by editing the text files or by using configuration
|
text files or by using configuration menus in the
|
||||||
menus in the <span class=
|
<span class="command"><strong>recoll</strong></span> GUI.
|
||||||
"command"><strong>recoll</strong></span> GUI. Some other
|
Some other parameters affecting only the <span class=
|
||||||
parameters affecting only the <span class=
|
|
||||||
"command"><strong>recoll</strong></span> GUI are stored in
|
"command"><strong>recoll</strong></span> GUI are stored in
|
||||||
the standard location defined by <span class=
|
the standard location defined by <span class=
|
||||||
"application">Qt</span>.</p>
|
"application">Qt</span>.</p>
|
||||||
@ -943,8 +962,8 @@ alink="#0000FF">
|
|||||||
<div class="titlepage">
|
<div class="titlepage">
|
||||||
<div>
|
<div>
|
||||||
<div>
|
<div>
|
||||||
<h3 class="title"><a name="idp65268448" id=
|
<h3 class="title"><a name="idp51287344" id=
|
||||||
"idp65268448"></a>2.1.3. Document types</h3>
|
"idp51287344"></a>2.1.3. Document types</h3>
|
||||||
</div>
|
</div>
|
||||||
</div>
|
</div>
|
||||||
</div>
|
</div>
|
||||||
@ -1037,8 +1056,8 @@ indexedmimetypes = application/pdf
|
|||||||
<div class="titlepage">
|
<div class="titlepage">
|
||||||
<div>
|
<div>
|
||||||
<div>
|
<div>
|
||||||
<h3 class="title"><a name="idp65288128" id=
|
<h3 class="title"><a name="idp51307024" id=
|
||||||
"idp65288128"></a>2.1.4. Indexing
|
"idp51307024"></a>2.1.4. Indexing
|
||||||
failures</h3>
|
failures</h3>
|
||||||
</div>
|
</div>
|
||||||
</div>
|
</div>
|
||||||
@ -1078,8 +1097,8 @@ indexedmimetypes = application/pdf
|
|||||||
<div class="titlepage">
|
<div class="titlepage">
|
||||||
<div>
|
<div>
|
||||||
<div>
|
<div>
|
||||||
<h3 class="title"><a name="idp65295136" id=
|
<h3 class="title"><a name="idp51314032" id=
|
||||||
"idp65295136"></a>2.1.5. Recovery</h3>
|
"idp51314032"></a>2.1.5. Recovery</h3>
|
||||||
</div>
|
</div>
|
||||||
</div>
|
</div>
|
||||||
</div>
|
</div>
|
||||||
@ -4586,16 +4605,15 @@ bye goodbye "see you" \
|
|||||||
</ul>
|
</ul>
|
||||||
</div>
|
</div>
|
||||||
|
|
||||||
<p>More generally, the path translation facility may be
|
|
||||||
useful whenever the documents paths seen by the indexer are
|
|
||||||
not the same as the ones which should be used at query
|
|
||||||
time.</p>
|
|
||||||
|
|
||||||
<p><span class="application">Recoll</span> has a facility
|
<p><span class="application">Recoll</span> has a facility
|
||||||
for rewriting access paths when extracting the data from
|
for rewriting access paths when extracting the data from
|
||||||
the index. The translations can be defined for the main
|
the index. The translations can be defined for the main
|
||||||
index and for any additional query index.</p>
|
index and for any additional query index.</p>
|
||||||
|
|
||||||
|
<p>The path translation facility will be useful whenever
|
||||||
|
the documents paths seen by the indexer are not the same as
|
||||||
|
the ones which should be used at query time.</p>
|
||||||
|
|
||||||
<p>In the above NFS example, <span class=
|
<p>In the above NFS example, <span class=
|
||||||
"application">Recoll</span> could be instructed to rewrite
|
"application">Recoll</span> could be instructed to rewrite
|
||||||
any <code class="filename">file:///home/me</code> URL from
|
any <code class="filename">file:///home/me</code> URL from
|
||||||
@ -4608,7 +4626,11 @@ bye goodbye "see you" \
|
|||||||
"5.4.7. The ptrans file"><code class=
|
"5.4.7. The ptrans file"><code class=
|
||||||
"filename">ptrans</code></a> configuration file, which can
|
"filename">ptrans</code></a> configuration file, which can
|
||||||
be edited by hand or from the GUI external indexes
|
be edited by hand or from the GUI external indexes
|
||||||
configuration dialog.</p>
|
configuration dialog: <span class=
|
||||||
|
"guimenu">Preferences</span> → <span class=
|
||||||
|
"guimenuitem">External index dialog</span>, then click the
|
||||||
|
<span class="guilabel">Paths translations</span> button on
|
||||||
|
the right below the index list.</p>
|
||||||
|
|
||||||
<div class="note" style=
|
<div class="note" style=
|
||||||
"margin-left: 0.5in; margin-right: 0.5in;">
|
"margin-left: 0.5in; margin-right: 0.5in;">
|
||||||
@ -6958,7 +6980,12 @@ except:
|
|||||||
"application">FreeBSD</span> ports, etc.), or from some
|
"application">FreeBSD</span> ports, etc.), or from some
|
||||||
type of "backports" repository providing versions newer
|
type of "backports" repository providing versions newer
|
||||||
than the standard ones, or found on the <span class=
|
than the standard ones, or found on the <span class=
|
||||||
"application">Recoll</span> WEB site in some cases.</p>
|
"application">Recoll</span> WEB site in some cases. The
|
||||||
|
most up-to-date information about Recoll packages can
|
||||||
|
usually be found on the <a class="ulink" href=
|
||||||
|
"http://www.recoll.org/download.html" target=
|
||||||
|
"_top"><span class="application">Recoll</span> WEB site
|
||||||
|
downloads page</a></p>
|
||||||
|
|
||||||
<p>There used to exist another form of binary install, as
|
<p>There used to exist another form of binary install, as
|
||||||
pre-compiled source trees, but these are just less
|
pre-compiled source trees, but these are just less
|
||||||
@ -6999,6 +7026,18 @@ except:
|
|||||||
</div>
|
</div>
|
||||||
</div>
|
</div>
|
||||||
|
|
||||||
|
<div class="note" style=
|
||||||
|
"margin-left: 0.5in; margin-right: 0.5in;">
|
||||||
|
<h3 class="title">Note</h3>
|
||||||
|
|
||||||
|
<p>The <span class="application">Windows</span>
|
||||||
|
installation of <span class="application">Recoll</span>
|
||||||
|
is self-contained, and only needs Python 2.7 to be
|
||||||
|
externally installed. <span class=
|
||||||
|
"application">Windows</span> users can skip this
|
||||||
|
section.</p>
|
||||||
|
</div>
|
||||||
|
|
||||||
<p><span class="application">Recoll</span> uses external
|
<p><span class="application">Recoll</span> uses external
|
||||||
applications to index some file types. You need to install
|
applications to index some file types. You need to install
|
||||||
them for the file types that you wish to have indexed
|
them for the file types that you wish to have indexed
|
||||||
@ -7638,9 +7677,9 @@ except:
|
|||||||
<p>For each index, there are at least two sets of
|
<p>For each index, there are at least two sets of
|
||||||
configuration files. System-wide configuration files are
|
configuration files. System-wide configuration files are
|
||||||
kept in a directory named like <code class=
|
kept in a directory named like <code class=
|
||||||
"filename">/usr/[local/]share/recoll/examples</code>, and
|
"filename">/usr/share/recoll/examples</code>, and define
|
||||||
define default values, shared by all indexes. For each
|
default values, shared by all indexes. For each index, a
|
||||||
index, a parallel set of files defines the customized
|
parallel set of files defines the customized
|
||||||
parameters.</p>
|
parameters.</p>
|
||||||
|
|
||||||
<p>The default location of the customized configuration is
|
<p>The default location of the customized configuration is
|
||||||
@ -9549,7 +9588,7 @@ application/x-blobapp = exec rclblob
|
|||||||
displayed for the files inside the result lists.
|
displayed for the files inside the result lists.
|
||||||
Icons are normally 64x64 pixels PNG files which
|
Icons are normally 64x64 pixels PNG files which
|
||||||
live in <code class=
|
live in <code class=
|
||||||
"filename">/usr/[local/]share/recoll/images</code>.</p>
|
"filename">/usr/share/recoll/images</code>.</p>
|
||||||
</li>
|
</li>
|
||||||
|
|
||||||
<li class="listitem">
|
<li class="listitem">
|
||||||
@ -9566,9 +9605,9 @@ application/x-blobapp = exec rclblob
|
|||||||
"replaceable"><code>rclblob</code></em> handler should
|
"replaceable"><code>rclblob</code></em> handler should
|
||||||
be an executable program or script which exists inside
|
be an executable program or script which exists inside
|
||||||
<code class=
|
<code class=
|
||||||
"filename">/usr/[local/]share/recoll/filters</code>. It
|
"filename">/usr/share/recoll/filters</code>. It will be
|
||||||
will be given a file name as argument and should output
|
given a file name as argument and should output the
|
||||||
the text or html contents on the standard output.</p>
|
text or html contents on the standard output.</p>
|
||||||
|
|
||||||
<p>The <a class="link" href="#RCL.PROGRAM.FILTERS"
|
<p>The <a class="link" href="#RCL.PROGRAM.FILTERS"
|
||||||
title=
|
title=
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user