small manual fixes

This commit is contained in:
Jean-Francois Dockes 2012-04-07 15:03:05 +02:00
parent a200d22b34
commit 3509241018

View File

@ -45,12 +45,12 @@
<sect1 id="rcl.introduction.tryit"> <sect1 id="rcl.introduction.tryit">
<title>Giving it a try</title> <title>Giving it a try</title>
<para>If you do not like reading manuals (who does?) and would <para>If you do not like reading manuals (who does?) and would like
like to give &RCL; a try, just perform <link to give &RCL; a try, just <link
linkend="rcl.install.binary">installation</link> and start the linkend="rcl.install.binary">install</link> the application and
<command>recoll</command> user interface, which will index your start the <command>recoll</command> graphical user interface (GUI),
home directory by default, allowing you to search immediately after which will ask to index your home directory by default, allowing
indexing completes.</para> you to search immediately after indexing completes.</para>
<para>Do not do this if your home directory contains a huge <para>Do not do this if your home directory contains a huge
number of documents and you do not want to wait or are very number of documents and you do not want to wait or are very
@ -176,21 +176,24 @@
or by using configuration menus in the or by using configuration menus in the
<command>recoll</command> GUI</para> <command>recoll</command> GUI</para>
<para><link linkend="rcl.indexing.periodic.exec">Indexing</link> <para>The <link linkend="rcl.indexing.periodic.exec">indexing
is started automatically the first time you execute the process</link> is started automatically the first time you
<command>recoll</command> search graphical user interface, or by execute the <command>recoll</command> GUI. Indexing can also be
executing the <command>recollindex</command> command.</para> performed by executing the <command>recollindex</command>
command.</para>
<para><link linkend="rcl.search">Searches</link> are usually <para><link linkend="rcl.search">Searches</link> are usually
performed inside the <command>recoll</command> graphical user performed inside the <command>recoll</command> GUI, which has many
interface (GUI) program, which has many options to help you find options to help you find what you are looking for. However, there
what you are looking for. However, there are other ways to perform are other ways to perform &RCL; searches: mostly a <link
&RCL; searches: mostly a <link linkend="rcl.search.commandline"> linkend="rcl.search.commandline">
command line tool</link>, a command line interface</link>, a
<link linkend="rcl.program.api.python"> <link linkend="rcl.program.api.python">
<application>Python</application> <application>Python</application>
programming interface</link>, and a <link linkend="rcl.searchkio"> programming interface</link>, a <link linkend="rcl.searchkio">
<application>KDE</application> KIO slave module</link>.</para> <application>KDE</application> KIO slave module</link>, and
a <application>Ubuntu Unity Lens</application> module.
</para>
</sect1> </sect1>
</chapter> </chapter>
@ -251,25 +254,27 @@
<link linkend="rcl.indexing.config">configuration files</link>.</para> <link linkend="rcl.indexing.config">configuration files</link>.</para>
<para>Most file types, like HTML or word processing files, only hold <para>Most file types, like HTML or word processing files, only hold
one document. Some file types, like mail folder files or zip one document. Some file types, like email folders or zip
archives, can hold many individually indexed documents, which may archives, can hold many individually indexed documents, which may
in turn be themselves compound ones. Such hierarchies can go quite in turn be themselves compound ones. Such hierarchies can go quite
deep, and &RCL; has no problem processing, for example, an ms-word deep, and &RCL; can process, for example, an
document which would be an attachment to an email message part of <application>ms-word</application>
a folder file archived inside a zip file...</para> document stored as an attachment to an email message inside an
email folder archived in a zip file...</para>
<para>&RCL; indexing processes plain text, HTML, openoffice <para>&RCL; indexing processes plain text, HTML, OpenDocument
and e-mail files, and a few others internally.</para> (Open/LibreOffice), email formats, and a few others internally.</para>
<para>Other file types (ie: postscript, pdf, ms-word, rtf ...) <para>Other file types (ie: postscript, pdf, ms-word, rtf ...)
need external applications for preprocessing. The list is in the need external applications for preprocessing. The list is in the
<link linkend="rcl.install.external"> installation</link> <link linkend="rcl.install.external"> installation</link>
section. After every indexing operation, &RCL; updates a list of section. After every indexing operation, &RCL; updates a list of
commands that would be needed for indexing existing files commands that would be needed for indexing existing files
types. This list can be displayed from the types. This list can be displayed by selecting the menu option
<command>recoll</command> <guilabel>File</guilabel> menu. It is <guilabel>File</guilabel>-><guilabel>Show Missing Helpers</guilabel>
stored in the <filename>missing</filename> text file in the <command>recoll</command> GUI. It is stored in the
inside the configuration directory.</para> <filename>missing</filename> text file inside the configuration
directory.</para>
<para>Without further configuration, &RCL; will index all <para>Without further configuration, &RCL; will index all
appropriate files from your home directory, with a reasonable appropriate files from your home directory, with a reasonable
@ -353,9 +358,9 @@ recoll
indexed).</para> indexed).</para>
<para>Of course, images, sound and video do not increase the <para>Of course, images, sound and video do not increase the
index size, which means that it will be quite typical nowadays index size, which means that nowadays (2012), typically, even a big
(2006), that even a big index will be negligible against the index will be negligible against the total amount of data on the
total amount of data on the computer.</para> computer.</para>
<para>The index data directory (<filename>xapiandb</filename>) <para>The index data directory (<filename>xapiandb</filename>)
only contains data that can be completely rebuilt by an index run only contains data that can be completely rebuilt by an index run
@ -456,14 +461,20 @@ recoll
option.)</para> option.)</para>
<para>The interface is started from the <para>The interface is started from the
<guilabel>Preferences</guilabel> menu. It has two main <guilabel>Preferences</guilabel>-><guilabel>Indexing
panels. The first panel allows setting global variables, like Configuration</guilabel> menu entry. It is divided in three tabs,
the list of top directories or the list of skipped paths. The <guilabel>Global parameters</guilabel>, <guilabel>Local
second panel allows setting variables that can be redefined parameters</guilabel>, and <guilabel>Beagle web history</guilabel>,
for subdirectories. This second panel has an initially empty list of which is explained in the next section.</para>
customisation directories, to which you can add. The variables
are then set for the currently selected directory (or at the top <para>The first tab allows setting global variables, like the lists
level if the empty line is selected).</para> of top directories, skipped paths, or stemming languages.</para>
<para>The second tab allows setting variables that can be redefined
for subdirectories. This second tab has an initially empty list of
customisation directories, to which you can add. The variables are
then set for the currently selected directory (or at the top level
if the empty line is selected).</para>
<para>The meaning for most entries in the interface is <para>The meaning for most entries in the interface is
self-evident and documented by a <literal>ToolTip</literal> self-evident and documented by a <literal>ToolTip</literal>
@ -538,15 +549,17 @@ recoll
if canceled).</para> if canceled).</para>
<para>The <command>recollindex</command> indexing process can be <para>The <command>recollindex</command> indexing process can be
interrupted by sending an interrupt (^C, SIGINT) or terminate interrupted by sending an interrupt (Ctrl-C, SIGINT) or terminate
(SIGTERM) signal. Some time may elapse before the process exits, (SIGTERM) signal. Some time may elapse before the process exits,
because it needs to properly flush and close the index. The because it needs to properly flush and close the index. This can
indexing thread can be equivalently stopped from the menu.</para> also be done from the <command>recoll</command> GUI
<guilabel>File</guilabel>-><guilabel>Stop Indexing</guilabel>
menu entry.</para>
<para>After such an interruption, the index will be somewhat <para>After such an interruption, the index will be somewhat
inconsistent because some operations which are normally performed inconsistent because some operations which are normally performed
at the end of the indexing pass will have been skipped (for at the end of the indexing pass will have been skipped (for
exemple, the stemming and spelling databases will be inexistant example, the stemming and spelling databases will be inexistant
or out of date). You just need to restart indexing at a later or out of date). You just need to restart indexing at a later
time to restore consistency. The indexing will restart at the time to restore consistency. The indexing will restart at the
interruption point (the full file tree will be traversed, interruption point (the full file tree will be traversed,
@ -593,7 +606,8 @@ recoll
<para>As of version 1.17 the &RCL; GUI has dialogs to manage <para>As of version 1.17 the &RCL; GUI has dialogs to manage
<filename>crontab</filename> entries for <filename>crontab</filename> entries for
<command>recollindex</command>. You can reach them from the <command>recollindex</command>. You can reach them from the
<guimenu>Preferences->Indexing Schedule</guimenu> menu. They only <guilabel>Preferences</guilabel>-><guilabel>Indexing
Schedule</guilabel> menu. They only
work with the good old <command>cron</command>, and do not give work with the good old <command>cron</command>, and do not give
access to all features of <command>cron</command> scheduling.</para> access to all features of <command>cron</command> scheduling.</para>
@ -669,12 +683,13 @@ fvwm
on the log level.</para> on the log level.</para>
<para>When building &RCL;, the real time indexing support can be <para>When building &RCL;, the real time indexing support can be
customised during package customised during package <link
<link linkend="rcl.install.building.build">configuration</link> linkend="rcl.install.building.build">configuration</link> with the
with the <literal>--with[out]-fam</literal> or <literal>--with[out]-fam</literal> or
<literal>--with[out]-inotify</literal> options. The default is <literal>--with[out]-inotify</literal> options. The default is
currently to include inotify monitoring on systems that support currently to include <application>inotify</application> monitoring
it, and, as of recoll 1.17, gamin support on FreeBSD.</para> on systems that support it, and, as of recoll 1.17,
<application>gamin</application> support on FreeBSD.</para>
<para>While it is convenient that data is indexed in real time, <para>While it is convenient that data is indexed in real time,
repeated indexing can generate a significant load on the repeated indexing can generate a significant load on the
@ -729,7 +744,7 @@ fvwm
<para>In most cases, you can enter the terms as you <para>In most cases, you can enter the terms as you
think them, even if they contain embedded punctuation or other think them, even if they contain embedded punctuation or other
non-textual characters. For non-textual characters. For
exemple, &RCL; can handle things like e-mail addresses, or example, &RCL; can handle things like email addresses, or
arbitrary cut and paste from another text window, punctation arbitrary cut and paste from another text window, punctation
and all.</para> and all.</para>
@ -967,7 +982,7 @@ fvwm
that you can't actually visualize the folder (there will be an that you can't actually visualize the folder (there will be an
error dialog if you try). &RCL; is unfortunately not yet smart error dialog if you try). &RCL; is unfortunately not yet smart
enough to disable the entry in this case. In other cases, the enough to disable the entry in this case. In other cases, the
<guilabel>Open</guilabel> option makes sense, for exemple to <guilabel>Open</guilabel> option makes sense, for example to
start a <application>chm</application> viewer on the parent start a <application>chm</application> viewer on the parent
document for a help page.</para> document for a help page.</para>
@ -1023,7 +1038,7 @@ fvwm
create a new preview window. The old one stays open until you create a new preview window. The old one stays open until you
close it.</para> close it.</para>
<para>You can close a preview tab by typing <keycap>^W</keycap> <para>You can close a preview tab by typing <keycap>Ctrl-W</keycap>
(<keycap>Ctrl</keycap> + <keycap>W</keycap>) in the (<keycap>Ctrl</keycap> + <keycap>W</keycap>) in the
window. Closing the last tab for a window will also close the window. Closing the last tab for a window will also close the
window.</para> window.</para>
@ -1047,7 +1062,7 @@ fvwm
<keycap>F3</keycap> inside the text area to get to the next <keycap>F3</keycap> inside the text area to get to the next
occurrence.</para> occurrence.</para>
<para>If you have a search string entered and you use ^Up/^Down <para>If you have a search string entered and you use Ctrl-Up/Ctrl-Down
to browse the results, the search is initiated for each successive to browse the results, the search is initiated for each successive
document. If the string is found, the cursor will be positioned document. If the string is found, the cursor will be positioned
at the first occurrence of the search string.</para> at the first occurrence of the search string.</para>
@ -1059,8 +1074,8 @@ fvwm
the main text but in one of the fields.</para> the main text but in one of the fields.</para>
<para>You can print the current preview window contents by typing <para>You can print the current preview window contents by typing
<keycap>^P</keycap> (<keycap>Ctrl</keycap> + <keycap>P</keycap>) in <keycap>Ctrl-P</keycap> (<keycap>Ctrl</keycap> +
the window text.</para> <keycap>P</keycap>) in the window text.</para>
</sect2> </sect2>
@ -1556,19 +1571,19 @@ fvwm
</formalpara> </formalpara>
<formalpara><title>Closing previews</title> <formalpara><title>Closing previews</title>
<para>Entering <keycap>^W</keycap> in a tab will <para>Entering <keycap>Ctrl-W</keycap> in a tab will
close it (and, for the last tab, close the preview close it (and, for the last tab, close the preview
window). Entering <keycap>Esc</keycap> will close the preview window). Entering <keycap>Esc</keycap> will close the preview
window and all its tabs.</para> window and all its tabs.</para>
</formalpara> </formalpara>
<formalpara><title>Printing previews</title> <formalpara><title>Printing previews</title>
<para>Entering <keycap>^P</keycap> in a preview window will print <para>Entering <keycap>Ctrl-P</keycap> in a preview window will print
the currently displayed text.</para> the currently displayed text.</para>
</formalpara> </formalpara>
<formalpara><title>Quitting</title> <formalpara><title>Quitting</title>
<para>Entering <keycap>^Q</keycap> almost anywhere will <para>Entering <keycap>Ctrl-Q</keycap> almost anywhere will
close the application.</para> close the application.</para>
</formalpara> </formalpara>
</sect3> </sect3>
@ -1605,9 +1620,10 @@ fvwm
on startup. The default value is empty, but there is a on startup. The default value is empty, but there is a
skeleton style sheet (<filename>recoll.qss</filename>) skeleton style sheet (<filename>recoll.qss</filename>)
inside the <filename>/usr/share/recoll/examples</filename> inside the <filename>/usr/share/recoll/examples</filename>
directory. Using a style sheet, you can change most Recoll directory. Using a style sheet, you can change most
graphical parameters: colors, fonts, etc. See the sample <command>recoll</command> graphical parameters: colors,
file for a few simple examples.</para> fonts, etc. See the sample file for a few simple
examples.</para>
</listitem> </listitem>
<listitem><para><guilabel>Maximum text size highlighted for <listitem><para><guilabel>Maximum text size highlighted for
@ -1847,7 +1863,7 @@ fvwm
<para>No more detail will be given about the header part (only <para>No more detail will be given about the header part (only
useful with the WebKit build), if there are restrictions to useful with the WebKit build), if there are restrictions to
what you can do, they are beyond this author's HTML/CSS/Javascript what you can do, they are beyond this author's HTML/CSS/Javascript
abilities... There are a few exemples on the abilities... There are a few examples on the
<ulink url="http://www.recoll.org/custom.html">page about <ulink url="http://www.recoll.org/custom.html">page about
customising the result list</ulink> on the &RCL; web site.</para> customising the result list</ulink> on the &RCL; web site.</para>
@ -2143,7 +2159,7 @@ text/html [file:///Users/uncrypted-dockes/projets/bateaux/ilur/factEtCie/r
<replaceable>potatoes</replaceable> (in any part of the document).</para> <replaceable>potatoes</replaceable> (in any part of the document).</para>
<para>An element is composed of an optional field specification, <para>An element is composed of an optional field specification,
and a value, separated by a colon. Exemple: and a value, separated by a colon. Example:
<replaceable>Beatles</replaceable>, <replaceable>Beatles</replaceable>,
<replaceable>author:balzac</replaceable>, <replaceable>author:balzac</replaceable>,
<replaceable>dc:title:grandet</replaceable> </para> <replaceable>dc:title:grandet</replaceable> </para>
@ -2180,7 +2196,7 @@ text/html [file:///Users/uncrypted-dockes/projets/bateaux/ilur/factEtCie/r
<replaceable>title:prejudice title:pride</replaceable>, and is <replaceable>title:prejudice title:pride</replaceable>, and is
unlikely to find a result.</para> unlikely to find a result.</para>
<para>Modifiers can be set on a phrase clause, for exemple to specify <para>Modifiers can be set on a phrase clause, for example to specify
a proximity search (unordered). See a proximity search (unordered). See
<link linkend="rcl.search.lang.modifiers">the modifier <link linkend="rcl.search.lang.modifiers">the modifier
section</link>.</para> section</link>.</para>
@ -2226,7 +2242,7 @@ text/html [file:///Users/uncrypted-dockes/projets/bateaux/ilur/factEtCie/r
</listitem> </listitem>
<listitem><para><literal>size</literal> for filtering the <listitem><para><literal>size</literal> for filtering the
results on file size. Exemple: results on file size. Example:
<literal>size&lt;10000</literal>. You can use <literal>size&lt;10000</literal>. You can use
<literal>&lt;</literal>, <literal>&gt;</literal> or <literal>&lt;</literal>, <literal>&gt;</literal> or
<literal>=</literal> as operators. You can specify a range like the <literal>=</literal> as operators. You can specify a range like the
@ -2250,7 +2266,7 @@ text/html [file:///Users/uncrypted-dockes/projets/bateaux/ilur/factEtCie/r
The days and months parts may be missing. If the The days and months parts may be missing. If the
<literal>/</literal> is present but an element is missing, the <literal>/</literal> is present but an element is missing, the
missing element is interpreted as the lowest or highest date in the missing element is interpreted as the lowest or highest date in the
index. Exemples:</para> index. Examples:</para>
<itemizedlist> <itemizedlist>
<listitem><para><literal>2001-03-01/2002-05-01</literal> the <listitem><para><literal>2001-03-01/2002-05-01</literal> the
basic syntax for an interval of dates.</para> basic syntax for an interval of dates.</para>
@ -2572,7 +2588,7 @@ text/html [file:///Users/uncrypted-dockes/projets/bateaux/ilur/factEtCie/r
<literal>Subject:</literal> for email) when indexing. This is not <literal>Subject:</literal> for email) when indexing. This is not
essential.</para> essential.</para>
<para>You should look to one of the simple filters, for exemple <para>You should look to one of the simple filters, for example
<literal>rclps</literal> for a starting point.</para> <literal>rclps</literal> for a starting point.</para>
<para>Don't forget to make your filter executable before <para>Don't forget to make your filter executable before
@ -3104,7 +3120,7 @@ while query.next >= 0 and query.next < nres:
<para>You will only have to check or install <link <para>You will only have to check or install <link
linkend="rcl.install.external">supporting applications</link> linkend="rcl.install.external">supporting applications</link>
for the file types that you want to index beyond those that are for the file types that you want to index beyond those that are
natively processed by &RCL; (text, HTML, mail files, and a few natively processed by &RCL; (text, HTML, email files, and a few
others).</para> others).</para>
<para>You should also maybe have a look at the <para>You should also maybe have a look at the
@ -3276,13 +3292,13 @@ while query.next >= 0 and query.next < nres:
<listitem><para>Konqueror webarchive format with Python (uses the <listitem><para>Konqueror webarchive format with Python (uses the
Tarfile module).</para></listitem> Tarfile module).</para></listitem>
<listitem><para>mimehtml web archive format (support based on the mail <listitem><para>mimehtml web archive format (support based on the email
filter, which introduces some mild weirdness, but still filter, which introduces some mild weirdness, but still
usable).</para></listitem> usable).</para></listitem>
</itemizedlist> </itemizedlist>
<para>Text, HTML, mail folders, and Scribus files are <para>Text, HTML, email folders, and Scribus files are
processed internally. <application>Lyx</application> is used to processed internally. <application>Lyx</application> is used to
index Lyx files. Many filters need <command>iconv</command> and the index Lyx files. Many filters need <command>iconv</command> and the
standard <command>sed</command> and <command>awk</command>. standard <command>sed</command> and <command>awk</command>.
@ -3628,7 +3644,7 @@ skippedNames = #* bin CVS Cache cache* caughtspam tmp .thumbnails .svn \
<para>The list in the default configuration does not <para>The list in the default configuration does not
exclude hidden directories (names beginning with a exclude hidden directories (names beginning with a
dot), which means that it may index quite a few things dot), which means that it may index quite a few things
that you do not want. On the other hand, mail user that you do not want. On the other hand, email user
agents like <application>thunderbird</application> agents like <application>thunderbird</application>
usually store messages in hidden directories, and you usually store messages in hidden directories, and you
probably want this indexed. One possible solution is to probably want this indexed. One possible solution is to
@ -3835,7 +3851,7 @@ skippedPaths = ~/somedir/&lowast;.txt
<varlistentry><term><literal>maildefcharset</literal></term> <varlistentry><term><literal>maildefcharset</literal></term>
<listitem><para>This can be used to define the default <listitem><para>This can be used to define the default
character set specifically for mail messages which don't character set specifically for email messages which don't
specify it. This is mainly useful for readpst (libpst) dumps, specify it. This is mainly useful for readpst (libpst) dumps,
which are utf-8 but do not say so.</para> which are utf-8 but do not say so.</para>
</listitem> </listitem>
@ -4098,9 +4114,9 @@ mondelaypatterns = *.log:20 "this one has spaces*:10"
<varlistentry> <varlistentry>
<term>filter-specific sections</term> <term>filter-specific sections</term>
<listitem><para>Some filters may need specific <listitem><para>Some filters may need specific
configuration for handling fields. Only the mail message filter configuration for handling fields. Only the email message filter
currently has such a section (named currently has such a section (named
<literal>[mail]</literal>). It allows indexing arbitrary mail <literal>[mail]</literal>). It allows indexing arbitrary email
headers in addition to the ones indexed by default. Other such headers in addition to the ones indexed by default. Other such
sections may appear in the future.</para> sections may appear in the future.</para>
</listitem> </listitem>
@ -4110,9 +4126,9 @@ mondelaypatterns = *.log:20 "this one has spaces*:10"
<para>Here follows a small example of a personal <para>Here follows a small example of a personal
<filename>fields</filename> <filename>fields</filename>
file. This would extract a specific mail header and file. This would extract a specific email header and
use it as a searchable field, with data displayable inside result use it as a searchable field, with data displayable inside result
lists. (Side note: as the mail filter does no decoding on the values, lists. (Side note: as the email filter does no decoding on the values,
only plain ascii headers can be indexed, and only the only plain ascii headers can be indexed, and only the
first occurrence will be used for headers that occur several times). first occurrence will be used for headers that occur several times).