aspell pass

This commit is contained in:
dockes 2006-09-29 11:43:45 +00:00
parent 650e43c25f
commit 3484138b5d

View File

@ -24,7 +24,7 @@
Dockes</holder> Dockes</holder>
</copyright> </copyright>
<releaseinfo>$Id: usermanual.sgml,v 1.20 2006-09-28 14:30:53 dockes Exp $</releaseinfo> <releaseinfo>$Id: usermanual.sgml,v 1.21 2006-09-29 11:43:45 dockes Exp $</releaseinfo>
<abstract> <abstract>
<para>This document introduces full text search notions <para>This document introduces full text search notions
@ -74,7 +74,7 @@
<para>You do not need to remember in what file or email message you <para>You do not need to remember in what file or email message you
stored a given piece of information. You just ask for related stored a given piece of information. You just ask for related
terms, and the tool will return a list of documents where terms, and the tool will return a list of documents where
those terms are prominent, in a similar way to internet search those terms are prominent, in a similar way to Internet search
engines.</para> engines.</para>
<para>&RCL; tries to determine which documents are most relevant to <para>&RCL; tries to determine which documents are most relevant to
@ -88,12 +88,12 @@
word, not for a specific form or spelling. These different word, not for a specific form or spelling. These different
forms may include plurals, different tenses for a verb, or forms may include plurals, different tenses for a verb, or
terms derived from the same root or <emphasis>stem</emphasis> terms derived from the same root or <emphasis>stem</emphasis>
(exemple: floor, floors, floored, floorings...). &RCL; will by (example: floor, floors, floored, flooring...). &RCL; will by
default expand queries to all such related terms (words that default expand queries to all such related terms (words that
reduce to the same stem). This expansion can be disabled at reduce to the same stem). This expansion can be disabled at
search time.</para> search time.</para>
<para>Stemming, by itself, does not accomodate for misspellings or <para>Stemming, by itself, does not accommodate for misspellings or
phonetic searches. &RCL; currently does not support these phonetic searches. &RCL; currently does not support these
features.</para> features.</para>
@ -121,7 +121,7 @@
the place from which they were indexed. (Actually, there is a the place from which they were indexed. (Actually, there is a
way to reconstruct a document from the information in the way to reconstruct a document from the information in the
index, but the result is not nice, as all formatting, index, but the result is not nice, as all formatting,
punctuation and capitalisation are lost).</para> punctuation and capitalization are lost).</para>
<para>&RCL; stores all internal data in <application>Unicode <para>&RCL; stores all internal data in <application>Unicode
UTF-8</application> format, and it can index files with UTF-8</application> format, and it can index files with
@ -131,7 +131,7 @@
<para>Stemming depends on the document language. &RCL; stores <para>Stemming depends on the document language. &RCL; stores
the unstemmed versions of terms and uses auxiliary databases for the unstemmed versions of terms and uses auxiliary databases for
term expansion. It can switch stemming languages, or add a term expansion. It can switch stemming languages, or add a
language, without reindexing. Storing documents in different language, without re-indexing. Storing documents in different
languages in the same index is possible, and useful in languages in the same index is possible, and useful in
practice, but does introduce possibilities of confusion. &RCL; practice, but does introduce possibilities of confusion. &RCL;
currently makes no attempt at automatic language recognition.</para> currently makes no attempt at automatic language recognition.</para>
@ -144,9 +144,9 @@
(usually something like (usually something like
<filename>/usr/[local/]share/recoll/examples</filename>) <filename>/usr/[local/]share/recoll/examples</filename>)
during installation. The default parameters from this file may during installation. The default parameters from this file may
be overriden by values that you set inside your personal be overridden by values that you set inside your personal
configuration, found by default in the configuration, found by default in the
<filename>.recoll</filename> subdirectory of your home <filename>.recoll</filename> sub-directory of your home
directory. The default configuration will index your home directory. The default configuration will index your home
directory with default parameters and should be sufficient for directory with default parameters and should be sufficient for
giving &RCL; a try, but you may want to adjust it giving &RCL; a try, but you may want to adjust it
@ -267,7 +267,7 @@
<para>As of version 1.4, &RCL; will create the configuration <para>As of version 1.4, &RCL; will create the configuration
directory with a mode of 0700 (access by owner only). As the directory with a mode of 0700 (access by owner only). As the
index data directory is by default a subdirectory of the index data directory is by default a sub-directory of the
configuration directory, this should result in appropriate configuration directory, this should result in appropriate
protection.</para> protection.</para>
@ -285,7 +285,7 @@
<para>Values set in the system-wide configuration file (named <para>Values set in the system-wide configuration file (named
like like
<filename>/usr/[local/]share/recoll/examples/recoll.conf</filename>) <filename>/usr/[local/]share/recoll/examples/recoll.conf</filename>)
can be overriden by those set in the personal one, named can be overridden by those set in the personal one, named
<filename>$HOME/.recoll/recoll.conf</filename> by default or <filename>$HOME/.recoll/recoll.conf</filename> by default or
<filename>$RECOLL_CONFDIR/recoll.conf</filename> if <filename>$RECOLL_CONFDIR/recoll.conf</filename> if
RECOLL_CONFDIR is set.</para> RECOLL_CONFDIR is set.</para>
@ -303,7 +303,7 @@
this document, or in the recoll.conf(5) man page.</para> this document, or in the recoll.conf(5) man page.</para>
<para>The applications needed to index file types other than <para>The applications needed to index file types other than
text, html or email (ie: pdf, postscript, ms-word...) are text, HTML or email (ie: pdf, postscript, ms-word...) are
described in the <link linkend="rcl.install.external">external described in the <link linkend="rcl.install.external">external
packages section</link></para> packages section</link></para>
@ -323,7 +323,7 @@
<para>If the <command>recoll</command> program finds no index <para>If the <command>recoll</command> program finds no index
when it starts, it will automatically start indexing (except when it starts, it will automatically start indexing (except
if cancelled).</para> if canceled).</para>
<para>It is best to avoid interrupting the indexing process, as <para>It is best to avoid interrupting the indexing process, as
this may sometimes leave the index in a bad state. This is this may sometimes leave the index in a bad state. This is
@ -408,7 +408,7 @@
performed. You can use the simple search text entry widget (a performed. You can use the simple search text entry widget (a
combobox) to recall them (click on the thing at the right of the combobox) to recall them (click on the thing at the right of the
text field). Please note, however, that only the search texts text field). Please note, however, that only the search texts
are remembered, not the mode (all/any/filename).</para> are remembered, not the mode (all/any/file name).</para>
<para>Hitting <keycap>^Tab</keycap> (<keycap>Ctrl</keycap> + <para>Hitting <keycap>^Tab</keycap> (<keycap>Ctrl</keycap> +
<keycap>Tab</keycap>) while entering a word in the <keycap>Tab</keycap>) while entering a word in the
@ -469,7 +469,7 @@
<title>The result list right-click menu</title> <title>The result list right-click menu</title>
<para>Apart from the preview and edit links, you can display a <para>Apart from the preview and edit links, you can display a
popup menu by right-clicking over a paragraph in the result pop-up menu by right-clicking over a paragraph in the result
list. This menu has the following entries:</para> list. This menu has the following entries:</para>
<itemizedlist> <itemizedlist>
@ -483,7 +483,7 @@
<para>The <guilabel>Preview</guilabel> and <para>The <guilabel>Preview</guilabel> and
<guilabel>Edit</guilabel> entries do the same thing as the <guilabel>Edit</guilabel> entries do the same thing as the
corresponding links. The two following entries will copy either corresponding links. The two following entries will copy either
an url or the file path to the clipboard, for pasting into an URL or the file path to the clipboard, for pasting into
another application.</para> another application.</para>
<para>The <guilabel>Find similar</guilabel> entry will select <para>The <guilabel>Find similar</guilabel> entry will select
@ -528,13 +528,13 @@
into the <guilabel>Search for:</guilabel> text field and into the <guilabel>Search for:</guilabel> text field and
entering the search string. You can then use the entering the search string. You can then use the
<guilabel>Next</guilabel> and <guilabel>Previous</guilabel> <guilabel>Next</guilabel> and <guilabel>Previous</guilabel>
buttons to find the next/previous occurence. You can also type buttons to find the next/previous occurrence. You can also type
<keycap>F3</keycap> inside the text area to get to the next <keycap>F3</keycap> inside the text area to get to the next
occurrence.</para> occurrence.</para>
<para>If you have a search string entered and you use ^Up/^Down <para>If you have a search string entered and you use ^Up/^Down
to browse the results, the search is initiated for each successive to browse the results, the search is initiated for each successive
document. If the string is found, the cursor will be positionned document. If the string is found, the cursor will be positioned
at the first occurrence of the search string.</para> at the first occurrence of the search string.</para>
</sect1> </sect1>
@ -552,14 +552,14 @@
<para>Advanced search will let you search for documents of specific mime <para>Advanced search will let you search for documents of specific mime
types (ie: only <literal>text/plain</literal>, or types (ie: only <literal>text/plain</literal>, or
<literal>text/html</literal> or <literal>text/HTML</literal> or
<literal>application/pdf</literal> etc...). The state of the <literal>application/pdf</literal> etc...). The state of the
file type selection can be saved as the default (the file type file type selection can be saved as the default (the file type
filter will not be activated at program startup, but the lists filter will not be activated at program start-up, but the lists
will be in the restored state).</para> will be in the restored state).</para>
<para>You can also restrict the search results <para>You can also restrict the search results
to a subtree of the indexed area. If you need to do this often, to a sub-tree of the indexed area. If you need to do this often,
you may think of setting up multiple indexes instead, as the you may think of setting up multiple indexes instead, as the
performance will be much better.</para> performance will be much better.</para>
@ -655,7 +655,7 @@
specified criteria. The currently available criteria are specified criteria. The currently available criteria are
<emphasis>date</emphasis> and <emphasis>mime type</emphasis>.</para> <emphasis>date</emphasis> and <emphasis>mime type</emphasis>.</para>
<para>The sort parameters stay in effect until they are explicitely <para>The sort parameters stay in effect until they are explicitly
reset, or the program exits. An activated sort is indicated in reset, or the program exits. An activated sort is indicated in
the result list header.</para> the result list header.</para>
@ -734,7 +734,7 @@
<para>File names are added as terms during indexing, and you can <para>File names are added as terms during indexing, and you can
specify them as ordinary terms in normal search fields (&RCL; used specify them as ordinary terms in normal search fields (&RCL; used
to index all directories in the file path as terms. This has been to index all directories in the file path as terms. This has been
abandonned as it did not seem really useful). Alternatively, you abandoned as it did not seem really useful). Alternatively, you
can use the specific file name search which will can use the specific file name search which will
<emphasis>only</emphasis> look for file names and can use wildcard <emphasis>only</emphasis> look for file names and can use wildcard
expansion.</para> expansion.</para>
@ -742,7 +742,7 @@
<formalpara><title>Query explanation</title> <formalpara><title>Query explanation</title>
<para>You can get an exact description of what the query <para>You can get an exact description of what the query
looked for, including stem expansion, and boolean operators looked for, including stem expansion, and Boolean operators
used, by clicking on the result list header.</para> used, by clicking on the result list header.</para>
</formalpara> </formalpara>
@ -761,9 +761,9 @@
</sect1> </sect1>
<sect1 id="rcl.search.custom"> <sect1 id="rcl.search.custom">
<title>Customising the search interface</title> <title>Customizing the search interface</title>
<para>It is possible to customise some aspects of the search <para>It is possible to customize some aspects of the search
interface by using <guimenu>Query configuration</guimenu> entry interface by using <guimenu>Query configuration</guimenu> entry
in the <guimenu>Preferences</guimenu> menu.</para> in the <guimenu>Preferences</guimenu> menu.</para>
@ -781,12 +781,12 @@
<listitem><para><guilabel>Result list font</guilabel>: There <listitem><para><guilabel>Result list font</guilabel>: There
is quite a lot of information shown in the result list, and is quite a lot of information shown in the result list, and
you may want to customise the font and/or font size. The rest you may want to customize the font and/or font size. The rest
of the fonts used by &RCL; are determined by your generic QT of the fonts used by &RCL; are determined by your generic QT
config (try the <command>qtconfig</command> command.</para> config (try the <command>qtconfig</command> command.</para>
</listitem> </listitem>
<listitem><para><guilabel>Html help browser</guilabel>: this <listitem><para><guilabel>HTML help browser</guilabel>: this
will let you chose your preferred browser which will be will let you chose your preferred browser which will be
started from the <guimenu>Help</guimenu> menu to read the user started from the <guimenu>Help</guimenu> menu to read the user
manual. You can enter a simple name if the command is in your manual. You can enter a simple name if the command is in your
@ -800,7 +800,7 @@
</listitem> </listitem>
<listitem><para><guilabel>Auto-start simple search on <listitem><para><guilabel>Auto-start simple search on
whitespace entry</guilabel>: if this is checked, a search will white space entry</guilabel>: if this is checked, a search will
be executed each time you enter a space in the simple search be executed each time you enter a space in the simple search
input field. This lets you look at the result list as you input field. This lets you look at the result list as you
enter new terms. This is off by default, you may like it or enter new terms. This is off by default, you may like it or
@ -838,7 +838,7 @@
</listitem> </listitem>
<listitem><para><guilabel>Replace abstracts from <listitem><para><guilabel>Replace abstracts from
documents</guilabel>: this decides if we should synthetize and documents</guilabel>: this decides if we should synthesize and
display an abstract in place of an explicit abstract found display an abstract in place of an explicit abstract found
within the document itself.</para> within the document itself.</para>
</listitem> </listitem>
@ -866,11 +866,11 @@
<para>Once entered, the indexes will appear in the <para>Once entered, the indexes will appear in the
<guilabel>All indexes</guilabel> list, and you can <guilabel>All indexes</guilabel> list, and you can
chose which ones you want to use at any moment by tranferring chose which ones you want to use at any moment by transferring
them to/from the <guilabel>Active indexes</guilabel> them to/from the <guilabel>Active indexes</guilabel>
list.</para> list.</para>
<para>Your main database (the one the current configuration <para>Your main database (the one the current configuration
indexes to), is always implicitely active. If this is not indexes to), is always implicitly active. If this is not
desirable, you can set up your configuration so that it indexes, desirable, you can set up your configuration so that it indexes,
for example, an empty directory.</para> for example, an empty directory.</para>
@ -890,7 +890,7 @@
will only have to check or install will only have to check or install
<link linkend="rcl.install.external">supporting <link linkend="rcl.install.external">supporting
applications</link> for the file types that you want to index applications</link> for the file types that you want to index
beyond text, html and mail files.</para> beyond text, HTML and mail files.</para>
<sect2 id="rcl.install.binary.package"> <sect2 id="rcl.install.binary.package">
<title>Installing through a package system</title> <title>Installing through a package system</title>
@ -981,12 +981,12 @@
<listitem><para>MP3: &RCL; will use the <listitem><para>MP3: &RCL; will use the
<command>id3info</command> command from the <ulink <command>id3info</command> command from the <ulink
url="http://id3lib.sourceforge.net/">id3lib</ulink> package to url="http://id3lib.sourceforge.net/">id3lib</ulink> package to
extract tag information. Without it, only the filenames will extract tag information. Without it, only the file names will
be indexed.</para> be indexed.</para>
</listitem> </listitem>
</itemizedlist> </itemizedlist>
<para>Text, Html, mail folders and Openoffice files are <para>Text, HTML, mail folders and Openoffice files are
processed internally.</para> processed internally.</para>
</sect1> </sect1>
@ -1001,9 +1001,9 @@
<ulink url="http://www.xapian.org">xapian core package</ulink> <ulink url="http://www.xapian.org">xapian core package</ulink>
(&RCL; development currently uses version 0.9.5), and the <ulink (&RCL; development currently uses version 0.9.5), and the <ulink
url="http://www.trolltech.com/products/qt/index.html">qt url="http://www.trolltech.com/products/qt/index.html">qt
runtime and development packages</ulink> (&RCL; development run-time and development packages</ulink> (&RCL; development
currently uses version 3.3.5, but any 3.3 version is currently uses version 3.3.5, but any 3.3 version is
probably ok).</para> probably OK).</para>
<para>You will most probably be able to find a binary package for <para>You will most probably be able to find a binary package for
<application>qt</application> for your system. You may have to <application>qt</application> for your system. You may have to
@ -1038,7 +1038,7 @@
</listitem> </listitem>
<listitem><para><literal>QMAKESPECS</literal> should <listitem><para><literal>QMAKESPECS</literal> should
be set to the name of one of the be set to the name of one of the
<application>qt</application> mkspecs subdirectories (ie: <application>qt</application> mkspecs sub-directories (ie:
linux-g++).</para> linux-g++).</para>
</listitem> </listitem>
</itemizedlist> </itemizedlist>
@ -1061,11 +1061,11 @@
<userinput>cd recoll-xxx</userinput> <userinput>cd recoll-xxx</userinput>
<userinput>configure</userinput> <userinput>configure</userinput>
<userinput>make</userinput> <userinput>make</userinput>
<userinput>(practises usual hardship-repelling invocations)</userinput> <userinput>(practices usual hardship-repelling invocations)</userinput>
</screen> </screen>
<para>There little autoconfiguration. The <para>There little auto-configuration. The
<command>configure</command> script will mainly link one of <command>configure</command> script will mainly link one of
the system-specific files in the <filename>mk</filename> the system-specific files in the <filename>mk</filename>
directory to <filename>mk/sysconf</filename>. If your system directory to <filename>mk/sysconf</filename>. If your system
@ -1135,7 +1135,7 @@
files, and we will just give a general overview here.</para> files, and we will just give a general overview here.</para>
<para>All configuration files share the same format. For <para>All configuration files share the same format. For
exemple, a short extract of the main configuration file might example, a short extract of the main configuration file might
look as follows:</para> look as follows:</para>
<programlisting> <programlisting>
# Space-separated list of directories to index. # Space-separated list of directories to index.
@ -1159,7 +1159,7 @@
</itemizedlist> </itemizedlist>
<para>Section lines allow redefining some parameters for a <para>Section lines allow redefining some parameters for a
directory subtree. Some of the parameters used for indexing directory sub-tree. Some of the parameters used for indexing
are looked up hierarchically from the more to the less are looked up hierarchically from the more to the less
specific. Not all parameters can be meaningfully redefined, specific. Not all parameters can be meaningfully redefined,
this is specified for each in the next section. </para> this is specified for each in the next section. </para>
@ -1216,7 +1216,7 @@
<programlisting> <programlisting>
*~ #* bin CVS Cache caughtspam tmp *~ #* bin CVS Cache caughtspam tmp
</programlisting> </programlisting>
<para>The list can be redefined for subdirectories, but is only <para>The list can be redefined for sub-directories, but is only
actually changed for the top level ones in actually changed for the top level ones in
<literal>topdirs</literal>.</para> <literal>topdirs</literal>.</para>
<para>The top-level directories are not affected by this <para>The top-level directories are not affected by this
@ -1255,7 +1255,7 @@
filter scripts used to index some types of files. The filter scripts used to index some types of files. The
value should not be changed, except if you want to modify value should not be changed, except if you want to modify
one of the default scripts. The value can be redefined for one of the default scripts. The value can be redefined for
any subdirectory. </para> any sub-directory. </para>
</listitem> </listitem>
</varlistentry> </varlistentry>
@ -1275,7 +1275,7 @@
<listitem><para>The name of the character set used for <listitem><para>The name of the character set used for
files that do not contain a character set definition (ie: files that do not contain a character set definition (ie:
plain text files). This can be redefined for any plain text files). This can be redefined for any
subdirectory. If it is not set at all, the character set sub-directory. If it is not set at all, the character set
used is the one defined by the nls environment (LC_ALL, used is the one defined by the nls environment (LC_ALL,
LC_CTYPE, LANG), or iso8859-1 if nothing is set.</para> LC_CTYPE, LANG), or iso8859-1 if nothing is set.</para>
@ -1292,7 +1292,7 @@
system command as a final step for determining the mime system command as a final step for determining the mime
type for a file (the main procedure uses suffix type for a file (the main procedure uses suffix
associations as defined in the <filename>mimemap</filename> associations as defined in the <filename>mimemap</filename>
file). This can be useful for files with suffixless names, file). This can be useful for files with suffix-less names,
but it will also cause the indexing of many bogus "text" but it will also cause the indexing of many bogus "text"
files.</para> files.</para>
</listitem> </listitem>
@ -1304,7 +1304,7 @@
searches using wild cards. This parameter decides if searches using wild cards. This parameter decides if
file name indexing is performed only for files with mime file name indexing is performed only for files with mime
types that would qualify them for full text indexing, or types that would qualify them for full text indexing, or
for all files inside the selected subtrees, independant of for all files inside the selected subtrees, independently of
mime type.</para> mime type.</para>
</listitem> </listitem>
</varlistentry> </varlistentry>
@ -1357,7 +1357,7 @@
partially redundant with <literal>skippedNames</literal> in partially redundant with <literal>skippedNames</literal> in
the main configuration file, with two differences: it will not the main configuration file, with two differences: it will not
affect directories, and it can be changed for any affect directories, and it can be changed for any
subdirectory.</para> sub-directory.</para>
</sect2> </sect2>
@ -1369,10 +1369,10 @@
display.</para> display.</para>
<para>Changing the indexing parameters is probably not a <para>Changing the indexing parameters is probably not a
good idea except if you are a &RCL; developper.</para> good idea except if you are a &RCL; developers.</para>
<para>You may want to adjust the external viewers defined in <para>You may want to adjust the external viewers defined in
(ie: html is either previewed internally or displayed using (ie: HTML is either previewed internally or displayed using
<application>firefox</application>, but you may prefer <application>firefox</application>, but you may prefer
<application>mozilla</application>, your <application>mozilla</application>, your
<application>openoffice.org</application> <application>openoffice.org</application>