release 1.17.2

This commit is contained in:
Jean-Francois Dockes 2012-04-06 17:46:33 +02:00
parent 9f402d33cb
commit 34070eb304

View File

@ -235,24 +235,37 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
files with different character sets, encodings, and languages into the files with different character sets, encodings, and languages into the
same index. It has input filters for many document types. same index. It has input filters for many document types.
Stemming depends on the document language. Recoll stores the unstemmed Stemming is the process by which Recoll reduces words to their radicals so
versions of terms and uses auxiliary databases for term expansion. It can that searching does not depend, for example, on a word being singular or
switch stemming languages, or add a language, without re-indexing. Storing plural (floor, floors), or on a verb tense (flooring, floored). Because
documents in different languages in the same index is possible, and useful the mechanisms used for stemming depend on the specific grammatical rules
in practice, but does introduce possibilities of confusion. Recoll for each language, there is a separate stemmer module for most common
currently makes no attempt at automatic language recognition. languages where stemming makes sense. Storing documents written in
different languages in the same index is possible, and commonly done. In
this situation, you can specify several stemming languages for the index.
Recoll stores the unstemmed versions of terms in the main index and uses
auxiliary databases for term expansion (one for each stemming language),
which means that you can switch stemming languages between searches, or
add a language without needing a full reindex. Recoll currently makes no
attempt at automatic language recognition, which means that the stemmer
will sometimes be applied to terms from other languages with potentially
strange results. In practise, even if this introduces possibilities of
confusion, this approach has been proven quite useful, and, awaiting the
addition of an automatic language recognition module to Recoll, it is much
less cumbersome than separating your documents according to what language
they are written in.
Recoll has many parameters which define exactly what to index, and how to Recoll has many parameters which define exactly what to index, and how to
classify and decode the source documents. These are kept in configuration classify and decode the source documents. These are kept in configuration
files. A default configuration is copied into a standard location (usually files. A default configuration is copied into a standard location (usually
something like /usr/[local/]share/recoll/examples) during installation. something like /usr/[local/]share/recoll/examples) during installation.
The default parameters from this file may be overridden by values that you The default values set by the configuration files in this directory may be
set inside your personal configuration, found by default in the .recoll overridden by values that you set inside your personal configuration,
sub-directory of your home directory. The default configuration will index found by default in the .recoll sub-directory of your home directory. The
your home directory with default parameters and should be sufficient for default configuration will index your home directory with default
giving Recoll a try, but you may want to adjust it later, which can be parameters and should be sufficient for giving Recoll a try, but you may
done either by editing the text files or by using configuration menus in want to adjust it later, which can be done either by editing the text
the recoll GUI files or by using configuration menus in the recoll GUI
Indexing is started automatically the first time you execute the recoll Indexing is started automatically the first time you execute the recoll
search graphical user interface, or by executing the recollindex command. search graphical user interface, or by executing the recollindex command.