clarify stemming options
This commit is contained in:
parent
54ec944e41
commit
658a21d5ad
@ -125,41 +125,44 @@ databases will not be updated.
|
|||||||
will build the stem expansion database for a given language, which may or
|
will build the stem expansion database for a given language, which may or
|
||||||
may not be part of the list in the configuration file. If the language is
|
may not be part of the list in the configuration file. If the language is
|
||||||
not part of the configuration, the stem expansion database will be deleted
|
not part of the configuration, the stem expansion database will be deleted
|
||||||
during the next normal run. You can get the list of stemmer names from the
|
at the end of the next normal indexing run. You can get the list of stemmer
|
||||||
.B recollindex -m
|
names from the
|
||||||
command. At the time of this writbng, the following languages
|
.B recollindex -l
|
||||||
(abbreviations) are recognized (out of Xapian's stem.h):
|
command. Note that this is mostly for experimental use, the normal way to
|
||||||
.RS
|
add a stemming language is to set it in the configuration, either by
|
||||||
|
editing "recoll.conf" or by using the GUI indexing configuration dialog.
|
||||||
|
.br
|
||||||
|
At the time of this writing, the following languages
|
||||||
|
are recognized (out of Xapian's stem.h):
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
danish (da)
|
danish
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
dutch (nl)
|
dutch
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
english (en) Martin Porter's 2002 revision of his stemmer
|
english Martin Porter's 2002 revision of his stemmer
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
english_lovins (lovins) Lovin's stemmer
|
english_lovins Lovin's stemmer
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
english_porter (porter) Porter's stemmer as described in his 1980 paper
|
english_porter Porter's stemmer as described in his 1980 paper
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
finnish (fi)
|
finnish
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
french (fr)
|
french
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
german (de)
|
german
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
italian (it)
|
italian
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
norwegian (no)
|
norwegian
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
portuguese (pt)
|
portuguese
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
russian (ru)
|
russian
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
spanish (es)
|
spanish
|
||||||
.IP \(bu
|
.IP \(bu
|
||||||
swedish (sv)
|
swedish
|
||||||
.RE
|
.PP
|
||||||
|
|
||||||
.B recollindex -S
|
.B recollindex -S
|
||||||
will rebuild the phonetic/orthographic index. This feature uses the
|
will rebuild the phonetic/orthographic index. This feature uses the
|
||||||
.B aspell
|
.B aspell
|
||||||
|
|||||||
@ -27,6 +27,10 @@ recollq \- command line / standard output Recoll query command.
|
|||||||
|
|
|
|
||||||
.B -a
|
.B -a
|
||||||
]
|
]
|
||||||
|
[
|
||||||
|
.B -s
|
||||||
|
<stemming language>
|
||||||
|
]
|
||||||
<query string>
|
<query string>
|
||||||
|
|
||||||
.SH DESCRIPTION
|
.SH DESCRIPTION
|
||||||
@ -66,6 +70,11 @@ can be used to set the maximum number of results that should be printed.
|
|||||||
If
|
If
|
||||||
.B -d
|
.B -d
|
||||||
is set, the text for the result files contents will be dumped to stdout.
|
is set, the text for the result files contents will be dumped to stdout.
|
||||||
|
.B -s
|
||||||
|
<language>
|
||||||
|
selects the word stemming language. The value should match an existing
|
||||||
|
stemming database (as set in the configuration or added with recollindex -s).
|
||||||
|
|
||||||
.SH SEE ALSO
|
.SH SEE ALSO
|
||||||
.PP
|
.PP
|
||||||
recollindex(1) recollq(1) recoll.conf(5)
|
recollindex(1) recollq(1) recoll.conf(5)
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user