synonyms documentation
This commit is contained in:
parent
15110701cc
commit
df8dcbcce0
@ -2975,6 +2975,52 @@ text/html [file:///Users/uncrypted-dockes/projets/bateaux/ilur/factEtCie/r
|
|||||||
</programlisting>
|
</programlisting>
|
||||||
</sect1>
|
</sect1>
|
||||||
|
|
||||||
|
<sect1 id="RCL.SEARCH.SYNONYMS">
|
||||||
|
<title>Using Synonyms (&RCL; 1.22 and later)</title>
|
||||||
|
|
||||||
|
<para>There are a number of different uses for synonyms in text
|
||||||
|
search. They can be used at index time (either to increase or decrease
|
||||||
|
the number of indexed terms), or at query time, to reduce user terms to
|
||||||
|
a set of canonical ones, or to expand queries to match texts containing
|
||||||
|
synonyms of the user terms.</para>
|
||||||
|
|
||||||
|
<para>Only the last approach is used in &RCL;. Synonym groups can be
|
||||||
|
defined so that a user query term which is found to be part of a
|
||||||
|
synonym group will be optionally expanded into an OR query for all
|
||||||
|
synonyms.</para>
|
||||||
|
|
||||||
|
<para>In practise, synonym groups are defined inside ordinary text
|
||||||
|
files. Each line in the file defines a group. Example:
|
||||||
|
<programlisting>
|
||||||
|
hi hello "good morning"
|
||||||
|
|
||||||
|
# not sure about до свидания though. Is this english ?
|
||||||
|
bye goodbye "see you" \
|
||||||
|
"до свидания"
|
||||||
|
</programlisting>
|
||||||
|
As usual lines beginning with a <literal>#</literal> are comments,
|
||||||
|
empty lines are ignored, and lines can be continued by ending them with
|
||||||
|
a backslash.
|
||||||
|
</para>
|
||||||
|
|
||||||
|
<para>The synonyms are searched for matches with user terms after these
|
||||||
|
are stem-expanded, but the contents of the synonyms file itself is not
|
||||||
|
subjected to stem expansion (1.22). This means that a match
|
||||||
|
will not be found if the form present in the synonyms file is not
|
||||||
|
present anywhere in the document set.</para>
|
||||||
|
|
||||||
|
<para>Multi-word synonyms are supported, but be aware that these will
|
||||||
|
generate phrase queries, which may degrade performance (and also, no
|
||||||
|
stemming).</para>
|
||||||
|
|
||||||
|
<para>A synonyms file can be specified in the GUI preferences, or as an
|
||||||
|
option to <command>recollq</command>.</para>
|
||||||
|
|
||||||
|
<para>This feature is new in &RCL; 1.22 and will probably need to be
|
||||||
|
refined after some user feedback.</para>
|
||||||
|
|
||||||
|
</sect1>
|
||||||
|
|
||||||
<sect1 id="RCL.SEARCH.PTRANS">
|
<sect1 id="RCL.SEARCH.PTRANS">
|
||||||
<title>Path translations</title>
|
<title>Path translations</title>
|
||||||
|
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user