From df8dcbcce01c9786c668a509bd85130b2c644700 Mon Sep 17 00:00:00 2001
From: Jean-Francois Dockes
Date: Sat, 21 Nov 2015 13:22:11 +0100
Subject: [PATCH] synonyms documentation

---
 src/doc/user/usermanual.xml | 46 +++++++++++++++++++++++++++++++++++++
 1 file changed, 46 insertions(+)

diff --git a/src/doc/user/usermanual.xml b/src/doc/user/usermanual.xml
index 3743515d..1af2c93f 100644
--- a/src/doc/user/usermanual.xml
+++ b/src/doc/user/usermanual.xml
@@ -2975,6 +2975,52 @@ text/html [file:///Users/uncrypted-dockes/projets/bateaux/ilur/factEtCie/r
+
+ Using Synonyms (&RCL; 1.22 and later)
+
+ There are a number of different uses for synonyms in text
+ search. They can be used at index time (either to increase or decrease
+ the number of indexed terms), or at query time, to reduce user terms to
+ a set of canonical ones, or to expand queries to match texts containing
+ synonyms of the user terms.
+
+ Only the last approach is used in &RCL;. Synonym groups can be
+ defined so that a user query term which is found to be part of a
+ synonym group can optionally be expanded into an OR query for all
+ synonyms in the group.
+
+ In practice, synonym groups are defined inside ordinary text
+ files. Each line in the file defines a group. Example:
+
+hi hello "good morning"
+
+# not sure about до свидания though. Is this English?
+bye goodbye "see you" \
+ "до свидания"
+
+ As usual, lines beginning with a # are comments,
+ empty lines are ignored, and lines can be continued by ending them with
+ a backslash.
+
+
+ The synonyms are searched for matches with user terms after these
+ are stem-expanded, but the contents of the synonyms file itself are not
+ subjected to stem expansion (1.22). This means that a match
+ will not be found if the form present in the synonyms file is not
+ present anywhere in the document set.
+
+ Multi-word synonyms are supported, but be aware that these will
+ generate phrase queries, which may degrade performance and are not
+ subject to stemming.
+
+ A synonyms file can be specified in the GUI preferences, or as an
+ option to recollq.
+
+ This feature is new in &RCL; 1.22 and will probably need to be
+ refined after some user feedback.
+
+
+
 Path translations
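
The file format described in the patch (one group per line, # comments, blank lines ignored, backslash continuation, double quotes around multi-word synonyms) can be sketched in a few lines of Python. This is only an illustration of the documented rules, not Recoll's own parser: the names parse_synonyms and expand_term are hypothetical, matching is assumed to be exact and case-sensitive, stem expansion is not modelled, and the OR string is just a readable rendering of the expanded group.

```python
import re

# A token is either a double-quoted phrase or a run of non-blank characters.
_TOKEN = re.compile(r'"([^"]*)"|(\S+)')


def parse_synonyms(text):
    """Return the synonym groups found in `text` as lists of terms."""
    groups = []
    pending = ""  # accumulates a logical line across backslash continuations
    for raw in text.splitlines():
        line = pending + raw.strip()
        if line.endswith("\\"):
            # Continuation: drop the backslash and keep reading.
            pending = line[:-1] + " "
            continue
        pending = ""
        if not line or line.startswith("#"):
            continue  # blank line or comment
        groups.append([quoted or word for quoted, word in _TOKEN.findall(line)])
    return groups


def expand_term(term, groups):
    """Render `term` as an OR of its synonym group, or unchanged if it has none."""
    for group in groups:
        if term in group:
            # Multi-word synonyms are quoted so that they read as phrases.
            return " OR ".join(f'"{t}"' if " " in t else t for t in group)
    return term


SAMPLE = '''hi hello "good morning"

# not sure about до свидания though. Is this English?
bye goodbye "see you" \\
    "до свидания"
'''

if __name__ == "__main__":
    groups = parse_synonyms(SAMPLE)
    print(expand_term("hello", groups))
    print(expand_term("goodbye", groups))
```

A term that belongs to no group is returned unchanged, which mirrors the optional nature of the expansion described in the patch.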