From 4d5cd413afb3be60a06825fd36bc9ac0e07cc185 Mon Sep 17 00:00:00 2001 From: Jean-Francois Dockes Date: Mon, 1 Apr 2013 18:40:42 +0200 Subject: [PATCH] doc --- src/doc/user/usermanual.sgml | 62 ++++++++++++++++++++++++++---------- 1 file changed, 46 insertions(+), 16 deletions(-) diff --git a/src/doc/user/usermanual.sgml b/src/doc/user/usermanual.sgml index 43f13b64..5537af54 100644 --- a/src/doc/user/usermanual.sgml +++ b/src/doc/user/usermanual.sgml @@ -2780,14 +2780,16 @@ text/html [file:///Users/uncrypted-dockes/projets/bateaux/ilur/factEtCie/r dir for filtering the - results on file location (Ex: - dir:/home/me/somedir). -dir + results on file location + (Ex: dir:/home/me/somedir). + -dir also works to find results not in the specified directory - (release >= 1.15.8). A tilde inside the value will be expanded - to the home directory. Wildcards will not - be expanded. You cannot use OR with - dir clauses (this restriction may go away in - the future). + (release >= 1.15.8). A tilde inside the value will be + expanded to the home directory. Wildcards will be + expanded, but + please have a + look at an important limitation of wildcards in + path filters. Relative paths also make sense, for example, dir:share/doc would match either @@ -2804,17 +2806,20 @@ dir:recoll dir:src -dir:utils -dir:common utils or common. - Another special aspect of dir clauses is - that the values in the index are not transcoded to UTF-8, and - never lower-cased or unaccented, but stored as binary. This means - that you need to enter the values in the exact lower or upper - case, and that searches for names with diacritics may sometimes - be impossible because of character set conversion - issues. Non-ASCII UNIX file paths are an unending source of - trouble and are best avoided. + You can also use OR conjunctions + with dir: clauses. 
+ + A special aspect of dir clauses is + that the values in the index are not transcoded to UTF-8, and + never lower-cased or unaccented, but stored as binary. This means + that you need to enter the values in the exact lower or upper + case, and that searches for names with diacritics may sometimes + be impossible because of character set conversion + issues. Non-ASCII UNIX file paths are an unending source of + trouble and are best avoided. You need to use double-quotes around the path value if it - contains space characters. + contains space characters. @@ -3094,6 +3099,31 @@ dir:recoll dir:src -dir:utils -dir:common term). + + Wildcards and path filtering + + Due to the way that &RCL; processes wildcards + inside dir path filtering clauses, they + will have a multiplicative effect on the query size. A clause + containing wildcards in several path elements, like, for + example, + dir:/home/me/*/*/docdir, + will almost certainly fail if your indexed tree is of any realistic + size. + + Depending on the case, you may be able to work around + the issue by specifying the path elements more narrowly, with + a constant prefix, or by using two + separate dir: clauses instead of multiple + wildcards, as + in dir:/home/me dir:docdir. The + latter query is not equivalent to the initial one because it + does not specify a number of directory levels, but that's + the best we can do (and it may actually be more useful in + some cases). + + +
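
The "multiplicative effect" described in the new section can be illustrated with a little arithmetic. The sketch below is not Recoll code and makes no claim about how Recoll expands wildcards internally. It only assumes that each wildcard path element is expanded independently against the names known at its level, so that the number of literal paths is the product of the per-element match counts. The directory names and the `expansion_count` helper are hypothetical and exist only for this illustration.

```python
from fnmatch import fnmatchcase
from math import prod

# Hypothetical directory names at each level below /home/me.
# Invented for illustration; this is not data taken from a Recoll index.
level_names = [
    ["projects", "mail", "music", "photos"],    # first level
    ["2011", "2012", "2013", "docdir", "src"],  # second level
]


def expansion_count(pattern_elements, names_per_level):
    """Count the literal paths produced when every path element pattern
    is expanded independently against the names known at its level."""
    counts = []
    for pattern, names in zip(pattern_elements, names_per_level):
        # Case-sensitive match, since dir values are stored as-is.
        counts.append(sum(1 for name in names if fnmatchcase(name, pattern)))
    return prod(counts)


# Shaped like dir:/home/me/*/*/docdir: two wildcard elements.
two_wildcards = expansion_count(["*", "*"], level_names)

# A constant first element narrows the expansion.
one_wildcard = expansion_count(["projects", "*"], level_names)

print(two_wildcards, one_wildcard)
```

With only four and five names per level the difference is small, but the product grows quickly on a real tree with hundreds of names per level, which is consistent with the advice above to use a constant prefix or separate dir: clauses.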