*** empty log message ***
This commit is contained in:
parent
ce8ebf93d0
commit
5a9b90d26c
@ -24,7 +24,7 @@
|
|||||||
Dockes</holder>
|
Dockes</holder>
|
||||||
</copyright>
|
</copyright>
|
||||||
|
|
||||||
<releaseinfo>$Id: usermanual.sgml,v 1.35 2007-01-15 13:03:35 dockes Exp $</releaseinfo>
|
<releaseinfo>$Id: usermanual.sgml,v 1.36 2007-01-25 15:47:45 dockes Exp $</releaseinfo>
|
||||||
|
|
||||||
<abstract>
|
<abstract>
|
||||||
<para>This document introduces full text search notions
|
<para>This document introduces full text search notions
|
||||||
@ -178,7 +178,7 @@
|
|||||||
is normally incremental: documents will only be processed if
|
is normally incremental: documents will only be processed if
|
||||||
they have been modified. On the first execution, of course, all
|
they have been modified. On the first execution, of course, all
|
||||||
documents will need processing. A full index build can be forced
|
documents will need processing. A full index build can be forced
|
||||||
later on by specifying an option to the indexing command
|
later by specifying an option to the indexing command
|
||||||
(<command>recollindex -z</command>).</para>
|
(<command>recollindex -z</command>).</para>
|
||||||
|
|
||||||
<para>&RCL; indexing can be performed with two different
|
<para>&RCL; indexing can be performed with two different
|
||||||
@ -486,7 +486,7 @@ fvwm
|
|||||||
</chapter>
|
</chapter>
|
||||||
|
|
||||||
<chapter id="rcl.search">
|
<chapter id="rcl.search">
|
||||||
<title>Search</title>
|
<title>Searching</title>
|
||||||
|
|
||||||
<para>The <command>recoll</command> program provides the user
|
<para>The <command>recoll</command> program provides the user
|
||||||
interface for searching. It is based on the
|
interface for searching. It is based on the
|
||||||
@ -510,19 +510,27 @@ fvwm
|
|||||||
</step>
|
</step>
|
||||||
</procedure>
|
</procedure>
|
||||||
|
|
||||||
<para>The initial default search mode is <guilabel>Any
|
<para>The initial default search mode is <guilabel>All
|
||||||
term</guilabel>. This will look for documents with any of the
|
terms</guilabel>. This will look for documents containing all
|
||||||
search terms (the ones with more terms will get better scores).
|
of the search terms (the ones with more terms will get better
|
||||||
<guilabel>All terms</guilabel> will ensure
|
scores). <guilabel>Any term</guilabel> will search for
|
||||||
that only documents with all the terms will be
|
documents where at least one of the terms appear. <guilabel>File
|
||||||
returned. <guilabel>File name</guilabel> will specifically
|
name</guilabel> will specifically look for file names.</para>
|
||||||
look for file names, and allows using wildcards
|
|
||||||
(<literal>*</literal>, <literal>?</literal> ,
|
<para>The fourth entry (<guilabel>Query Language</guilabel>) is
|
||||||
<literal>[]</literal>). </para>
|
described in <link linkend="rcl.search.lang">its own
|
||||||
|
section</link>.</para>
|
||||||
|
|
||||||
|
<para>All search modes allow wildcards inside terms
|
||||||
|
(<literal>*</literal>, <literal>?</literal>,
|
||||||
|
<literal>[]</literal>). You may want to have a look at the
|
||||||
|
<link linkend="rcl.search.wildcards">section about wildcards</link>
|
||||||
|
for more information about this.</para>
|
||||||
|
|
||||||
<para>You can search for exact phrases (adjacent words in a
|
<para>You can search for exact phrases (adjacent words in a
|
||||||
given order) by enclosing the input inside double quotes. Ex:
|
given order) by enclosing the input inside double quotes. Ex:
|
||||||
<literal>"virtual reality"</literal>.</para>
|
<literal>"virtual reality"</literal>.</para>
|
||||||
|
|
||||||
<para>Character case has no influence on search, except that you
|
<para>Character case has no influence on search, except that you
|
||||||
can disable stem expansion for any term by capitalizing it. Ie:
|
can disable stem expansion for any term by capitalizing it. Ie:
|
||||||
a search for <literal>floor</literal> will also normally look for
|
a search for <literal>floor</literal> will also normally look for
|
||||||
@ -537,7 +545,7 @@ fvwm
|
|||||||
text field). Please note, however, that only the search texts
|
text field). Please note, however, that only the search texts
|
||||||
are remembered, not the mode (all/any/file name).</para>
|
are remembered, not the mode (all/any/file name).</para>
|
||||||
|
|
||||||
<para>Typing <keycap>Esc</keycap> <keycap>Space</keycap>) while
|
<para>Typing <keycap>Esc</keycap> <keycap>Space</keycap> while
|
||||||
entering a word in the simple search entry will open a window
|
entering a word in the simple search entry will open a window
|
||||||
with possible completions for the word. The completions are
|
with possible completions for the word. The completions are
|
||||||
extracted from the database.</para>
|
extracted from the database.</para>
|
||||||
@ -568,7 +576,10 @@ fvwm
|
|||||||
tabs in the existing preview window. You can use
|
tabs in the existing preview window. You can use
|
||||||
<keycap>Shift</keycap>+Click to force the creation of another
|
<keycap>Shift</keycap>+Click to force the creation of another
|
||||||
preview window, which may be useful to view the documents side
|
preview window, which may be useful to view the documents side
|
||||||
by side.</para>
|
by side. (You can also browse successive results in a single
|
||||||
|
preview window by typing
|
||||||
|
<keycap>Shift</keycap>+<keycap>ArrowUp/Down</keycap> in the
|
||||||
|
window).</para>
|
||||||
|
|
||||||
<para>Clicking the <literal>Edit</literal> link will attempt to
|
<para>Clicking the <literal>Edit</literal> link will attempt to
|
||||||
start an external viewer. The viewers can be configured through the
|
start an external viewer. The viewers can be configured through the
|
||||||
@ -618,9 +629,11 @@ fvwm
|
|||||||
|
|
||||||
<para>The <guilabel>Preview</guilabel> and
|
<para>The <guilabel>Preview</guilabel> and
|
||||||
<guilabel>Edit</guilabel> entries do the same thing as the
|
<guilabel>Edit</guilabel> entries do the same thing as the
|
||||||
corresponding links. The two following entries will copy either
|
corresponding links.</para>
|
||||||
an URL or the file path to the clipboard, for pasting into
|
|
||||||
another application.</para>
|
<para>The <guilabel>Copy File Name</guilabel> and
|
||||||
|
<guilabel>Copy Url</guilabel> copy the relevant data to the
|
||||||
|
clipboard, for later pasting.</para>
|
||||||
|
|
||||||
<para>The <guilabel>Find similar</guilabel> entry will select
|
<para>The <guilabel>Find similar</guilabel> entry will select
|
||||||
a number of relevant term from the current document and enter
|
a number of relevant term from the current document and enter
|
||||||
@ -628,10 +641,6 @@ fvwm
|
|||||||
search, with a good chance of finding documents related to the
|
search, with a good chance of finding documents related to the
|
||||||
current result.</para>
|
current result.</para>
|
||||||
|
|
||||||
<para>The <guilabel>Copy File Name</guilabel> and
|
|
||||||
<guilabel>Copy Url</guilabel> copy the relevant data to the
|
|
||||||
clipboard, for later pasting.</para>
|
|
||||||
|
|
||||||
<para>The <guilabel>Parent document</guilabel> entry will
|
<para>The <guilabel>Parent document</guilabel> entry will
|
||||||
appear for documents which are not actually files but are
|
appear for documents which are not actually files but are
|
||||||
part of, or attached to, a higher level document. This entry
|
part of, or attached to, a higher level document. This entry
|
||||||
@ -653,7 +662,9 @@ fvwm
|
|||||||
<literal>Preview</literal> link inside the result list.</para>
|
<literal>Preview</literal> link inside the result list.</para>
|
||||||
|
|
||||||
<para>Subsequent preview requests for a given search open new
|
<para>Subsequent preview requests for a given search open new
|
||||||
tabs in the existing window.</para>
|
tabs in the existing window (except if you hold the
|
||||||
|
<keycap>Shift</keycap> key while clicking which will open a new
|
||||||
|
window for side by side viewing).</para>
|
||||||
|
|
||||||
<para>Starting another search and requesting a preview will
|
<para>Starting another search and requesting a preview will
|
||||||
create a new preview window. The old one stays open until you
|
create a new preview window. The old one stays open until you
|
||||||
@ -690,12 +701,93 @@ fvwm
|
|||||||
|
|
||||||
</sect1>
|
</sect1>
|
||||||
|
|
||||||
|
<sect1 id="rcl.search.lang">
|
||||||
|
<title>The query language</title>
|
||||||
|
|
||||||
|
<para>The query language processor is activated on the
|
||||||
|
simple search entry when the search mode selector is set to
|
||||||
|
<guilabel>Query Language</guilabel>.</para>
|
||||||
|
|
||||||
|
<para>Here follows a sample request that we are going to
|
||||||
|
explain:</para>
|
||||||
|
<programlisting>
|
||||||
|
mime:message/rfc822 author:"john doe" Beatles OR Lennon Live OR Unplugged -potatoes
|
||||||
|
</programlisting>
|
||||||
|
|
||||||
|
<para>This would search for all email messages with
|
||||||
|
<replaceable>John Doe</replaceable>
|
||||||
|
appearing as a phrase in the <literal>From:</literal> header,
|
||||||
|
and containing either <replaceable>beatles</replaceable> or
|
||||||
|
<replaceable>lennon</replaceable> and either
|
||||||
|
<replaceable>live</replaceable> or
|
||||||
|
<replaceable>unplugged</replaceable> but not
|
||||||
|
<replaceable>potatoes</replaceable>.</para>
|
||||||
|
|
||||||
|
<para>The first element, <literal>mime:message/rfc822</literal>
|
||||||
|
is a special switch that restricts the results to be email
|
||||||
|
messages. There could be several such switches, which would form
|
||||||
|
a list of allowed types.</para>
|
||||||
|
|
||||||
|
<para>The second element <literal>author:"john doe"</literal> is
|
||||||
|
a phrase search limited to a specific field. Phrase searches are
|
||||||
|
specified as usual by enclosing the words in double quotes. The
|
||||||
|
field specification appears before the colon. &RCL; currently
|
||||||
|
manages the following fields:</para>
|
||||||
|
<itemizedlist>
|
||||||
|
<listitem><para><literal>title</literal>,
|
||||||
|
<literal>subject</literal> or <literal>caption</literal> are
|
||||||
|
synonyms which specify data to be searched for in the
|
||||||
|
document title or subject.</para>
|
||||||
|
</listitem>
|
||||||
|
<listitem><para><literal>author</literal> or
|
||||||
|
<literal>from</literal> for searching the documents originators.</para>
|
||||||
|
</listitem>
|
||||||
|
<listitem><para><literal>keyword</literal> for searching the
|
||||||
|
document specified keywords (few documents actually have any).</para>
|
||||||
|
</listitem>
|
||||||
|
</itemizedlist>
|
||||||
|
|
||||||
|
<para>The query language is currently the only way to use the
|
||||||
|
&RCL; field search capability.</para>
|
||||||
|
|
||||||
|
<para>All elements in the search entry are normally combined
|
||||||
|
with an implicit AND. It is possible to specify that elements be
|
||||||
|
OR'ed instead, as in <replaceable>Beatles</replaceable>
|
||||||
|
<literal>OR</literal> <replaceable>Lennon</replaceable>. The
|
||||||
|
<literal>OR</literal> must be entered literally (capitals), and
|
||||||
|
it has priority over the AND associations:
|
||||||
|
<replaceable>word1</replaceable>
|
||||||
|
<replaceable>word2</replaceable> <literal>OR</literal>
|
||||||
|
<replaceable>word3</replaceable>
|
||||||
|
means
|
||||||
|
<replaceable>word1</replaceable> AND
|
||||||
|
(<replaceable>word2</replaceable> <literal>OR</literal>
|
||||||
|
<replaceable>word3</replaceable>)
|
||||||
|
not
|
||||||
|
(<replaceable>word1</replaceable> AND
|
||||||
|
<replaceable>word2</replaceable>) <literal>OR</literal>
|
||||||
|
<replaceable>word3</replaceable>. Do not enter explicit
|
||||||
|
parenthesis, they are not supported for now.</para>
|
||||||
|
|
||||||
|
<para>An entry preceded by a <literal>-</literal> specifies a
|
||||||
|
term that should <emphasis>not</emphasis> appear.</para>
|
||||||
|
|
||||||
|
<para>Words inside phrases and capitalized words are not
|
||||||
|
stem-expanded. Wildcards may be used anywhere.</para>
|
||||||
|
|
||||||
|
<para>You can use the <literal>show query</literal> link at the
|
||||||
|
top of the result list to check the exact query which was
|
||||||
|
finally executed by Xapian.</para>
|
||||||
|
|
||||||
|
</sect1>
|
||||||
|
|
||||||
<sect1 id="rcl.search.complex">
|
<sect1 id="rcl.search.complex">
|
||||||
<title>Complex/advanced search</title>
|
<title>Complex/advanced search</title>
|
||||||
|
|
||||||
<para>The advanced search dialog has fields that will allow a more
|
<para>The advanced search dialog has a number of fields that
|
||||||
refined search. It has a number of entry fields, each of which
|
will allow a more refined search. Each entry field is
|
||||||
is configurable for the following modes:
|
configurable for the following modes:</para>
|
||||||
|
|
||||||
<itemizedlist>
|
<itemizedlist>
|
||||||
<listitem><para>All terms.</para>
|
<listitem><para>All terms.</para>
|
||||||
</listitem>
|
</listitem>
|
||||||
@ -712,16 +804,17 @@ fvwm
|
|||||||
<listitem><para>Filename search with wildcards.</para>
|
<listitem><para>Filename search with wildcards.</para>
|
||||||
</listitem>
|
</listitem>
|
||||||
</itemizedlist>
|
</itemizedlist>
|
||||||
</para>
|
|
||||||
<para>Additional entry fields can be created by clicking the
|
<para>Additional entry fields can be created by clicking the
|
||||||
<guilabel>Add clause</guilabel> button.</para>
|
<guilabel>Add clause</guilabel> button.</para>
|
||||||
|
|
||||||
<para>All relevant fields will be combined by an implicit AND
|
<para>You can choose that all relevant fields will be combined
|
||||||
or OR conjunction. All types of clauses except "phrase" and
|
by either an AND or an OR conjunction. All types of clauses
|
||||||
"near" can accept a mix of single words and phrases enclosed
|
except "phrase" and "near" can accept a mix of single words and
|
||||||
in double quotes. Stemming expansion will be performed for all
|
phrases enclosed in double quotes. Stemming expansion will be
|
||||||
terms not beginning with a capital letter, except for "phrase"
|
performed for all terms not beginning with a capital letter,
|
||||||
clauses.</para>
|
except for terms inside "phrase" clauses. Wildcards will be
|
||||||
|
processed everywhere.</para>
|
||||||
|
|
||||||
<para>Advanced search will also let you search for documents of
|
<para>Advanced search will also let you search for documents of
|
||||||
specific mime types (ie: only <literal>text/plain</literal>, or
|
specific mime types (ie: only <literal>text/plain</literal>, or
|
||||||
@ -764,18 +857,26 @@ fvwm
|
|||||||
<varlistentry>
|
<varlistentry>
|
||||||
<term>Wildcard</term>
|
<term>Wildcard</term>
|
||||||
<listitem><para>In this mode of operation, you can enter a
|
<listitem><para>In this mode of operation, you can enter a
|
||||||
search string with shell-like wildcards (*, ?). ie:
|
search string with shell-like wildcards (*, ?, []). ie:
|
||||||
<replaceable>xapi*</replaceable> .</para></listitem>
|
<replaceable>xapi*</replaceable> would display all index terms
|
||||||
|
beginning with <replaceable>xapi</replaceable>. (More
|
||||||
|
about wildcards <link
|
||||||
|
linkend="rcl.search.wildcards">here</link>).</para></listitem>
|
||||||
</varlistentry>
|
</varlistentry>
|
||||||
|
|
||||||
<varlistentry>
|
<varlistentry>
|
||||||
<term>Regular expression</term>
|
<term>Regular expression</term>
|
||||||
<listitem><para>This mode will accept a regular expression
|
<listitem><para>This mode will accept a regular expression
|
||||||
as input. Example:
|
as input. Example:
|
||||||
<replaceable>word[0-9]+</replaceable> . The regular
|
<replaceable>word[0-9]+</replaceable>. The expression is
|
||||||
expression is anchored by enclosing in
|
implicitely anchored at the beginning. Ie:
|
||||||
<literal>^</literal> and <literal>$</literal> before
|
<replaceable>press</replaceable> will match
|
||||||
execution.</para></listitem>
|
<replaceable>pression</replaceable> but not
|
||||||
|
<replaceable>expression</replaceable>. You can use
|
||||||
|
<replaceable>.*press</replaceable> to match the latter,
|
||||||
|
but be aware that this will cause a full index term list
|
||||||
|
scan, which can be quite long.</para>
|
||||||
|
</listitem>
|
||||||
</varlistentry>
|
</varlistentry>
|
||||||
<varlistentry>
|
<varlistentry>
|
||||||
|
|
||||||
@ -815,6 +916,53 @@ fvwm
|
|||||||
|
|
||||||
</sect1>
|
</sect1>
|
||||||
|
|
||||||
|
<sect1 id="rcl.search.wildcards">
|
||||||
|
<title>More about wildcards</title>
|
||||||
|
<para>All words entered in &RCL; search fields will be processed
|
||||||
|
for wildcard expansion before the request is finally
|
||||||
|
executed.</para>
|
||||||
|
|
||||||
|
<para>The wildcard characters are:</para>
|
||||||
|
|
||||||
|
<itemizedlist>
|
||||||
|
<listitem><para><literal>*</literal> which matches 0 or more
|
||||||
|
characters.</para>
|
||||||
|
</listitem>
|
||||||
|
<listitem><para><literal>?</literal> which matches
|
||||||
|
a single character.</para>
|
||||||
|
</listitem>
|
||||||
|
<listitem><para><literal>[]</literal> which allow
|
||||||
|
defining sets of characters to be matched (ex:
|
||||||
|
<literal>[</literal><userinput>abc</userinput><literal>]</literal>
|
||||||
|
matches a single character which may be 'a' or 'b' or 'c',
|
||||||
|
<literal>[</literal><userinput>0-9</userinput><literal>]</literal>
|
||||||
|
matches any number.</para>
|
||||||
|
</listitem>
|
||||||
|
</itemizedlist>
|
||||||
|
|
||||||
|
<para>You should be aware of a few things before using
|
||||||
|
wildcards.</para>
|
||||||
|
|
||||||
|
<itemizedlist>
|
||||||
|
<listitem><para>Using a wildcard character at the beginning of
|
||||||
|
a word can make for a slow search because &RCL; will have to
|
||||||
|
scan the whole index term list to find the matches.</para>
|
||||||
|
</listitem>
|
||||||
|
<listitem><para>Using a <literal>*</literal> at the end of a
|
||||||
|
word can produce more matches than you would think, and
|
||||||
|
strange search results. You can use the <link
|
||||||
|
linkend="rcl.search.termexplorer">term explorer</link> tool to
|
||||||
|
check what completions exist for a given term. You can also
|
||||||
|
see exactly what search was performed by clicking on the link
|
||||||
|
at the top of the result list. In general, for natural
|
||||||
|
language terms, stem expansion will produce better results
|
||||||
|
than an ending <literal>*</literal> (stem expansion is turned
|
||||||
|
off when any wildcard character appears in the term).</para>
|
||||||
|
</listitem>
|
||||||
|
</itemizedlist>
|
||||||
|
|
||||||
|
</sect1>
|
||||||
|
|
||||||
<sect1 id="rcl.search.multidb">
|
<sect1 id="rcl.search.multidb">
|
||||||
<title>Multiple databases</title>
|
<title>Multiple databases</title>
|
||||||
|
|
||||||
@ -861,14 +1009,14 @@ fvwm
|
|||||||
|
|
||||||
<para>A typical usage scenario for the multiple index feature
|
<para>A typical usage scenario for the multiple index feature
|
||||||
would be for a system administrator to set up a central index
|
would be for a system administrator to set up a central index
|
||||||
for shared data, that you may choose to search, or not, in
|
for shared data, that you choose to search or not in addition to
|
||||||
addition to your personal data. Of course, there are other
|
your personal data. Of course, there are other
|
||||||
possibilities. There are many cases where you know the subset of
|
possibilities. There are many cases where you know the subset of
|
||||||
files that you want to be searched for a given query, and where
|
files that should be searched, and where narrowing the search
|
||||||
restricting the query will much improve the precision of the
|
can improve the results. You can achieve approximately the same
|
||||||
results. This can also be performed with the directory filter in
|
effect with the directory filter in advanced search, but
|
||||||
advanced search, but multiple indexes will have much better
|
multiple indexes will have much better performance and may be
|
||||||
performance and may be worth the trouble.</para>
|
worth the trouble.</para>
|
||||||
|
|
||||||
</sect1>
|
</sect1>
|
||||||
|
|
||||||
@ -1167,10 +1315,10 @@ fvwm
|
|||||||
<filename>/usr/local/recollglobal/xapiandb</filename>).</para>
|
<filename>/usr/local/recollglobal/xapiandb</filename>).</para>
|
||||||
|
|
||||||
<para>Once entered, the indexes will appear in the
|
<para>Once entered, the indexes will appear in the
|
||||||
<guilabel>All indexes</guilabel> list, and you can
|
<guilabel>External indexes</guilabel> list, and you can
|
||||||
chose which ones you want to use at any moment by transferring
|
chose which ones you want to use at any moment by checking or
|
||||||
them to/from the <guilabel>Active indexes</guilabel>
|
unchecking their entries.</para>
|
||||||
list.</para>
|
|
||||||
<para>Your main database (the one the current configuration
|
<para>Your main database (the one the current configuration
|
||||||
indexes to), is always implicitly active. If this is not
|
indexes to), is always implicitly active. If this is not
|
||||||
desirable, you can set up your configuration so that it indexes,
|
desirable, you can set up your configuration so that it indexes,
|
||||||
@ -1292,8 +1440,11 @@ fvwm
|
|||||||
</listitem>
|
</listitem>
|
||||||
</itemizedlist>
|
</itemizedlist>
|
||||||
|
|
||||||
<para>Text, HTML, mail folders and Openoffice files are
|
<para>Text, HTML, mail folders Openoffice and Scribus files
|
||||||
processed internally.</para>
|
are processed internally. Lyx is used to index Lyx files. Many
|
||||||
|
filters need <command>sed</command> and <command>awk</command>.
|
||||||
|
</para>
|
||||||
|
|
||||||
</sect1>
|
</sect1>
|
||||||
|
|
||||||
|
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user