From 6ed2673331b69abd0e5f0ac6f7aa16abaa51a2ec Mon Sep 17 00:00:00 2001
From: dockes
Date: Fri, 13 Jul 2007 10:24:32 +0000
Subject: [PATCH] *** empty log message ***
---
src/INSTALL | 119 +++++++++----------
src/README | 228 +++++++++++++++++++++++++-----------
src/doc/man/recoll.conf.5 | 55 ++++++---
src/qtgui/i18n/recoll_fr.ts | 14 ++-
src/qtgui/i18n/recoll_it.ts | 10 +-
src/qtgui/i18n/recoll_ru.ts | 12 +-
src/qtgui/i18n/recoll_uk.ts | 12 +-
website/BUGS.txt | 16 +++
website/CHANGES.txt | 63 +++++++---
website/download.html | 11 +-
10 files changed, 353 insertions(+), 187 deletions(-)
diff --git a/src/INSTALL b/src/INSTALL
index 10372c6d..88a1923b 100644
--- a/src/INSTALL
+++ b/src/INSTALL
@@ -23,40 +23,35 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
4.4. Configuration overview
+ 4.5. Extending Recoll
+
4.1. Installing a prebuilt copy
- Recoll binary installations are always linked statically to the xapian
- libraries, and have no other dependencies. You will only have to check or
- install supporting applications for the file types that you want to index
- beyond text, HTML and mail files.
+ Recoll binary packages from the Recoll web site are always linked
+ statically to the Xapian libraries, and have no other dependencies. You
+ will only have to check or install supporting applications for the file
+ types that you want to index beyond text, HTML and mail files, and maybe
+ have a look at the configuration section (but this may not be necessary
+ for a quick test with default parameters).
4.1.1. Installing through a package system
If you use a BSD-type port system or a prebuilt package (RPM or other),
- just follow the usual procedure, and maybe have a look at the
- configuration section (but this may not be necessary for a quick test with
- default parameters).
+ just follow the usual procedure for your system.
4.1.2. Installing a prebuilt Recoll
- The unpackaged binary versions are just compressed tar files of a build
- tree, where only the useful parts were kept (executables and sample
- configuration).
+ The unpackaged binary versions on the Recoll web site are just compressed
+ tar files of a build tree, where only the useful parts were kept
+ (executables and sample configuration).
The executable binary files are built with a static link to libxapian and
- libiconv, to make installation easier (no dependencies). However, this
- also means that you cannot change the versions which are used.
+ libiconv, to make installation easier (no dependencies).
After extracting the tar file, you can proceed with installation as if you
had built the package from source (that is, just type make install). The
binary trees are built for installation to /usr/local.
- You may then need to install external applications to process some file
- types that you want indexed (ie: acrobat, postscript ...). See next
- section.
-
- Finally, you may want to have a look at the configuration section.
-
--------------------------------------------------------------------------
Prev Home Next
@@ -120,9 +115,10 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
4.3.1. Prerequisites
At the very least, you will need to download and install the xapian core
- package (Recoll development currently uses version 0.9.5), and the qt
- run-time and development packages (Recoll development currently uses
- version 3.3.5, but any 3.3 version is probably OK).
+ package (Recoll 1.9 normally uses version 1.0.2, but any 0.9 or 1.0.x
+ version will work too), and the qt run-time and development packages
+ (Recoll development currently uses version 3.3.5, but any 3.3 version is
+ probably OK).
You will most probably be able to find a binary package for qt for your
system. You may have to compile Xapian but this is not difficult (if you
@@ -135,8 +131,8 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
4.3.2. Building
Recoll has been built on Linux (redhat7.3, mandriva 2005/6, Fedora Core
- 3/4/5), FreeBSD and Solaris 8. If you build on another system, I would
- very much welcome patches.
+ 3/4/5/6), FreeBSD 5/6, macosx, and Solaris 8. If you build on another
+ system, and need to modify things, I would very much welcome patches.
Depending on the qt configuration on your system, you may have to set the
QTDIR and QMAKESPECS variables in your environment:
@@ -190,9 +186,10 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
Link: HOME
Link: UP
Link: PREVIOUS
+ Link: NEXT
Recoll user manual
- Prev Chapter 4. Installation
+ Prev Chapter 4. Installation Next
--------------------------------------------------------------------------
@@ -334,20 +331,14 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
value, and is the default. The daemversion is specific to the
indexing monitor daemon.
- filtersdir
-
- A directory to search for the external filter scripts used to
- index some types of files. The value should not be changed, except
- if you want to modify one of the default scripts. The value can be
- redefined for any sub-directory.
-
indexstemminglanguages
A list of languages for which the stem expansion databases will be
- built. See recollindex(1) for possible values. You can add a stem
- expansion database for a different language by using recollindex
- -s, but it will be deleted during the next indexing. Only
- languages listed in the configuration file are permanent.
+ built. See recollindex(1) or use the recollindex -l command for
+ possible values. You can add a stem expansion database for a
+ different language by using recollindex -s, but it will be deleted
+ during the next indexing. Only languages listed in the
+ configuration file are permanent.
defaultcharset
@@ -357,6 +348,32 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
character set used is the one defined by the nls environment
(LC_ALL, LC_CTYPE, LANG), or iso8859-1 if nothing is set.
+ maxfsoccuppc
+
+ Maximum file system occupation before we stop indexing. The value
+ is a percentage, corresponding to what the "Capacity" df output
+ column shows. The default value is 0, meaning no checking.
+
+ idxflushmb
+
+ Threshold (megabytes of new text data) where we flush from memory
+ to disk index. Setting this can help control memory usage. A value
+ of 0 means no explicit flushing, letting Xapian use its own
+ default, which is flushing every 10000 documents (memory usage
+ depends on average document size). The default value is 10.
+
+ filtersdir
+
+ A directory to search for the external filter scripts used to
+ index some types of files. The value should not be changed, except
+ if you want to modify one of the default scripts. The value can be
+ redefined for any sub-directory.
+
+ iconsdir
+
+ The name of the directory where recoll result list icons are
+ stored. You can change this if you want different images.
+
guesscharset
Decide if we try to guess the character set of files if no
@@ -389,11 +406,6 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
section or just be the beginning of the text). The default value
is 250.
- iconsdir
-
- The name of the directory where recoll result list icons are
- stored. You can change this if you want different images.
-
aspellLanguage
Language definitions to use when creating the aspell dictionary.
@@ -525,29 +537,10 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
argument and should output the text contents in html format on the
standard output.
- The html could be very minimal like the following example:
-
-
-
-
- some text content
-
-
- You should take care to escape some characters inside the text by
- transforming them into appropriate entities. "&" should be transformed
- into "&", "<" should be transformed into "<".
-
- The character set needs to be specified in the header. It does not need to
- be UTF-8 (Recoll will take care of translating it), but it must be
- accurate for good results.
-
- Recoll will also make use of other header fields if they are present:
- title, description, keywords.
-
- The easiest way to write a new filter is probably to start from an
- existing one.
+ You can find more details about writing a Recoll filter in the section
+ about writing filters
--------------------------------------------------------------------------
- Prev Home
- Building from source Up
+ Prev Home Next
+ Building from source Up Extending Recoll
diff --git a/src/README b/src/README
index bb466d3f..d5089196 100644
--- a/src/README
+++ b/src/README
@@ -11,7 +11,8 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
Copyright (c) 2005 Jean-Francois Dockes
This document introduces full text search notions and describes the
- installation and use of the Recoll application.
+ installation and use of the Recoll application. It currently describes
+ Recoll 1.9.
[ Split HTML / Single HTML ]
@@ -105,6 +106,10 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
4.4.5. Examples of configuration adjustments
+ 4.5. Extending Recoll
+
+ 4.5.1. Writing a document filter
+
----------------------------------------------------------------------
Chapter 1. Introduction
@@ -370,9 +375,10 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
configuration files.
The configuration is documented inside the installation chapter of this
- document, or in the recoll.conf(5) man page. The most immediately useful
- variable you may interested in is probably topdirs, which determines what
- subtrees get indexed.
+ document, or in the recoll.conf(5) man page, but the most current
+ information will most likely be the comments inside the sample file. The
+ most immediately useful variable you may interested in is probably
+ topdirs, which determines what subtrees get indexed.
The applications needed to index file types other than text, HTML or email
(ie: pdf, postscript, ms-word...) are described in the external packages
@@ -660,23 +666,6 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
or lennon and either live or unplugged but not potatoes (in any part of
the document).
- The first element author:"john doe" is a phrase search limited to a
- specific field. Phrase searches are specified as usual by enclosing the
- words in double quotes. The field specification appears before the colon
- (of course this is not limited to phrases, author:Balzac would be ok too).
- Recoll currently manages the following fields:
-
- * title, subject or caption are synonyms which specify data to be
- searched for in the document title or subject.
-
- * author or from for searching the documents originators.
-
- * keyword for searching the document specified keywords (few documents
- actually have any).
-
- The query language is currently the only way to use the Recoll field
- search capability.
-
All elements in the search entry are normally combined with an implicit
AND. It is possible to specify that elements be OR'ed instead, as in
Beatles OR Lennon. The OR must be entered literally (capitals), and it has
@@ -686,8 +675,40 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
An entry preceded by a - specifies a term that should not appear.
+ The first element in the above exemple, author:"john doe" is a phrase
+ search limited to a specific field. Phrase searches are specified as usual
+ by enclosing the words in double quotes. The field specification appears
+ before the colon (of course this is not limited to phrases, author:Balzac
+ would be ok too). Recoll currently manages the following fields:
+
+ * title, subject or caption are synonyms which specify data to be
+ searched for in the document title or subject.
+
+ * author or from for searching the documents originators.
+
+ * keyword for searching the document specified keywords (few documents
+ actually have any).
+
+ As of release 1.9, the filters have the possibility to create other fields
+ with arbitrary names. No standard filters use this possibility yet.
+
+ There are two other elements which may be specified through the field
+ syntax, but are somewhat special:
+
+ * ext for specifying the file name extension (Ex: ext:html)
+
+ * mime for specifying the mime type. This one is quite special because
+ you can specify several values which will be OR'ed (the normal default
+ for the language is AND). Ex: mime:text/plain mime:text/html.
+ Specifying an explicit boolean operator or negation (-) before a mime
+ specification is not supported and will produce strange results.
+
+ The query language is currently the only way to use the Recoll field
+ search capability.
+
Words inside phrases and capitalized words are not stem-expanded.
- Wildcards may be used anywhere.
+ Wildcards may be used anywhere inside a term. Specifying a wild-card on
+ the left of a term can produce a very slow search.
You can use the show query link at the top of the result list to check the
exact query which was finally executed by Xapian.
@@ -873,8 +894,13 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
3.9. Document history
Documents that you actually view (with the internal preview or an external
- tool) are entered into the document history, which is remembered. You can
- display the history list by using the Tools/Doc History menu entry.
+ tool) are entered into the document history, which is remembered.
+
+ You can display the history list by using the Tools/Doc History menu
+ entry.
+
+ You can erase the document history by using the Erase document history
+ entry in the File menu.
----------------------------------------------------------------------
@@ -891,6 +917,11 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
The sort parameters stay in effect until they are explicitly reset, or the
program exits. An activated sort is indicated in the result list header.
+ Sort parameters are remembered between program invocations, but result
+ sorting is normally always inactive when the program starts. It is
+ possible to keep the sorting activation state between program invocations
+ by checking the Remember sort activation state option in the preferences.
+
----------------------------------------------------------------------
3.11. Search tips, shortcuts
@@ -984,6 +1015,8 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
* %D. Date
+ * %I. Icon image name
+
* %K. Keywords (if any)
* %L. Preview and Edit links
@@ -1002,7 +1035,7 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
The default value for the string is:
- %R %S %L %T
+ %R %S %L %T
%M %D %U
%A %K
@@ -1014,19 +1047,30 @@ More documentation can be found in the doc/ directory or at http://www.recoll.or
%A%U - %S - %L
+ Or the clean looking:
+
+ %L %R
+ %T %S
+ %U
+
+ page. Recoll 1.8.2 should build with any 0.9.x or 1.0.x
+ Xapian version (the current one is 1.0.1).
You need Qt 3.3 (or qt 4) in all cases (configure Recoll with
- configure --enable-qt4 to build with qt4).
+ configure --enable-qt4 to build with qt4, this needs
+ the qt3 support library to be present).
Recoll relies on external packages for some
of its functionality (ie: for many of the non-text file
@@ -124,9 +125,7 @@
of which is the new default index format. In order to take
advantage of the new format (which is not mandatory) Recoll
users updating from an older release need to delete their old
- index. There are more
- details in the user manual.