diff --git a/packaging/FreeBSD/recoll/pkg-plist b/packaging/FreeBSD/recoll/pkg-plist index b8a72b6d..445f1e1b 100644 --- a/packaging/FreeBSD/recoll/pkg-plist +++ b/packaging/FreeBSD/recoll/pkg-plist @@ -22,6 +22,7 @@ share/icons/hicolor/48x48/apps/recoll.png %%DATADIR%%/filters/rclxls %%DATADIR%%/images/document.png %%DATADIR%%/images/drawing.png +%%DATADIR%%/images/folder.png %%DATADIR%%/images/html.png %%DATADIR%%/images/image.png %%DATADIR%%/images/message.png diff --git a/src/excludefile b/src/excludefile index daaf01e8..683884e1 100644 --- a/src/excludefile +++ b/src/excludefile @@ -28,4 +28,5 @@ qtgui/recoll.pro recollinstall sampleconf/recoll.conf sysconf +wasabi wxgui diff --git a/website/BUGS.txt b/website/BUGS.txt index 3938183e..3a48211e 100644 --- a/website/BUGS.txt +++ b/website/BUGS.txt @@ -4,8 +4,7 @@ Bugs that are listed in an older version section are supposedly fixed in later versions. Bugs listed in the topmost section may also exist in older versions. -Latest (1.6.2): - +Latest (1.6.3): - 1.6 NEAR crashes: 1.6 has added NEAR searches. Unlike what recoll did with PHRASES, stemming expansion is performed on terms inside NEAR clauses (except if prevented by a capitalized entry of course). There is @@ -53,6 +52,11 @@ Latest (1.6.2): exception handling (recoll catches an exception while trying the yest inexistant db). +1.6.2 + - Relatively unfrequent issue with message boundary detection in mbox + files, could cause miscellaneous problems. + - Executing an external viewer for a file with single-quotes in the name + would not work. *************************************************************************** 1.5.10 - If a defaultcharset was set in the configuration file for a subdirectory, diff --git a/website/CHANGES.txt b/website/CHANGES.txt index 1c00c650..fca42ad2 100644 --- a/website/CHANGES.txt +++ b/website/CHANGES.txt @@ -1,15 +1,30 @@ CHANGES -Updating from 1.2 to 1.3 or 1.4 or 1.5: ---------------------------------------- -From version 1.3 up, there is a new feature to search specifically for file -names (with wildcard processing). If you want to take full advantage of -this, you should perform a full reindex after installing the new version -(ie: use recollindex -z, or delete ~/.recoll/xapiandb). -Also, we now use the central copies of configuration files for default -values, and the user ones only for overrides. Your old configuration files -will still work, but, you may want to remove them if they are unmodified, -or keep only the modified parameters. +1.7.0 2006-12-20 + - Email attachments are now indexed. + - Right-click menu option to access the parent document of an embedded + result (ie from mail attachment to parent message). + - The sort tool has been improved: no need to restart the query after sort + criteria change. + - Support for real-time indexing with inotify is now enabled by default + when appropriate. + - Recoll now warns when the configured native viewer can not be found and + starts an interface for chosing another one. + - Categories (text, presentation, spreadsheets, etc.) can be used instead + of raw mime types when filtering on file types in advanced search. + - The port to qt4 is functional and can be enabled with configure --enable-qt4 + - 'autophrase' option improved and may now actually be useful. + - Improved highlighting (again...) + - Display term frequencies in term explorer. + - Recollindex -e to remove data from index for listed files. + +1.6.3 + - Fixed problem with bad detection of mbox message boundaries. + Upgrading can change the message numbering in some cases, and you should + perform a full index update (recollindex -z) after installing + the new version. + - Fixed problem with execution of external viewer for files with + single-quotes in the name. 1.6.2 - Minor solaris compilation glitches only. @@ -34,6 +49,18 @@ or keep only the modified parameters. managers. - Improved recall for phrases with composite words like email addresses. + +Updating from 1.2 to 1.3 or 1.4 or 1.5: +--------------------------------------- +From version 1.3 up, there is a new feature to search specifically for file +names (with wildcard processing). If you want to take full advantage of +this, you should perform a full reindex after installing the new version +(ie: use recollindex -z, or delete ~/.recoll/xapiandb). +Also, we now use the central copies of configuration files for default +values, and the user ones only for overrides. Your old configuration files +will still work, but, you may want to remove them if they are unmodified, +or keep only the modified parameters. + 1.5.9 - Fix bad timezone conversion in email dates. Display timezone in result list dates. diff --git a/website/download.html b/website/download.html index 6e0ce69b..9a61f82f 100644 --- a/website/download.html +++ b/website/download.html @@ -54,21 +54,21 @@ decide what you may want to install.

Source

- -

Current version: - 1.6.1: recoll-1.6.1.tar.gz - See the known bugs and issues and changes.

-

recoll 1.6 has the capacity to perform proximity searches (a - bit like phrases, but unordered). There is a still unpatched - problem in Xapian 0.9.9 which will make NEAR searches fail. - If you intend to perform proximity searches, have a look at the - errata for a workaround and Xapian - patch. All the statically linked binary packages below use a - patched Xapian-core library in order for NEAR searches to work.

+

The cutting edge + Version 1.7.0: recoll-1.7.0.tar.gz brings some + nice features such as email attachment indexing, and + improvements to real-time indexing session support. See the + changes file for more detail.

+ +

Current version: + 1.6.3: recoll-1.6.3.tar.gz + See the known bugs and issues and + changes.

Older recoll releases: + 1.6.1 1.5.11. 1.5.6. 1.4.3. @@ -94,11 +94,11 @@

Mandriva 2006 (also works on 2005 and 2007) RPM: - - recoll-1.6.1-0.1.20060mdk.i586.rpm. + + recoll-1.6.3-0.1.20060mdk.i586.rpm. Source: - - recoll-1.6.1-0.1.20060mdk.src.rpm + + recoll-1.6.3-0.1.20060mdk.src.rpm

Suse 10.1 @@ -150,6 +150,9 @@ recoll port.

+

Up to date ports for recoll-1.6 and + xapian-0.9.9 (without the + NEAR patch).

diff --git a/website/features.html b/website/features.html index 41c932c5..58fe3636 100644 --- a/website/features.html +++ b/website/features.html @@ -59,7 +59,7 @@
  • html.
  • OpenOffice - files.
  • + files (needs unzip command).
  • maildir and mailbox (Support for multiple charsets. Internal processing and storage uses Unicode UTF-8.
  • -
  • Stemming performed at query time (can switch stemming - language after indexing).
  • +
  • Stemming performed at query + time (can switch stemming language after indexing).
  • Easy installation. No database daemon, web server or exotic language necessary.
  • @@ -134,7 +134,47 @@ +

    Stemming

    +

    Stemming is a process which transforms inflected words into + their most basic form. For exemple, flooring, + floors, floored would probably all be transformed + to floor by a stemmer for the English language.

    + +

    In many search engines, the stemming process occurs during + indexing. The index will only contain the stemmed form of words, + with exceptions for terms which are detected as being probably + proper nouns (ie: capitalized). At query time, the terms entered + by the user are stemmed, then matched against the index.

    + +

    This process results into a smaller index, but it has the + grave inconvenient of irrevocably losing information during + indexing.

    + +

    Recoll works in a different way. No stemming is performed at + query time, so that all information gets into the index. The + resulting index is bigger, but most people probably don't care + much about this nowadays, because they have a 100Gb disk 95% + full of binary data which does not get indexed.

    +

    At the end of an indexing pass, Recoll builds one or several + stemming dictionaries, where all word stems are listed in + correspondence to the list of their derivatives.

    + +

    At query time, by default, user-entered terms are stemmed, + then matched against the stem database, and the query is + expanded to include all derivatives. This will yield search + results analogous to those obtained by a classical engine. + The benefits of this approach is that stem expansion can be + controlled instantly at query time in several ways: +

    + diff --git a/website/index.html b/website/index.html index a84fe393..3884e36d 100644 --- a/website/index.html +++ b/website/index.html @@ -47,7 +47,7 @@

    Recoll is free, open source, and GPL-licensed. The current version is - 1.6.1

    + 1.6.3

    We borrow a lot of code from other packages, and welcome code and ideas from contributors, see the + diff --git a/website/rclidxfmt.html b/website/rclidxfmt.html index 6e79daa2..41b330de 100644 --- a/website/rclidxfmt.html +++ b/website/rclidxfmt.html @@ -7,7 +7,10 @@

    Recoll index format details

    -

    Special (capitalized) terms:

    +

    Terms are not stemmed before being stored. They are turned to + all minuscule letters with no accents.

    + +

    Special prefixed terms: