diff --git a/website/CHANGES.html b/website/CHANGES.html
index 7222e7a8..bf7a81b9 100644
--- a/website/CHANGES.html
+++ b/website/CHANGES.html
@@ -1,4 +1,4 @@
-
+
Recoll changes
@@ -6,10 +6,10 @@
+ "recoll is a simple full-text search system for unix and linux
+ based on the powerful and mature xapian engine">
+ "full text search, desktop search, unix, linux">
@@ -31,823 +31,872 @@
Recoll journal of user-visible changes
-
- Provide a set of configuration defaults so that compilation has a
- chance to succeed on unknown systems.
- Install icon to the pixmaps directory.
- Fixes stemming, which was broken for all previous 1.13
- releases.
- Bugs fixed between 1.13.02
- and 1.13.04.
-
-
- This version has a single fix to work around a problem in the
- Qt 4.6.1 uic utility. If you are not using Qt 4.6.1 and are
- currently running Recoll 1.13.01, you do not need to
- upgrade.
-
-
-
+
- - Recoll has a new class of persistent external filters
- with the capability to process several documents, or
- multi-document files, in the same instance. Benefits: much
- faster image tag indexing, and new file formats. Except for
- the Perl image tag filter (because of ExifTool), the new
- filters are written in Python.
-
-
-
- New file formats: chm (microsoft help), zip archives, .ics
- calendar files. Individual pages in chm files are indexed and
- can be previewed. Zip is quite convenient for maildir
- archives (for example).
-
- - Recoll can now use the output of the Beagle Firefox plugin
- to index visited web pages and bookmarks. This is only usable
- if Beagle itself is not running, else Recoll and Beagle will be
- fighting for the same queue.
-
- - Big text files (like application logs) can now be paged for
- indexing, avoiding excess memory usage during indexing and
- improving the usability at query time. They can also be
- altogether skipped by setting a maximum size configuration
- parameter. These parameters have default values (1 MB and 20
- MB) which change Recoll behaviour compared to previous
- versions. You can set textfilepagekbs
- and textfilemaxmbs to -1 in the configuration to
- restore the old behaviour.
-
- - A cache was implemented for mbox message header offsets. This
- speeds up message previews for big mbox files.
-
- - Miscellaneous usability improvements:
-
- - Allow using page-up/down and shift-home to scroll the
- result list while the focus is in the search entry.
- - Make 'Use desktop preferences' the default for new
- Recoll installations, and make this choice more
- prominent in the external viewer dialog.
- - ^P starts the print dialog on a preview window.
- - If a search has no result, alternate spellings are
- suggested. This feature is still a bit raw and will be
- improved.
- - If the text of a document is empty, preview will switch to
- displaying the document fields.
- - New entry in the result list contextual menu for opening
- the parent document of a result list hit with its native
- application. Useful for exemple for pages inside chm files.
-
-
-
- - Indentation is now preserved when displaying text documents
- inside the preview window. This is particularly welcome for
- program source files.
-
- - Allow substituting arbitrary fields in the result
- paragraph, using a %(fieldname) syntax
-
- - The real-time indexing monitor will now accumulate
- modifications during 30 S before indexing.
-
- - The indexer can now split camelCase words, allowing search on
- component terms. This is not enabled by default as it can
- confuse phrase searches (ie: "MySQL manual" is matched by
- phrase queries for "my sql manual" and "MySQL manual"
- but not "mysql manual"). Use "configure --enable-camelcase"
- to activate it.
-
-
- - The ipath is now printed by default after the url in the
- default result list format.
-
- - recoll_noindex and skippedNames can now be
- changed at any point in the tree (only for topdirs previously).
-
- - Allow using location/application sensitivity in external viewer
- choice. This uses several new functions:
-
- - Allow the substitution of arbitrary document fields inside
- external viewer command line arguments.
- - Allow field values to be set on all documents
- in a file system subtree. For example, you can
- set an application tag (ie: rclaptg = gnus) on all mailbox
- files under a specific directory.
- - New syntax in mimeview for including the rclaptg field in
- viewer choice
- (mimetype|tagvalue = ...).
-
-
-
- - Allow specifiying a specific default character set for mail
- messages. This is mainly useful for readpst dumps. All
- reasonable non-ascii messages specify their character set.
-
- - Added a --without-gui configure option. Removes all X11 and
- Qt dependancies and only compiles the command-line interface.
-
- - Improved the kio_recoll build. There is no need to run
- configure manually in the main directory any more. Ubuntu
- packages for kio_recoll are now built on the
-
- recoll-backports PPA on launchpad.net.
-
+ -
+ date selection in queries.
+ - Pure negative queries (ie: -someterm date:P10D/.
+ - Autosuffs: option to automatically turn words into ext:
+ clauses (ie: xls -> ext:xls) (GUI preferences
+ panel).
+ - Allow extracting arbitrary mail headers and use them as
+ index/search fields (configured in the fields
+ file).
+ - nonumbers configuration parameter: disable
+ indexing of all numbers, useful for some data files with lots
+ of numerical data.
+ - Shortcuts for the results page: PageUp/Down can
+ be used even when the focus is in the search
+ entry. Shift+Home: back to first page of results.
+ Ctrl+Shift+s: return focus to the search
+ entry.
+ - Add full screen mode for small devices.
+ - Added -i option to recollq to specify extra indexes.
+ - Filter for fictionbook (.fb2) documents.
+ - Cleaned up the Python samples and made recollq.py a usable
+ clone of recollq.
+ - Errors when opening additional indexes for a query are now
+ fatal. They could easily go unnoticed before.
+ - Proper LARGEFILE support.
+ - Use xsltproc instead of misc dirty tricks to
+ extract test from most current XML-based documents (except
+ those in which the XML is too broken).
+ - Implement configure --enable-pic and use it for
+ the KIO slave and Python and PHP modules.
-
-
- Bugs fixed:
+ Bugs also fixed in the 1.13 branch:
- - Qt4 version only: the search inside the preview window
- could become unbearably slow for big documents (quadratically
- so), and could not be interrupted (Qt bug). The Qt3 version of
- the code was included in the preview tool to restore good
- performance. This bug is the main reason for this release.
-
- Build system improvements:
-
- - Perform minimal base package configuration inside the kio
- cmake code to permit building it from scratch (without a build
- of the main code). Mainly useful for builds on the Ubuntu
- PPA.
- - Implement a --without-gui option to build a pure
- command-line version with no Qt or X11 dependancies.
- - Ensure that the user's PATH settings determine where we
- look first for qmake in all cases.
+ - The filename (transcoded file name) field
+ could not be stored, so it could not be displayed in the
+ result list.
+ - Html files would always be indexed even when filtered
+ out by indexedmimetypes
+ - Preview: toggling between main text and metadata
+ display would confuse the text format.
+ - Restore indexallfilenames=0 functionality.
-
- This is a bug fix release.
-
-
- This is a very minor release, mainly to fix compilation
- issues and a few very minor bugs. No need to upgrade if
- you don't experience these.
-
- - Fixed compilation errors for new gcc and gnu libc.
- - Use groff html output in rclman to get rid of control
- characters in output (improve manual pages indexing). Fix
- 8bit character issues in file names in rcllyx.
- - Fixed command line arguments processing problem with
- "recoll -q"
-
+
+ This is a very minor release, mainly to fix compilation
+ issues and a few very minor bugs. No need to upgrade if
+ you don't experience these.
+
+ - Fixed compilation errors for new gcc and gnu libc.
+ - Use groff html output in rclman to get rid of control
+ characters in output (improve manual pages indexing). Fix
+ 8bit character issues in file names in rcllyx.
+ - Fixed command line arguments processing problem with
+ "recoll -q"
+
-
-
- - Recoll now implements a KIO slave to allow searching
- directly from KDE applications. This does not affect the
- main application and is not enabled by default (go to the
- kde/kio/recoll source directory for build
- instructions).
- - Recoll now computes md5 checksums for all indexed
- documents and optionally collapses duplicate entries inside
- the result list. This needs a full reindex to become
- effective for older documents already in the index. The
- option to activate collapsing is in the Query
- Configuration.
- - Typing F1 anywhere in the GUI should bring up the
- appropriate section of the manual in the application
- configured for viewing HTML documents.
- - The result list right click menu now has an entry to
- save the document to a file. This is only enabled for
- documents contained inside another file (ie, messages inside
- an mbox folder, or attachments), and is especially useful for
- extracting an attachment with no associated external
- editor.
- - The preview window now has a right-click menu, with an
- entry to toggle between viewing the main text or all the
- metadata for the document. This is most useful in the case
- where the search match actually occurred in a field not
- visible in the main text (ie: author or HTML title).
- - Words glued by an underscore character like
- compound_word are now split during indexing, and
- will be found when queried either as themselves or in a
- search for the components.
- - There is now a size limit over which no attempt will be made to
- uncompress/identify/index compressed files. Not active by
- default, to be set in the Indexing Configuration.
- - Added support for fetching field values from extended file
- attributes. This is not enabled by default, use
- configure --enable-xattr. You'll also need to
- set up a map from the attributes names to the Recoll field
- names (see comment at the end of the fields
- configuration file.
-
+
+
+ - Recoll now implements a KIO slave to allow searching
+ directly from KDE applications. This does not affect the
+ main application and is not enabled by default (go to the
+ kde/kio/recoll source directory for build
+ instructions).
+ - Recoll now computes md5 checksums for all indexed
+ documents and optionally collapses duplicate entries inside
+ the result list. This needs a full reindex to become
+ effective for older documents already in the index. The
+ option to activate collapsing is in the Query
+ Configuration.
+ - Typing F1 anywhere in the GUI should bring up the
+ appropriate section of the manual in the application
+ configured for viewing HTML documents.
+ - The result list right click menu now has an entry to
+ save the document to a file. This is only enabled for
+ documents contained inside another file (ie, messages inside
+ an mbox folder, or attachments), and is especially useful for
+ extracting an attachment with no associated external
+ editor.
+ - The preview window now has a right-click menu, with an
+ entry to toggle between viewing the main text or all the
+ metadata for the document. This is most useful in the case
+ where the search match actually occurred in a field not
+ visible in the main text (ie: author or HTML title).
+ - Words glued by an underscore character like
+ compound_word are now split during indexing, and
+ will be found when queried either as themselves or in a
+ search for the components.
+ - There is now a size limit over which no attempt will be made to
+ uncompress/identify/index compressed files. Not active by
+ default, to be set in the Indexing Configuration.
+ - Added support for fetching field values from extended file
+ attributes. This is not enabled by default, use
+ configure --enable-xattr. You'll also need to
+ set up a map from the attributes names to the Recoll field
+ names (see comment at the end of the fields
+ configuration file.
+
-
-
+
+
- - Bugs fixed:
- check the list.
+ - Bugs fixed:
+ check the list.
- - The right-click menu "Copy" commands inside the result list
- now copy to the clipboard in addition to the main selection,
- enabling subsequent ^v commands.
+ - The right-click menu "Copy" commands inside the result list
+ now copy to the clipboard in addition to the main selection,
+ enabling subsequent ^v commands.
-
+
-
+
- Recoll release 1.11 has relatively extensive changes that have
- necessitated a modification of the index format. Hence installing this
- release implies a full re-indexing, which is enforced by the
- software.
+ Recoll release 1.11 has relatively extensive changes that have
+ necessitated a modification of the index format. Hence installing this
+ release implies a full re-indexing, which is enforced by the
+ software.
-
- - Filtering on category (message/text/media etc.) as a function of
- the main window for quick access.
+
+ - Filtering on category (message/text/media etc.) as a function of
+ the main window for quick access.
- - Use html for preview when available (ex: html files or "colorized"
- python) instead of converting to text. This can be turned of in the
- preferences.
+ - Use html for preview when available (ex: html files or "colorized"
+ python) instead of converting to text. This can be turned of in the
+ preferences.
- - New Python query and index interfaces. The Python query
- interface will be used for building a Xesam adapter for
- Recoll when the specification is stabilized, and could be
- useful for other things, such as indexing contents from an
- RDBMS (see
-
- the manual for details). Restructured and cleaned up
- internal Recoll interfaces.
+ - New Python query and index interfaces. The Python query
+ interface will be used for building a Xesam adapter for
+ Recoll when the specification is stabilized, and could be
+ useful for other things, such as indexing contents from an
+ RDBMS (see
+
+ the manual for details). Restructured and cleaned up
+ internal Recoll interfaces.
- - Improved filter framework. Can now process either html or text output
- from the filters, and more easily execute "raw" commands instead of
- Recoll scripts. Avoided wasteful repeated execution of filters for
- which the helper application is missing.
+ - Improved filter framework. Can now process either html or text output
+ from the filters, and more easily execute "raw" commands instead of
+ Recoll scripts. Avoided wasteful repeated execution of filters for
+ which the helper application is missing.
- - Query language now closer to Xesam specification, (but
- still far from a
- complete implementation). See the Recoll manual and
-
- http://www.xesam.org/main/XesamUserSearchLanguage
+ - Query language now closer to Xesam specification, (but
+ still far from a
+ complete implementation). See the Recoll manual and
+
+ http://www.xesam.org/main/XesamUserSearchLanguage
- - Much improved configuration for fields. Fields like
- "author" can now be specified as storable (displayable in
- results) and/or indexed (searchable). Added alias facility
- for translating from user-level names to internal.
+ - Much improved configuration for fields. Fields like
+ "author" can now be specified as storable (displayable in
+ results) and/or indexed (searchable). Added alias facility
+ for translating from user-level names to internal.
- - Added "recipient" as an indexed/searchable field for emails.
+ - Added "recipient" as an indexed/searchable field for emails.
- - rcltext filter for processing text such as C code for which no specific
- processing is needed when indexing but a specific viewer is
- desired.
-
+ - rcltext filter for processing text such as C code for which no specific
+ processing is needed when indexing but a specific viewer is
+ desired.
+
-
-
- - Fix a simple and mildly nasty bug that would cause the
- indexer to stop
- indexing an mbox on encountering a specific but not exceptional error
- condition (like a few dozen errors while indexing attachments for which
- no filter was installed).
-
+
+
+ - Fix a simple and mildly nasty bug that would cause the
+ indexer to stop
+ indexing an mbox on encountering a specific but not exceptional error
+ condition (like a few dozen errors while indexing attachments for which
+ no filter was installed).
+
-
-
- - Ensure that file names indexed as terms don't overflow the maximum term
- size.
+
+
+ - Ensure that file names indexed as terms don't overflow the maximum term
+ size.
- - Handle non-standard date format in mbox separator lines sometimes
- generated by thunderbird.
+
- Handle non-standard date format in mbox separator lines sometimes
+ generated by thunderbird.
-
- Use attachment file names to help identify a better mime type for
- parts only described as application/octet-stream
+
- Use attachment file names to help identify a better mime type for
+ parts only described as application/octet-stream
-
- For Phrase/Near searches, highlight all term groups in preview, not just
- the first
+
- For Phrase/Near searches, highlight all term groups in preview, not just
+ the first
-
- Added Open XML filters
+
- Added Open XML filters
-
-
-
+
+
- - Fixed openSuse 11 compile issues.
+
- Fixed openSuse 11 compile issues.
-
- Fixed bug in interpreting email mime structure, which resulted in base-64
- decoding errors.
+
- Fixed bug in interpreting email mime structure, which resulted in base-64
+ decoding errors.
-
- Fixed "Prev" button in preview window. Would actually go forward when
- walking the search terms.
+
- Fixed "Prev" button in preview window. Would actually go forward when
+ walking the search terms.
-
- Allow setting the highlight color for search terms in result list and
- preview (yes: feature change, should have waited for major release...)
+
- Allow setting the highlight color for search terms in result list and
+ preview (yes: feature change, should have waited for major release...)
-
- Added svg filter
+
- Added svg filter
-
-
-
+
+
- - Ensure that in case the data of a file can't be indexed because of some
- error, at least the file name is indexed.
+
- Ensure that in case the data of a file can't be indexed because of some
+ error, at least the file name is indexed.
-
- Improve query language to support OR queries of terms with field
- specifications (ie: title:someterm OR author:someauthor).
+
- Improve query language to support OR queries of terms with field
+ specifications (ie: title:someterm OR author:someauthor).
-
- Fix filename search to split patterns on white space, so that
- a "*.jpg *.jpeg" search does what's expected. Means you now need to use
- double-quotes if there is actual embedded white space.
+
- Fix filename search to split patterns on white space, so that
+ a "*.jpg *.jpeg" search does what's expected. Means you now need to use
+ double-quotes if there is actual embedded white space.
-
- Jump directly to the external editor choice dialog instead of opening
- preferences when an external viewer is not found.
+
- Jump directly to the external editor choice dialog instead of opening
+ preferences when an external viewer is not found.
-
- Allow stopping indexing through menu action (only works with qt4 for now).
+
- Allow stopping indexing through menu action (only works with qt4 for now).
-
- Create an "indexedmimetypes" configuration variable to allow explicitely
- restricting the file types which do get indexed.
+
- Create an "indexedmimetypes" configuration variable to allow explicitely
+ restricting the file types which do get indexed.
-
-
-
+
+
- - Added a GUI dialog to configure the indexing parameters.
+
- Added a GUI dialog to configure the indexing parameters.
-
- Added better support for indexing CJK text (Chinese, Japanese, Korean).
- Please note that:
- - You will need a full reindex to take good advantage of this. (You
- *don't* need to reindex if you don't need to search CJK, even if there
- is some in your index).
- - When entering CJK search terms, words (single or multiple characters)
- should be separated with white space.
- - The specific CJK processing can be turned off by setting the nocjk
- variable to true in the configuration file (this may make sense if you
- have a mixed cjk/other document base and don't want to index the cjk
- part, as it will save some disk space and a minuscule amount of cpu).
+
- Added better support for indexing CJK text (Chinese, Japanese, Korean).
+ Please note that:
+ - You will need a full reindex to take good advantage of this. (You
+ *don't* need to reindex if you don't need to search CJK, even if there
+ is some in your index).
+ - When entering CJK search terms, words (single or multiple characters)
+ should be separated with white space.
+ - The specific CJK processing can be turned off by setting the nocjk
+ variable to true in the configuration file (this may make sense if you
+ have a mixed cjk/other document base and don't want to index the cjk
+ part, as it will save some disk space and a minuscule amount of cpu).
-
- Changed the way Recoll handles searches including composite words (like
- an email address). The new approach looks saner, but could have
- side-effects, please report any problems in this area.
+
- Changed the way Recoll handles searches including composite words (like
+ an email address). The new approach looks saner, but could have
+ side-effects, please report any problems in this area.
-
- The query language got a new "dir:" specifier to filter results on location.
+
- The query language got a new "dir:" specifier to filter results on location.
-
- New rclimg perl filter for better indexing of picture tags, thanks to
- Cedric Scott. This depends on Exiftool.
-
- New rcltex filter.
+
- New rclimg perl filter for better indexing of picture tags, thanks to
+ Cedric Scott. This depends on Exiftool.
+
- New rcltex filter.
-
- Changed and improved how the preview window local search finds the
- query terms, this does not involve weird characters any more. The
- display is cleaner and cut and paste works better.
+
- Changed and improved how the preview window local search finds the
+ query terms, this does not involve weird characters any more. The
+ display is cleaner and cut and paste works better.
-
- Fixed the fact that a newline-separated word list in simple search would
- wrongly trigger a phrase search.
+
- Fixed the fact that a newline-separated word list in simple search would
+ wrongly trigger a phrase search.
-
- Fixed the way we input text to the preview textedit (the old way would
- sometimes confuse the window into displaying tags instead of acting on
- them).
+
- Fixed the way we input text to the preview textedit (the old way would
+ sometimes confuse the window into displaying tags instead of acting on
+ them).
-
- Fixed transcoding to utf-8 for text/plain email attachments
+
- Fixed transcoding to utf-8 for text/plain email attachments
-
- Improved mbox From_ line detection
+
- Improved mbox From_ line detection
-
- Added indexedmimetypes variables to allow restricting the list of indexed
- mime types.
+
- Added indexedmimetypes variables to allow restricting the list of indexed
+ mime types.
-
- KDE kicker applet: start a recoll search from the panel and get a
- Recoll window. This is a clone from the find_applet, originally meant to
- start a Tracker search. Not so useful presently because it will start a
- new Recoll instance for every search. Not part of the main source (the
- configure script is a whopping 1MB...), linked from the download page.
-
- Added recoll command line options to define a query and execute it
- immediately when the program starts. This is used in practice from the
- applet and could be used from other programs. There is a also a new
- option to not start the GUI and print the results to stdout.
+
- KDE kicker applet: start a recoll search from the panel and get a
+ Recoll window. This is a clone from the find_applet, originally meant to
+ start a Tracker search. Not so useful presently because it will start a
+ new Recoll instance for every search. Not part of the main source (the
+ configure script is a whopping 1MB...), linked from the download page.
+
- Added recoll command line options to define a query and execute it
+ immediately when the program starts. This is used in practice from the
+ applet and could be used from other programs. There is a also a new
+ option to not start the GUI and print the results to stdout.
-
-
-
- - Incompatible change: the icon image reference is now part of the result
- list paragraph format string:
- - If you had a standard config, you need do nothing.
- - If you had a custom format string, you need to add
-
at its beginning to get the same result as
- before.
- - If you had unchecked the "show icons" option, you need to remove the
- above string from the paragraph format to make the icons go away.
- Changes to the format string are performed in the
- "Preferences->Query Configuration->User Interface" dialog tab.
-
- - New filters: wordperfect, abiword and kword, rcljpeg, rclflac, rclogg
- (contributed filters). The jpeg and audio filters should be extended to
- make use of the new field indexing/search capability (hint :) )
-
-
- When searching for an empty string inside the preview window, position
- the window to the next occurrence of a primary search term.
-
-
- Added ext: and mime: selectors to the query language.
-
-
- Added an adjustable flush threshold during indexing: should help control
- memory usage. See the idxflushmb configuration variable.
-
-
- Added a check for file system free space. Indexing will stop if the
- threshold is reached. See the maxfsoccuppc configuration parameter.
-
-
- Added 'followLinks' configuration option to have the indexer follow
- symbolic links while walking the tree (the default is false).
-
-
- Allow symbolic links as 'topdirs' members. These are always followed.
-
-
- Add preference option to remember sort tool state between program
- invocations (it is reset to inactive by default)
-
-
- Added File menu entry to erase document history.
-
-
- Bound the space and backspace keys to PgUp/PgDown in preview.
-
-
- (Hopefully) Improved abstract (keyword in context) generation
-
-
- Added support for arbitrary fields. Filters can now produce any number of
- fields which will be selectively searchable through the query
- language. This could be useful, for exemple, for the mp3 and jpeg filters
- (but it is not currently used).
-
-
- Improved qt4 build: no more need for --enable-qt4. Note: the qt4 build
- still needs the qt3 support library.
-
-
- Changed the icon to an ugly one. The previous one was nicer but looked
- too much like Xapian's.
-
-
- Added some kind of support for a stopword list.
-
-
- Have email attachments inherit date and author from their parent message
- (instead of mail folder).
-
-
- Fix bus error on rclmon exit
-
-
- Better handling of aspell errors inside rclmon
-
-
- Fixed a number of qt4 glitches: selection and keyboard shortcuts.
-
-
- New query configuration parameter to set the maximum text size beyond
- which text won't be hilighted before preview (takes too much time). This
- was a fixed value in 1.8.
-
-
-
-
- - Fixed method name for compatibility with xapian 1.0.0
-
- Add .beagle to default list of skipped names (avoids indexing beagle
- document cache...)
-
- Fix configure.ac to use $libdir instead of /usr/lib
-
- Fix recollinstall to properly copy translations and pictures for qt4
-
-
-
-
- - Add a small query language with some field-based searches (author, title,
- etc.)
-
- Add wildcard handling everywhere. *, ?, [] can be used in any
- search. Warning: using a wild card at the left of a term can make
- for a very slow search.
-
- Allow skipping specific paths during indexing (in addition to file name
- patterns)
-
- Improved external index choice dialog, accessible from the top-level menu.
-
- Many small bugs fixed: stemming language choice ignored in term explorer,
- qt4 preview window reentrancy crashes, issues with saving the default
- advanced search file, type filter, display more clearly missing helper
- errors, etc.
-
- Option to use the desktop defaults (with xdg-open) to choose the native
- viewer for files (instead of recoll's mimeview).
-
-
-
-
- - Fixes an issue with the openoffice filter on debian systems.
-
- Adds Scribus and Lyx filters.
-
-
-
-
- - Fixes two email indexing bugs in 1.7.3, which would bail out from an
- mbox folder on the first attachment filtering error, and would decline
- to handle multipart/signed bodies. You may need to run a full indexing
- pass (recollindex -z), to force reindexing of old folders.
-
-
-
-
- - Email attachments are now indexed.
-
- Right-click menu option to access the parent document of an embedded
- result (ie from mail attachment to parent message), or the parent folder
- of a given file (which is opened with the application configured for
- directories)
-
- The sort tool has been improved: no need to restart the query after sort
- criteria change.
-
- Support for real-time indexing with inotify is now enabled by default
- when appropriate.
-
- Recoll now warns when the configured native viewer can not be found and
- starts an interface for chosing another one.
-
- Categories (text, presentation, spreadsheets, etc.) can be used instead
- of raw mime types when filtering on file types in advanced search.
-
- The port to qt4 is functional and can be enabled with configure --enable-qt4
-
- 'autophrase' option improved and may now actually be useful.
-
- Improved highlighting (again...)
-
- Display term frequencies in term explorer.
-
- Recollindex -e to remove data from index for listed files.
-
- Directory names now indexed. Directories can be 'edited' with the
- configured application (rox by default)
-
-
-
-
- - Fixed problem with bad detection of mbox message boundaries.
- Upgrading can change the message numbering in some cases, and you should
- perform a full index update (recollindex -z) after installing
- the new version.
-
- Fixed problem with execution of external viewer for files with
- single-quotes in the name.
-
-
-
- - Minor solaris compilation glitches only.
-
-
-
- - Term explorer: a multimode wildcard-regexp-spell/phonetic tool to search
- the index for terms. This uses aspell for the orthographic/phonetic part.
-
- A more dynamic advanced search window. You now have a choice of the top
- level conjunction (OR/AND) and of any number of clauses, including NEAR
- and PHRASE clauses with an adjustable proximity parameter.
-
- User-settable format for the result-list entries, which use an HTML
- string with %xx printf-like replacements (accessible from the user
- preferences).
-
- Real time monitoring/indexing support. This is not configured by
- default, and must be specified at build time (configure --help).
-
- Improved phrase/group highlighting in abstracts and preview
-
- Better sample selection for synthetic abstracts.
-
- Improved performance of the text splitter, good for indexing and previewing.
-
- Shift+click link to open new preview window instead of tab in existing
- window.
-
- The key sequence for term completion in the simple search entry was
- changed from CTRL+TAB to "Escape Space" to avoid interaction with window
- managers.
-
- Improved recall for phrases with composite words like email addresses.
-
-
- Updating from 1.2 to 1.3 or 1.4 or 1.5:
-
- --------------------------------------
- From version 1.3 up, there is a new feature to search specifically for file
- names (with wildcard processing). If you want to take full advantage of
- this, you should perform a full reindex after installing the new version
- (ie: use recollindex -z, or delete ~/.recoll/xapiandb).
- Also, we now use the central copies of configuration files for default
- values, and the user ones only for overrides. Your old configuration files
- will still work, but, you may want to remove them if they are unmodified,
- or keep only the modified parameters.
-
-
-
-
- - Fix bad timezone conversion in email dates. Display timezone in result
- list dates.
-
-
-
-
- - Fix stored and displayed dates which used to come from the file's ctime,
- now use mtime (which was already used for deciding re-indexing).
-
- Fix problem with some weird MIME messages (with null boundaries) which
- crashed the indexer.
-
-
-
-
- - Small fixes dealing with the build process or compiler issues.
- 1.5.6 has updated ukrainian and russian messages.
- Otherwise no functional changes, and no need to upgrade from 1.5.1
-
-
-
-
- - Fix serious bug with non ascii strings in simple search history
-
- Improve synthetic abstracts: remove size limitations, handle overlapping
- extracts, avoid printing several terms from the same position.
-
-
-
-
-
- - Added support for powerpoint and excel files, with the catdoc package.
-
- Allow viewing consecutive documents from the result list inside a single
- preview window using the shift-arrow-up and shift-arrow-down keys.
-
- Colorize search terms in abstracts in the result list.
-
- A number of elements are now remembered between program invocations:
- sort criteria, list of ignored file types (always starts inactive),
- subtree restriction, better handling of the recent searches listbox, the
- buildAbstract and replaceAbstract settings are not forgotten any more.
-
- New option to automatically add a phrase to simple searches.
-
- Possibility to adjust the length and context width for synthetic abstracts.
-
- Handle weird html better.
-
- When indexing mail messages, walk the full mime tree instead of staying
- at the top level, index all text parts and attachement file names.
-
- Add -c option to recoll and recollindex to specify the
- configuration directory on the command line
-
- Better synchronization between the active preview and the highlighted
- paragraph inside the list
-
- Improved recall for some special cases of stemming.
-
- Much better handling of email dates, allowing better email sorting by
- date (previously the message date was quite often the date when the file
- was indexed).
-
- Store the external database lists in the configuration directory, not the
- qt preferences.
-
- Ensure dialogs are sized according to font size
-
-
-
-
-
- - Multiple search databases.
-
- Optionally auto-search when a word is entered in the simple search
- field.
-
- Show possible term completions in simple search by typing CTRL+TAB
-
- Add 'more like this' option to result list right-click menu, to look for
- documents related to the current result.
-
- Double-click in preview or result list adds the selected word to the
- simple search text field.
-
- The simple search text entry field is now a combobox and remembers
- previous searches.
-
- Additional OR field in complex search.
-
- Improved indexing cancellability (interrupting recollindex or closing
- recoll with an indexing thread active), and status reporting.
-
- Fixed filters to handle file paths with embedded spaces.
-
- Misc small bug and memory leaks fixes.
-
- More compact result list.
-
- Set mode 0700 on .recoll directory by default
-
-
-
-
- - Implement specific search on file names with wildcard
- support. Indexing can optionally process all file names or only those
- with mime types supported for normal indexing. UPDATING: you need a
- full re-indexing to take advantage of this.
-
- Use links and a right-click popup menu to replace confusing use of
- mouse clicks and double-clicks inside the result list.
-
- The 'example' configuration files are now used as default, and are not
- copied any more to the user directory during installation. Overrides can
- be set in the personal files for any value that the user wishes to
- modify, with unchanged formats and file names (so that the files from
- previous versions remain valid, but you may wish to trim them of values
- that duplicate the central ones).
-
- Use NLS information (LC_CTYPE, LANG) do determine default charset when
- possible.
-
- Mp3 file indexing, either filenames only or also id3 tags if id3info is
- available. c/c++ ext edit. Use gnuclient instead of xemacs for text files.
-
- Russian and Ukrainian translations and many improvement ideas thanks to
- Michael Shigorin.
-
-
-
-
- - Added support for dvi (with dvips), and dvu (with DjVuLibre).
-
- Ensure that configure and make use the same qt version.
-
- Fix sorted sequence title display.
-
- Discriminate fatal errors and missing docs while loading a doc list.
-
- Improved and cleaned up way to position a preview on the first search term.
-
-
-
-
- - Fix minor compilation glitches (FreeBSD 4, QT 3.1, xapian-config problem)
-
-
-
-
- - Improved preview loading: don't highlight very big documents (over 1Mb),
- allow cancellation while loading.
-
- Abstracts generated in the result list by looking at search term
- contexts. This can slow down result list display for big documents, and
- can be turned off in the preferences menu.
-
- Wrap query detail line displayed when clicking on result list header.
-
- Text splitting cleanup with less spurious terms should result in
- slightly smaller databases.
-
- Sligthly improved presentation in preview, esp. line breaks.
-
- Color icons...
-
- Let the user select the html browser used for help display.
-
- autoconf/Makefile change: allow building UI from inside the qtgui
- directory.
-
- autoconf/Makefile: improved search and diagnostics for qt/qmake.
-
- Internal code cleanup for maintainability: text splitting, user
- interface.
-
- Added prototype kio_slave to show result inside Konqueror, doesn't seem
- particularly useful.
-
-
-
-
- - A much better user manual, which can be browsed from the help menu.
-
- man pages for recoll, recollindex, recoll.conf
-
- User/query interface configuration dialog.
-
- Click on result list header will display the exact boolean search which
- was used.
-
- recollindex can be used to create stem expansion databases independantly
- of a full indexing pass.
-
- Misc user interface improvements, like an 'all terms' checkbox for
- simple search.
-
- Fixed case-insensitivity issues. Probably needs more testing.
-
-
-
-
-
- - Minor installation tweaks for rpm compatibility
-
-
-
-
- - Fix problems with prefix != /usr/local
-
- Remove '.*' from the default list of ignored file/dir names: this
- prevented mozilla/thunderbird mail indexing.
-
- Fix some 64 bits issues
-
-
-
-
- - Small changes for FreeBSD 4 compilation.
-
-
-
-
- - Install of recollinstall program not done or needed any more.
-
-
-
-
- - Fixed nasty html parsing bug introduced in 1.0.9 Html parsing failed
- whenever the document charset name differed from the default only in
- character case or punctuation.
-
-
-
-
- - Create personal configuration on first start.
-
- Use qt toolbars.
-
- Also index terms in file paths.
-
- Tool for sorting on dates or mime types.
-
- Fixed pdf filter which was broken by more recent xpdf
-
- Filters now installed/executed from /usr/local
-
-
-
-
-
- - Added tool to manage the history of consulted documents.
-
- Try harder to convert email messages with wrongly declared charsets.
-
- Add option to reset the database before indexing (easier than rm -rf).
-
- Small gui improvements.
-
- Install partial french translation as a tease for future translaters...
-
-
-
-
-
- - Fixed 2 really ennoying bugs in 1.0.8: wouldn't preview 2nd document
- from same file + spurious db close when filter could not be executed.
-
-
-
-
-
- - Add support for rtf and gaim logs
-
- Optionally show icons to indicate mime types in result list
-
- Better (but imperfect) feedback during the preview
- loading for big files
-
- Remember main window geometry when closing
-
- Fix stem expansion in advanced search
-
- Some autoconf
-
- Option to use the system's 'file' command as a final step of
- identification for suffix-less or unknown files.
-
- Typo had removed support for .Z compression
-
- Use more appropriate conjonction operators when computing the advanced
- search query (OP_AND_MAYBE, OP_FILTER instead of OP_AND)
-
+
+
+
+ - Incompatible change: the icon image reference is now part of the result
+ list paragraph format string:
+ - If you had a standard config, you need do nothing.
+ - If you had a custom format string, you need to add
+
at its beginning to get the same result as
+ before.
+ - If you had unchecked the "show icons" option, you need to remove the
+ above string from the paragraph format to make the icons go away.
+ Changes to the format string are performed in the
+ "Preferences->Query Configuration->User Interface" dialog tab.
+
+ - New filters: wordperfect, abiword and kword, rcljpeg, rclflac, rclogg
+ (contributed filters). The jpeg and audio filters should be extended to
+ make use of the new field indexing/search capability (hint :) )
+
+
- When searching for an empty string inside the preview window, position
+ the window to the next occurrence of a primary search term.
+
+
- Added ext: and mime: selectors to the query language.
+
+
- Added an adjustable flush threshold during indexing: should help control
+ memory usage. See the idxflushmb configuration variable.
+
+
- Added a check for file system free space. Indexing will stop if the
+ threshold is reached. See the maxfsoccuppc configuration parameter.
+
+
- Added 'followLinks' configuration option to have the indexer follow
+ symbolic links while walking the tree (the default is false).
+
+
- Allow symbolic links as 'topdirs' members. These are always followed.
+
+
- Add preference option to remember sort tool state between program
+ invocations (it is reset to inactive by default)
+
+
- Added File menu entry to erase document history.
+
+
- Bound the space and backspace keys to PgUp/PgDown in preview.
+
+
- (Hopefully) Improved abstract (keyword in context) generation
+
+
- Added support for arbitrary fields. Filters can now produce any number of
+ fields which will be selectively searchable through the query
+ language. This could be useful, for exemple, for the mp3 and jpeg filters
+ (but it is not currently used).
+
+
- Improved qt4 build: no more need for --enable-qt4. Note: the qt4 build
+ still needs the qt3 support library.
+
+
- Changed the icon to an ugly one. The previous one was nicer but looked
+ too much like Xapian's.
+
+
- Added some kind of support for a stopword list.
+
+
- Have email attachments inherit date and author from their parent message
+ (instead of mail folder).
+
+
- Fix bus error on rclmon exit
+
+
- Better handling of aspell errors inside rclmon
+
+
- Fixed a number of qt4 glitches: selection and keyboard shortcuts.
+
+
- New query configuration parameter to set the maximum text size beyond
+ which text won't be hilighted before preview (takes too much time). This
+ was a fixed value in 1.8.
+
+
+
+
+ - Fixed method name for compatibility with xapian 1.0.0
+
- Add .beagle to default list of skipped names (avoids indexing beagle
+ document cache...)
+
- Fix configure.ac to use $libdir instead of /usr/lib
+
- Fix recollinstall to properly copy translations and pictures for qt4
+
+
+
+
+ - Add a small query language with some field-based searches (author, title,
+ etc.)
+
- Add wildcard handling everywhere. *, ?, [] can be used in any
+ search. Warning: using a wild card at the left of a term can make
+ for a very slow search.
+
- Allow skipping specific paths during indexing (in addition to file name
+ patterns)
+
- Improved external index choice dialog, accessible from the top-level menu.
+
- Many small bugs fixed: stemming language choice ignored in term explorer,
+ qt4 preview window reentrancy crashes, issues with saving the default
+ advanced search file, type filter, display more clearly missing helper
+ errors, etc.
+
- Option to use the desktop defaults (with xdg-open) to choose the native
+ viewer for files (instead of recoll's mimeview).
+
+
+
+
+ - Fixes an issue with the openoffice filter on debian systems.
+
- Adds Scribus and Lyx filters.
+
+
+
+
+ - Fixes two email indexing bugs in 1.7.3, which would bail out from an
+ mbox folder on the first attachment filtering error, and would decline
+ to handle multipart/signed bodies. You may need to run a full indexing
+ pass (recollindex -z), to force reindexing of old folders.
+
+
+
+
+ - Email attachments are now indexed.
+
- Right-click menu option to access the parent document of an embedded
+ result (ie from mail attachment to parent message), or the parent folder
+ of a given file (which is opened with the application configured for
+ directories)
+
- The sort tool has been improved: no need to restart the query after sort
+ criteria change.
+
- Support for real-time indexing with inotify is now enabled by default
+ when appropriate.
+
- Recoll now warns when the configured native viewer can not be found and
+ starts an interface for chosing another one.
+
- Categories (text, presentation, spreadsheets, etc.) can be used instead
+ of raw mime types when filtering on file types in advanced search.
+
- The port to qt4 is functional and can be enabled with configure --enable-qt4
+
- 'autophrase' option improved and may now actually be useful.
+
- Improved highlighting (again...)
+
- Display term frequencies in term explorer.
+
- Recollindex -e to remove data from index for listed files.
+
- Directory names now indexed. Directories can be 'edited' with the
+ configured application (rox by default)
+
+
+
+
+ - Fixed problem with bad detection of mbox message boundaries.
+ Upgrading can change the message numbering in some cases, and you should
+ perform a full index update (recollindex -z) after installing
+ the new version.
+
- Fixed problem with execution of external viewer for files with
+ single-quotes in the name.
+
+
+
+ - Minor solaris compilation glitches only.
+
+
+
+ - Term explorer: a multimode wildcard-regexp-spell/phonetic tool to search
+ the index for terms. This uses aspell for the orthographic/phonetic part.
+
- A more dynamic advanced search window. You now have a choice of the top
+ level conjunction (OR/AND) and of any number of clauses, including NEAR
+ and PHRASE clauses with an adjustable proximity parameter.
+
- User-settable format for the result-list entries, which use an HTML
+ string with %xx printf-like replacements (accessible from the user
+ preferences).
+
- Real time monitoring/indexing support. This is not configured by
+ default, and must be specified at build time (configure --help).
+
- Improved phrase/group highlighting in abstracts and preview
+
- Better sample selection for synthetic abstracts.
+
- Improved performance of the text splitter, good for indexing and previewing.
+
- Shift+click link to open new preview window instead of tab in existing
+ window.
+
- The key sequence for term completion in the simple search entry was
+ changed from CTRL+TAB to "Escape Space" to avoid interaction with window
+ managers.
+
- Improved recall for phrases with composite words like email addresses.
+
+
+ Updating from 1.2 to 1.3 or 1.4 or 1.5:
+
- --------------------------------------
+ From version 1.3 up, there is a new feature to search specifically for file
+ names (with wildcard processing). If you want to take full advantage of
+ this, you should perform a full reindex after installing the new version
+ (ie: use recollindex -z, or delete ~/.recoll/xapiandb).
+ Also, we now use the central copies of configuration files for default
+ values, and the user ones only for overrides. Your old configuration files
+ will still work, but, you may want to remove them if they are unmodified,
+ or keep only the modified parameters.
+
+
+
+
+ - Fix bad timezone conversion in email dates. Display timezone in result
+ list dates.
+
+
+
+
+ - Fix stored and displayed dates which used to come from the file's ctime,
+ now use mtime (which was already used for deciding re-indexing).
+
- Fix problem with some weird MIME messages (with null boundaries) which
+ crashed the indexer.
+
+
+
+
+ - Small fixes dealing with the build process or compiler issues.
+ 1.5.6 has updated ukrainian and russian messages.
+ Otherwise no functional changes, and no need to upgrade from 1.5.1
+
+
+
+
+ - Fix serious bug with non ascii strings in simple search history
+
- Improve synthetic abstracts: remove size limitations, handle overlapping
+ extracts, avoid printing several terms from the same position.
+
+
+
+
+
+ - Added support for powerpoint and excel files, with the catdoc package.
+
- Allow viewing consecutive documents from the result list inside a single
+ preview window using the shift-arrow-up and shift-arrow-down keys.
+
- Colorize search terms in abstracts in the result list.
+
- A number of elements are now remembered between program invocations:
+ sort criteria, list of ignored file types (always starts inactive),
+ subtree restriction, better handling of the recent searches listbox, the
+ buildAbstract and replaceAbstract settings are not forgotten any more.
+
- New option to automatically add a phrase to simple searches.
+
- Possibility to adjust the length and context width for synthetic abstracts.
+
- Handle weird html better.
+
- When indexing mail messages, walk the full mime tree instead of staying
+ at the top level, index all text parts and attachement file names.
+
- Add -c option to recoll and recollindex to specify the
+ configuration directory on the command line
+
- Better synchronization between the active preview and the highlighted
+ paragraph inside the list
+
- Improved recall for some special cases of stemming.
+
- Much better handling of email dates, allowing better email sorting by
+ date (previously the message date was quite often the date when the file
+ was indexed).
+
- Store the external database lists in the configuration directory, not the
+ qt preferences.
+
- Ensure dialogs are sized according to font size
+
+
+
+
+
+ - Multiple search databases.
+
- Optionally auto-search when a word is entered in the simple search
+ field.
+
- Show possible term completions in simple search by typing CTRL+TAB
+
- Add 'more like this' option to result list right-click menu, to look for
+ documents related to the current result.
+
- Double-click in preview or result list adds the selected word to the
+ simple search text field.
+
- The simple search text entry field is now a combobox and remembers
+ previous searches.
+
- Additional OR field in complex search.
+
- Improved indexing cancellability (interrupting recollindex or closing
+ recoll with an indexing thread active), and status reporting.
+
- Fixed filters to handle file paths with embedded spaces.
+
- Misc small bug and memory leaks fixes.
+
- More compact result list.
+
- Set mode 0700 on .recoll directory by default
+
+
+
+
+ - Implement specific search on file names with wildcard
+ support. Indexing can optionally process all file names or only those
+ with mime types supported for normal indexing. UPDATING: you need a
+ full re-indexing to take advantage of this.
+
- Use links and a right-click popup menu to replace confusing use of
+ mouse clicks and double-clicks inside the result list.
+
- The 'example' configuration files are now used as default, and are not
+ copied any more to the user directory during installation. Overrides can
+ be set in the personal files for any value that the user wishes to
+ modify, with unchanged formats and file names (so that the files from
+ previous versions remain valid, but you may wish to trim them of values
+ that duplicate the central ones).
+
- Use NLS information (LC_CTYPE, LANG) do determine default charset when
+ possible.
+
- Mp3 file indexing, either filenames only or also id3 tags if id3info is
+ available. c/c++ ext edit. Use gnuclient instead of xemacs for text files.
+
- Russian and Ukrainian translations and many improvement ideas thanks to
+ Michael Shigorin.
+
+
+
+
+ - Added support for dvi (with dvips), and dvu (with DjVuLibre).
+
- Ensure that configure and make use the same qt version.
+
- Fix sorted sequence title display.
+
- Discriminate fatal errors and missing docs while loading a doc list.
+
- Improved and cleaned up way to position a preview on the first search term.
+
+
+
+
+ - Fix minor compilation glitches (FreeBSD 4, QT 3.1, xapian-config problem)
+
+
+
+
+ - Improved preview loading: don't highlight very big documents (over 1Mb),
+ allow cancellation while loading.
+
- Abstracts generated in the result list by looking at search term
+ contexts. This can slow down result list display for big documents, and
+ can be turned off in the preferences menu.
+
- Wrap query detail line displayed when clicking on result list header.
+
- Text splitting cleanup with less spurious terms should result in
+ slightly smaller databases.
+
- Sligthly improved presentation in preview, esp. line breaks.
+
- Color icons...
+
- Let the user select the html browser used for help display.
+
- autoconf/Makefile change: allow building UI from inside the qtgui
+ directory.
+
- autoconf/Makefile: improved search and diagnostics for qt/qmake.
+
- Internal code cleanup for maintainability: text splitting, user
+ interface.
+
- Added prototype kio_slave to show result inside Konqueror, doesn't seem
+ particularly useful.
+
+
+
+
+ - A much better user manual, which can be browsed from the help menu.
+
- man pages for recoll, recollindex, recoll.conf
+
- User/query interface configuration dialog.
+
- Click on result list header will display the exact boolean search which
+ was used.
+
- recollindex can be used to create stem expansion databases independantly
+ of a full indexing pass.
+
- Misc user interface improvements, like an 'all terms' checkbox for
+ simple search.
+
- Fixed case-insensitivity issues. Probably needs more testing.
+
+
+
+
+
+ - Minor installation tweaks for rpm compatibility
+
+
+
+
+ - Fix problems with prefix != /usr/local
+
- Remove '.*' from the default list of ignored file/dir names: this
+ prevented mozilla/thunderbird mail indexing.
+
- Fix some 64 bits issues
+
+
+
+
+ - Small changes for FreeBSD 4 compilation.
+
+
+
+
+ - Install of recollinstall program not done or needed any more.
+
+
+
+
+ - Fixed nasty html parsing bug introduced in 1.0.9 Html parsing failed
+ whenever the document charset name differed from the default only in
+ character case or punctuation.
+
+
+
+
+ - Create personal configuration on first start.
+
- Use qt toolbars.
+
- Also index terms in file paths.
+
- Tool for sorting on dates or mime types.
+
- Fixed pdf filter which was broken by more recent xpdf
+
- Filters now installed/executed from /usr/local
+
+
+
+
+
+ - Added tool to manage the history of consulted documents.
+
- Try harder to convert email messages with wrongly declared charsets.
+
- Add option to reset the database before indexing (easier than rm -rf).
+
- Small gui improvements.
+
- Install partial french translation as a tease for future translaters...
+
+
+
+
+
+ - Fixed 2 really ennoying bugs in 1.0.8: wouldn't preview 2nd document
+ from same file + spurious db close when filter could not be executed.
+
+
+
+
+
+ - Add support for rtf and gaim logs
+
- Optionally show icons to indicate mime types in result list
+
- Better (but imperfect) feedback during the preview
+ loading for big files
+
- Remember main window geometry when closing
+
- Fix stem expansion in advanced search
+
- Some autoconf
+
- Option to use the system's 'file' command as a final step of
+ identification for suffix-less or unknown files.
+
- Typo had removed support for .Z compression
+
- Use more appropriate conjonction operators when
+ computing the advanced search query (OP_AND_MAYBE,
+ OP_FILTER instead of OP_AND)
+
diff --git a/website/features.html b/website/features.html
index afa48d46..04ef522c 100644
--- a/website/features.html
+++ b/website/features.html
@@ -161,10 +161,15 @@
djvu with
DjVuLibre.
- mp3 tags support with
- id3info
- (id3lib) (compiling id3lib on recent systems may need
- a small patch, see here).
+ mp3/flac/ogg vorbis
+ tags support with
+ id3info (id3lib)
+ (compiling id3lib on recent systems may need
+ a small patch, see here.) or
+ the ogg and flac tools. Release 1.14 and later use a
+ python filter based on
+ mutagen
+ for all audio tags.
Image file tags support with