Recoll journal of user-visible changes
+ +1.11.0
+ +Recoll release 1.11 has relatively extensive changes that have + necessitated a modification of the index format. Hence installing this + release implies a full re-indexing, which is enforced by the + software.
+ +-
+
- Filtering on category (message/text/media etc.) as a function of + the main window for quick access. + +
- Use html for preview when available (ex: html files or "colorized" + python) instead of converting to text. This can be turned of in the + preferences. + +
- New Python query and index interfaces. The Python query + interface will be used for building a Xesam adapter for + Recoll when the specification is stabilized, and could be + useful for other things, such as indexing contents from an + RDBMS (see + + the manual for details). Restructured and cleaned up + internal Recoll interfaces. + +
- Improved filter framework. Can now process either html or text output + from the filters, and more easily execute "raw" commands instead of + Recoll scripts. Avoided wasteful repeated execution of filters for + which the helper application is missing. + +
- Query language now closer to Xesam specification, (but still far from a + complete implementation). See the Recoll manual and + + http://www.xesam.org/main/XesamUserSearchLanguage + +
- Much improved configuration for fields. Fields like + "author" can now be specified as storable (displayable in + results) and/or indexed (searchable). Added alias facility + for translating from user-level names to internal. + +
- Added "recipient" as an indexed/searchable field for emails. + +
- rcltext filter for processing text such as C code for which no specific + processing is needed when indexing but a specific viewer is + desired. +
1.10.6
+-
+
- Fix a simple and mildly nasty bug that would cause the + indexer to stop + indexing an mbox on encountering a specific but not exceptional error + condition (like a few dozen errors while indexing attachments for which + no filter was installed). +
1.10.5
+-
+
- Ensure that file names indexed as terms don't overflow the maximum term + size. + +
- Handle non-standard date format in mbox separator lines sometimes + generated by thunderbird. + +
- Use attachment file names to help identify a better mime type for + parts only described as application/octet-stream + +
- For Phrase/Near searches, highlight all term groups in preview, not just + the first + +
- Added Open XML filters + +
1.10.2
+-
+
+
- Fixed openSuse 11 compile issues. + +
- Fixed bug in interpreting email mime structure, which resulted in base-64 + decoding errors. + +
- Fixed "Prev" button in preview window. Would actually go forward when + walking the search terms. + +
- Allow setting the highlight color for search terms in result list and + preview (yes: feature change, should have waited for major release...) + +
- Added svg filter + +
1.10.1
+-
+
+
- Ensure that in case the data of a file can't be indexed because of some + error, at least the file name is indexed. + +
- Improve query language to support OR queries of terms with field + specifications (ie: title:someterm OR author:someauthor). + +
- Fix filename search to split patterns on white space, so that + a "*.jpg *.jpeg" search does what's expected. Means you now need to use + double-quotes if there is actual embedded white space. + +
- Jump directly to the external editor choice dialog instead of opening + preferences when an external viewer is not found. + +
- Allow stopping indexing through menu action (only works with qt4 for now). + +
- Create an "indexedmimetypes" configuration variable to allow explicitely + restricting the file types which do get indexed. + + +
1.10.0
+-
+
+
- Added a GUI dialog to configure the indexing parameters. + +
- Added better support for indexing CJK text (Chinese, Japanese, Korean). + Please note that: + - You will need a full reindex to take good advantage of this. (You + *don't* need to reindex if you don't need to search CJK, even if there + is some in your index). + - When entering CJK search terms, words (single or multiple characters) + should be separated with white space. + - The specific CJK processing can be turned off by setting the nocjk + variable to true in the configuration file (this may make sense if you + have a mixed cjk/other document base and don't want to index the cjk + part, as it will save some disk space and a minuscule amount of cpu). + +
- Changed the way Recoll handles searches including composite words (like + an email address). The new approach looks saner, but could have + side-effects, please report any problems in this area. + +
- The query language got a new "dir:" specifier to filter results on location. + +
- New rclimg perl filter for better indexing of picture tags, thanks to + Cedric Scott. This depends on Exiftool. +
- New rcltex filter. + +
- Changed and improved how the preview window local search finds the + query terms, this does not involve weird characters any more. The + display is cleaner and cut and paste works better. + +
- Fixed the fact that a newline-separated word list in simple search would + wrongly trigger a phrase search. + +
- Fixed the way we input text to the preview textedit (the old way would + sometimes confuse the window into displaying tags instead of acting on + them). + +
- Fixed transcoding to utf-8 for text/plain email attachments + +
- Improved mbox From_ line detection + +
- Added indexedmimetypes variables to allow restricting the list of indexed + mime types. + +
- KDE kicker applet: start a recoll search from the panel and get a + Recoll window. This is a clone from the find_applet, originally meant to + start a Tracker search. Not so useful presently because it will start a + new Recoll instance for every search. Not part of the main source (the + configure script is a whopping 1MB...), linked from the download page. +
- Added recoll command line options to define a query and execute it + immediately when the program starts. This is used in practice from the + applet and could be used from other programs. There is a also a new + option to not start the GUI and print the results to stdout. + +
1.9.0
+-
+
- Incompatible change: the icon image reference is now part of the result
+ list paragraph format string:
+ - If you had a standard config, you need do nothing.
+ - If you had a custom format string, you need to add
+
at its beginning to get the same result as + before. + - If you had unchecked the "show icons" option, you need to remove the + above string from the paragraph format to make the icons go away. + Changes to the format string are performed in the + "Preferences->Query Configuration->User Interface" dialog tab. + +
- New filters: wordperfect, abiword and kword, rcljpeg, rclflac, rclogg + (contributed filters). The jpeg and audio filters should be extended to + make use of the new field indexing/search capability (hint :) ) + +
- When searching for an empty string inside the preview window, position + the window to the next occurrence of a primary search term. + +
- Added ext: and mime: selectors to the query language. + +
- Added an adjustable flush threshold during indexing: should help control + memory usage. See the idxflushmb configuration variable. + +
- Added a check for file system free space. Indexing will stop if the + threshold is reached. See the maxfsoccuppc configuration parameter. + +
- Added 'followLinks' configuration option to have the indexer follow + symbolic links while walking the tree (the default is false). + +
- Allow symbolic links as 'topdirs' members. These are always followed. + +
- Add preference option to remember sort tool state between program + invocations (it is reset to inactive by default) + +
- Added File menu entry to erase document history. + +
- Bound the space and backspace keys to PgUp/PgDown in preview. + +
- (Hopefully) Improved abstract (keyword in context) generation + +
- Added support for arbitrary fields. Filters can now produce any number of + fields which will be selectively searchable through the query + language. This could be useful, for exemple, for the mp3 and jpeg filters + (but it is not currently used). + +
- Improved qt4 build: no more need for --enable-qt4. Note: the qt4 build + still needs the qt3 support library. + +
- Changed the icon to an ugly one. The previous one was nicer but looked + too much like Xapian's. + +
- Added some kind of support for a stopword list. + +
- Have email attachments inherit date and author from their parent message + (instead of mail folder). + +
- Fix bus error on rclmon exit + +
- Better handling of aspell errors inside rclmon + +
- Fixed a number of qt4 glitches: selection and keyboard shortcuts. + +
- New query configuration parameter to set the maximum text size beyond + which text won't be hilighted before preview (takes too much time). This + was a fixed value in 1.8. + +
1.8.2 2007-05-19
+-
+
- Fixed method name for compatibility with xapian 1.0.0 +
- Add .beagle to default list of skipped names (avoids indexing beagle + document cache...) +
- Fix configure.ac to use $libdir instead of /usr/lib +
- Fix recollinstall to properly copy translations and pictures for qt4 + +
1.8.1 2007-02-20
+-
+
- Add a small query language with some field-based searches (author, title, + etc.) +
- Add wildcard handling everywhere. *, ?, [] can be used in any + search. Warning: using a wild card at the left of a term can make + for a very slow search. +
- Allow skipping specific paths during indexing (in addition to file name + patterns) +
- Improved external index choice dialog, accessible from the top-level menu. +
- Many small bugs fixed: stemming language choice ignored in term explorer, + qt4 preview window reentrancy crashes, issues with saving the default + advanced search file, type filter, display more clearly missing helper + errors, etc. +
- Option to use the desktop defaults (with xdg-open) to choose the native + viewer for files (instead of recoll's mimeview). + +
1.7.6 2007-01-30
+-
+
- Fixes an issue with the openoffice filter on debian systems. +
- Adds Scribus and Lyx filters. + +
1.7.5 2007-01-15
+-
+
- Fixes two email indexing bugs in 1.7.3, which would bail out from an + mbox folder on the first attachment filtering error, and would decline + to handle multipart/signed bodies. You may need to run a full indexing + pass (recollindex -z), to force reindexing of old folders. + +
1.7.3 2007-01-09
+-
+
- Email attachments are now indexed. +
- Right-click menu option to access the parent document of an embedded + result (ie from mail attachment to parent message), or the parent folder + of a given file (which is opened with the application configured for + directories) +
- The sort tool has been improved: no need to restart the query after sort + criteria change. +
- Support for real-time indexing with inotify is now enabled by default + when appropriate. +
- Recoll now warns when the configured native viewer can not be found and + starts an interface for chosing another one. +
- Categories (text, presentation, spreadsheets, etc.) can be used instead + of raw mime types when filtering on file types in advanced search. +
- The port to qt4 is functional and can be enabled with configure --enable-qt4 +
- 'autophrase' option improved and may now actually be useful. +
- Improved highlighting (again...) +
- Display term frequencies in term explorer. +
- Recollindex -e to remove data from index for listed files. +
- Directory names now indexed. Directories can be 'edited' with the + configured application (rox by default) + +
1.6.3
+-
+
- Fixed problem with bad detection of mbox message boundaries. + Upgrading can change the message numbering in some cases, and you should + perform a full index update (recollindex -z) after installing + the new version. +
- Fixed problem with execution of external viewer for files with + single-quotes in the name. +
1.6.2
+-
+
- Minor solaris compilation glitches only. +
1.6.1
+-
+
- Term explorer: a multimode wildcard-regexp-spell/phonetic tool to search + the index for terms. This uses aspell for the orthographic/phonetic part. +
- A more dynamic advanced search window. You now have a choice of the top + level conjunction (OR/AND) and of any number of clauses, including NEAR + and PHRASE clauses with an adjustable proximity parameter. +
- User-settable format for the result-list entries, which use an HTML + string with %xx printf-like replacements (accessible from the user + preferences). +
- Real time monitoring/indexing support. This is not configured by + default, and must be specified at build time (configure --help). +
- Improved phrase/group highlighting in abstracts and preview +
- Better sample selection for synthetic abstracts. +
- Improved performance of the text splitter, good for indexing and previewing. +
- Shift+click link to open new preview window instead of tab in existing + window. +
- The key sequence for term completion in the simple search entry was + changed from CTRL+TAB to "Escape Space" to avoid interaction with window + managers. +
- Improved recall for phrases with composite words like email addresses. + + + Updating from 1.2 to 1.3 or 1.4 or 1.5: +
- -------------------------------------- + From version 1.3 up, there is a new feature to search specifically for file + names (with wildcard processing). If you want to take full advantage of + this, you should perform a full reindex after installing the new version + (ie: use recollindex -z, or delete ~/.recoll/xapiandb). + Also, we now use the central copies of configuration files for default + values, and the user ones only for overrides. Your old configuration files + will still work, but, you may want to remove them if they are unmodified, + or keep only the modified parameters. + +
1.5.9
+-
+
- Fix bad timezone conversion in email dates. Display timezone in result + list dates. + +
1.5.8
+-
+
- Fix stored and displayed dates which used to come from the file's ctime, + now use mtime (which was already used for deciding re-indexing). +
- Fix problem with some weird MIME messages (with null boundaries) which + crashed the indexer. + +
1.5.6
+-
+
- Small fixes dealing with the build process or compiler issues. + 1.5.6 has updated ukrainian and russian messages. + Otherwise no functional changes, and no need to upgrade from 1.5.1 + +
1.5.1
+-
+
- Fix serious bug with non ascii strings in simple search history +
- Improve synthetic abstracts: remove size limitations, handle overlapping + extracts, avoid printing several terms from the same position. + +
1.5.0 2006-09-20
+-
+
+
- Added support for powerpoint and excel files, with the catdoc package. +
- Allow viewing consecutive documents from the result list inside a single + preview window using the shift-arrow-up and shift-arrow-down keys. +
- Colorize search terms in abstracts in the result list. +
- A number of elements are now remembered between program invocations: + sort criteria, list of ignored file types (always starts inactive), + subtree restriction, better handling of the recent searches listbox, the + buildAbstract and replaceAbstract settings are not forgotten any more. +
- New option to automatically add a phrase to simple searches. +
- Possibility to adjust the length and context width for synthetic abstracts. +
- Handle weird html better. +
- When indexing mail messages, walk the full mime tree instead of staying + at the top level, index all text parts and attachement file names. +
- Add -c
option to recoll and recollindex to specify the + configuration directory on the command line + - Better synchronization between the active preview and the highlighted + paragraph inside the list +
- Improved recall for some special cases of stemming. +
- Much better handling of email dates, allowing better email sorting by + date (previously the message date was quite often the date when the file + was indexed). +
- Store the external database lists in the configuration directory, not the + qt preferences. +
- Ensure dialogs are sized according to font size + + +
1.4.3 2006-05-07
+-
+
- Multiple search databases. +
- Optionally auto-search when a word is entered in the simple search + field. +
- Show possible term completions in simple search by typing CTRL+TAB +
- Add 'more like this' option to result list right-click menu, to look for + documents related to the current result. +
- Double-click in preview or result list adds the selected word to the + simple search text field. +
- The simple search text entry field is now a combobox and remembers + previous searches. +
- Additional OR field in complex search. +
- Improved indexing cancellability (interrupting recollindex or closing + recoll with an indexing thread active), and status reporting. +
- Fixed filters to handle file paths with embedded spaces. +
- Misc small bug and memory leaks fixes. +
- More compact result list. +
- Set mode 0700 on .recoll directory by default + +
1.3.3 2006-04-04
+-
+
- Implement specific search on file names with wildcard + support. Indexation can optionally process all file names or only those + with mime types supported for normal indexation. UPDATING: you need a + full re-indexation to take advantage of this. +
- Use links and a right-click popup menu to replace confusing use of + mouse clicks and double-clicks inside the result list. +
- The 'example' configuration files are now used as default, and are not + copied any more to the user directory during installation. Overrides can + be set in the personal files for any value that the user wishes to + modify, with unchanged formats and file names (so that the files from + previous versions remain valid, but you may wish to trim them of values + that duplicate the central ones). +
- Use NLS information (LC_CTYPE, LANG) do determine default charset when + possible. +
- Mp3 file indexing, either filenames only or also id3 tags if id3info is + available. c/c++ ext edit. Use gnuclient instead of xemacs for text files. +
- Russian and Ukrainian translations and many improvement ideas thanks to + Michael Shigorin. + +
1.2.3 2006-03-03
+-
+
- Added support for dvi (with dvips), and dvu (with DjVuLibre). +
- Ensure that configure and make use the same qt version. +
- Fix sorted sequence title display. +
- Discriminate fatal errors and missing docs while loading a doc list. +
- Improved and cleaned up way to position a preview on the first search term. + +
1.2.2 2006-02-02
+-
+
- Fix minor compilation glitches (FreeBSD 4, QT 3.1, xapian-config problem) + +
1.2.0 2006-02-01
+-
+
- Improved preview loading: don't highlight very big documents (over 1Mb), + allow cancellation while loading. +
- Abstracts generated in the result list by looking at search term + contexts. This can slow down result list display for big documents, and + can be turned off in the preferences menu. +
- Wrap query detail line displayed when clicking on result list header. +
- Text splitting cleanup with less spurious terms should result in + slightly smaller databases. +
- Sligthly improved presentation in preview, esp. line breaks. +
- Color icons... +
- Let the user select the html browser used for help display. +
- autoconf/Makefile change: allow building UI from inside the qtgui + directory. +
- autoconf/Makefile: improved search and diagnostics for qt/qmake. +
- Internal code cleanup for maintainability: text splitting, user + interface. +
- Added prototype kio_slave to show result inside Konqueror, doesn't seem + particularly useful. +
1.1.0 2006-01-12
+-
+
- A much better user manual, which can be browsed from the help menu. +
- man pages for recoll, recollindex, recoll.conf +
- User/query interface configuration dialog. +
- Click on result list header will display the exact boolean search which + was used. +
- recollindex can be used to create stem expansion databases independantly + of a full indexing pass. +
- Misc user interface improvements, like an 'all terms' checkbox for + simple search. +
- Fixed case-insensitivity issues. Probably needs more testing. + +
1.0.16 2006-01-05
+-
+
- Minor installation tweaks for rpm compatibility +
1.0.15
+-
+
- Fix problems with prefix != /usr/local +
- Remove '.*' from the default list of ignored file/dir names: this + prevented mozilla/thunderbird mail indexing. +
- Fix some 64 bits issues +
1.0.14
+-
+
- Small changes for FreeBSD 4 compilation. +
1.0.13
+-
+
- Install of recollinstall program not done or needed any more. +
1.0.12
+-
+
- Fixed nasty html parsing bug introduced in 1.0.9 Html parsing failed + whenever the document charset name differed from the default only in + character case or punctuation. +
1.0.11
+-
+
- Create personal configuration on first start. +
- Use qt toolbars. +
- Also index terms in file paths. +
- Tool for sorting on dates or mime types. +
- Fixed pdf filter which was broken by more recent xpdf +
- Filters now installed/executed from /usr/local + +
1.0.10
+-
+
- Added tool to manage the history of consulted documents. +
- Try harder to convert email messages with wrongly declared charsets. +
- Add option to reset the database before indexing (easier than rm -rf). +
- Small gui improvements. +
- Install partial french translation as a tease for future translaters... + +
1.0.9
+-
+
- Fixed 2 really ennoying bugs in 1.0.8: wouldn't preview 2nd document + from same file + spurious db close when filter could not be executed. + +
1.0.8
+-
+
- Add support for rtf and gaim logs +
- Optionally show icons to indicate mime types in result list +
- Better (but imperfect) feedback during the preview + loading for big files +
- Remember main window geometry when closing +
- Fix stem expansion in advanced search +
- Some autoconf +
- Option to use the system's 'file' command as a final step of + identification for suffix-less or unknown files. +
- Typo had removed support for .Z compression +
- Use more appropriate conjonction operators when computing the advanced + search query (OP_AND_MAYBE, OP_FILTER instead of OP_AND) +
)
- for these: i386:
-