159 Commits

Author SHA1 Message Date
Jean-Francois Dockes
560041cab9 cleared out errant tabs 2020-05-30 15:54:49 +02:00
Jean-Francois Dockes
5f76c2527d GUI searching with saved query: restore external indexes from saved query 2020-05-19 14:20:21 +02:00
Jean-Francois Dockes
9565663f09 textsplit: create isNGRAMMED() method to replace isCJK() and let the latter actually return what it says 2020-04-14 09:27:26 +02:00
Jean-Francois Dockes
3f7d270691 GUI preview: improve operation when the index data is not up to date.
Avoid erasing all the file index data in case the subsequent update fails
(e.g. the file is locked). Improve the messages. Check for previous
indexing error, and modify the message.
2019-06-24 17:37:37 +02:00
Jean-Francois Dockes
be214c4a5a Take advantage of text storage when possible to display preview data for an unaccessible document 2019-06-16 11:49:18 +02:00
Jean-Francois Dockes
0cbc46732f Fixed the FSF address 2019-03-04 11:19:14 +01:00
Jean-Francois Dockes
399c633efd Avoid purging documents from absent mountable volumes 2019-02-03 18:51:52 +01:00
Jean-Francois Dockes
6441eea8aa Store the origin dbdir inside the GUI doc history, so we can later fetch documents from external indexes 2018-05-31 15:01:17 +02:00
Jean-Francois Dockes
21adaca229 Add parameter to truncate all document text to specified length 2018-04-08 10:54:09 +02:00
Jean-Francois Dockes
3d4fd3c62e When storing doc text, always use a metadata entry. Get rid of the code to
store it in the data record. Make storing the default.  Add "fetchtext"
parameter to getDoc() to fetch and store the text in doc.text. Make this
accessible from Python. Misc comments and indents.
2018-01-25 13:20:02 +01:00
Jean-Francois Dockes
3d22f2c436 Merge branch 'RECOLL_1_23_MAINT' 2018-01-08 14:30:30 +01:00
Jean-Francois Dockes
e39b27e920 Dir filtering: would miss results if the filtered-for directory was partly upper-case and had a lower-case quasi-homonym 2018-01-08 14:19:48 +01:00
Jean-Francois Dockes
2c76a70c0e Abstracts: storing raw doc text in user metadata records 2018-01-06 11:38:24 +01:00
Jean-Francois Dockes
b4493ed9e1 Snippets generation: add method for generating from doc stored text. Still needs refining, esp. for phrase/near 2017-12-30 08:43:14 +01:00
Jean-Francois Dockes
503edd5a92 Add function to list failed files to the term explorer 2017-09-03 10:46:11 +02:00
Jean-Francois Dockes
f853f39ef3 Partially revert change treating Katakana as words, going back to n-grams. Did not work well because of separator-less compounds mostly 2017-04-25 10:20:38 +02:00
Jean-Francois Dockes
9661a4431e wen 2017-04-18 14:39:12 +02:00
Jean-Francois Dockes
bde991c08a got rid of off_t 2017-02-28 20:36:01 +01:00
Jean-Francois Dockes
0b0385e459 got rid of the STD_SHARED_XX std/tr1 defines 2016-07-13 15:12:25 +02:00
Jean-Francois Dockes
53f57e75c6 rcldb syngroups: allow setting synonyms source after construction 2015-11-21 12:48:09 +01:00
Jean-Francois Dockes
e7a669b668 Support multi-word synonyms and add modifier to turn-off synonyms expansion 2015-08-23 12:15:52 +02:00
Jean-Francois Dockes
d8e8ce6ee2 Implement single-term query-time synonyms 2015-08-22 15:11:07 +02:00
Jean-Francois Dockes
4d1f679eac Use std[::tr1]::shared_ptr instead of local RefCntr by default 2015-08-09 13:54:24 +02:00
Jean-Francois Dockes
c8d34dc8ba Prevent highligting of bogus terms in results (prevent path elts, negative queries or internal stuff) 2015-08-08 21:56:45 +02:00
Jean-Francois Dockes
e6c0ca403d recollindex: do not retry files which previously failed to be indexed, except if they were changed since, or option -k is set 2015-04-23 10:37:37 +02:00
Jean-Francois Dockes
19da7020d6 Add new kind of aliases for field names, to be used only at query time 2014-07-23 16:52:08 +02:00
Jean-Francois Dockes
60d3ba11ac add parameter to set max stored length of metadata fields. Previously fixed at 150. Fixes issue #178 2014-03-26 18:43:49 +01:00
Jean-Francois Dockes
4e8aa756cb comment 2014-03-26 18:02:59 +01:00
Jean-Francois Dockes
56a56500c1 Handle partial indexing of document restricted to metadata from extended attributes 2013-10-04 10:57:11 +02:00
Jean-Francois Dockes
334e16bee5 comments 2013-09-29 18:29:53 +02:00
Jean-Francois Dockes
a1b7018cfd Fix problems which occurred when using functions like open-parents with multiple indexes containing identical paths (udis) 2013-05-25 11:26:57 +02:00
Jean-Francois Dockes
1805c367ea check the stripchars options when adding an external index 2013-04-29 15:34:17 +02:00
Jean-Francois Dockes
2b80c77c23 Add possibility to display a list of sub-documents for a given result 2013-04-24 16:33:53 +02:00
Jean-Francois Dockes
906e58feff added code to purge obsolete messages when a compound document (esp. mbox) is shortened and a partial update is performed (no general purge). Else the orphan docs remained in the index potentially forever (needed actual reindex of the file by a full pass to go away) 2013-04-22 11:32:49 +02:00
Jean-Francois Dockes
f05cae7344 arrange so that a default first indexing run for a given user runs a quick shallow pass, so that queries return some results quickly after indexing starts, avoiding user frustration 2013-04-18 14:29:32 +02:00
Jean-Francois Dockes
de82f13ddc add functions and interface to display the duplicates of a result document 2013-04-17 09:36:46 +02:00
Jean-Francois Dockes
aaa98a1156 make sure that expansions of * done to retrieve all years or all mime types from index are done against the appropriate field section of the main index, not the whole synexpand one 2013-04-13 16:34:09 +02:00
Jean-Francois Dockes
838ea6d623 got rid of STRIPCHARS compile-time variable to make the code more readable, and corresponding configure option. + make sure that CONFDIR from environment gets translated to absolute path 2013-03-27 17:38:11 +01:00
Jean-Francois Dockes
fe2f32fc38 split the term expansion code out of rcldb.cpp 2013-01-15 10:06:24 +01:00
Jean-Francois Dockes
d9e6030b66 reorganized the term expansion code so that the term explorer works fully with case and diac sensitivity options 2013-01-14 18:06:48 +01:00
Jean-Francois Dockes
29d4d959a5 Add db stats option to term explorer 2013-01-10 15:12:43 +01:00
Jean-Francois Dockes
edb1bb5eb1 protect the xrdb from multiple accesses from needupdate tests and write thread 2013-01-01 16:20:06 +01:00
Jean-Francois Dockes
4afde68f9e more config isolation. Still crashing 2012-11-30 16:45:02 +01:00
Jean-Francois Dockes
9146929f5e implemented configuration for thread structure. Still crashes, but not often... 2012-11-29 17:26:48 +01:00
Jean-Francois Dockes
f252050247 ensure that indexing threads operations are transparent to high level callers such as the real time monitor 2012-11-26 15:40:31 +01:00
Jean-Francois Dockes
cb167cab0a none 2012-11-18 11:55:54 +01:00
Jean-Francois Dockes
cd53c0a536 Multithreaded indexing seems not to crash anymore thanks to locked existence map 2012-11-02 21:43:51 +01:00
Jean-Francois Dockes
3d59c6933a comments 2012-11-02 11:37:23 +01:00
Jean-Francois Dockes
ee7d0f2ee7 1st parallel multithreaded version of indexing which can do my home without crashing... Let's checkpoint 2012-11-01 11:19:48 +01:00
Jean-Francois Dockes
bfeb681574 mimetype T prefix was mishandled for a raw index 2012-10-13 11:08:53 +02:00