295 Commits

Author SHA1 Message Date
Jean-Francois Dockes
e6c0ca403d recollindex: do not retry files which previously failed to be indexed, except if they were changed since, or option -k is set 2015-04-23 10:37:37 +02:00
Jean-Francois Dockes
12d1de2e9e Try to match as long a suffix as possible when determining MIME type. This will allow .tar.gz files to be indexed directly instead of being decompressed to a temp file first. 2015-04-16 19:52:04 +02:00
Jean-Francois Dockes
d10b557e4c Use special case for indexing empty files 2015-04-16 19:49:40 +02:00
Jean-Francois Dockes
ce4405a666 make sure that CPPFLAGS CXXFLAGS and LDFLAGS are used during the build 2015-03-01 12:43:09 +01:00
Jean-Francois Dockes
811e02bf87 Revert change in indexfiles which was preventing the real time indexer from processing the web queue 2015-02-16 19:35:24 +01:00
Jean-Francois Dockes
727c055c86 web queue: fix cache resizing utility + bug in indexer which would skip oldest entry 2015-02-15 16:08:36 +01:00
Jean-Francois Dockes
cd892ee1a7 usemtime config variable needs to be global 2014-12-09 15:58:23 +01:00
Jean-Francois Dockes
d630cbbaec Delete RCL_USE_XATTR configure/compile time variable, it was not
useful. Add configuration variable to use mtime instead of ctime for update
detection. Useful on a system where xattrs would be modified but not
indexed, to avoid excessive reindexing.
2014-12-09 11:15:17 +01:00
Jean-Francois Dockes
9346c4db9c fsindexer: avoid generating an empty bogus document when getting a subdoc error after the base doc has been seen 2014-10-29 08:18:20 +01:00
Jean-Francois Dockes
bba7875a10 Defining a topdir under a skippedPath is allowed and usefully works with normal indexing. This change fixes a bug which prevented it from working with the real-time monitor and recollindex -i 2014-10-10 10:17:57 +02:00
Jean-Francois Dockes
368ebf2f86 All docs get containerfilename field, indexed exclusively with prefix (only a field search containerfilename:someterm can match). 2014-07-23 19:30:41 +02:00
Jean-Francois Dockes
46e924f3ea Allow configuration of the last-resort external command used to identify mime types 2014-07-19 16:15:43 +02:00
Jean-Francois Dockes
088f1f9a9f try out modification to use container file name for display, but not search, purposes when the subdocument file name is not set 2014-07-16 17:05:26 +02:00
Jean-Francois Dockes
bf9c813ec9 Web history indexing: fmtime was not saved to the cache, resulting in empty dates in queries 2014-07-08 15:06:47 +02:00
Jean-Francois Dockes
cc052a31f8 don't add the top container file name to the subdocs terms 2014-06-12 14:17:18 +02:00
Jean-Francois Dockes
e867f855ad get rid of numerous probably inocuous valgrind/helgrind messages by ensuring that actual string copies are passed between threads, without refcount/shared data magic 2014-05-05 19:01:58 +02:00
Jean-Francois Dockes
ed7a970ee6 recollindex: allow passing -e -i for erasing index data before rebuilding 2014-05-01 08:04:44 +02:00
Jean-Francois Dockes
7c9b13d64e recollindex -i or -e: do not process the web queue 2014-04-30 11:12:08 +02:00
Jean-Francois Dockes
00f9e37a1d message 2013-11-18 09:03:32 +01:00
Jean-Francois Dockes
a75d4c24ff fix multiple typos in commit 25c58c7/8ce2914 2013-10-29 08:36:14 +01:00
medoc
621c075820 look for the expedited indexing request file in all possible user configuration directories 2013-10-21 08:47:42 +02:00
Jean-Francois Dockes
6ddb0836c1 comment 2013-10-08 10:29:22 +02:00
Jean-Francois Dockes
56a56500c1 Handle partial indexing of document restricted to metadata from extended attributes 2013-10-04 10:57:11 +02:00
Jean-Francois Dockes
ccf175068b allow disabling the modif accumulation wait by creating a magic file 2013-09-30 18:43:15 +02:00
Jean-Francois Dockes
f49264a3f5 message 2013-09-27 13:04:23 +02:00
Jean-Francois Dockes
9cb5371820 get rid of the OFFTPC nonsense and cast to long long use lld to print off_t 2013-09-26 14:35:50 +02:00
medoc
641acd3d68 move the execution of external metadata-gathering commands from fsindexer to internfile for consistency of handling with filter-generated metadata 2013-09-06 11:51:00 +02:00
Jean-Francois Dockes
e515b9a9f9 Ignore empty mime_type extended attribute returned by bug in CIFS linux kernel module (samba mounts) 2013-06-17 11:52:04 +02:00
Jean-Francois Dockes
e77c510dfe Store original working directory before changing, for later turning user args into absolute paths 2013-06-06 14:05:26 +02:00
Jean-Francois Dockes
92331bfcbc added recursive reindex option to recollindex command line 2013-04-29 08:50:54 +02:00
Jean-Francois Dockes
555487ed98 none 2013-04-29 08:13:51 +02:00
Jean-Francois Dockes
62ca9549a3 fixed a few connections in subdocs aux reslist 2013-04-25 14:21:24 +02:00
Jean-Francois Dockes
2b80c77c23 Add possibility to display a list of sub-documents for a given result 2013-04-24 16:33:53 +02:00
Jean-Francois Dockes
906e58feff added code to purge obsolete messages when a compound document (esp. mbox) is shortened and a partial update is performed (no general purge). Else the orphan docs remained in the index potentially forever (needed actual reindex of the file by a full pass to go away) 2013-04-22 11:32:49 +02:00
Jean-Francois Dockes
f05cae7344 arrange so that a default first indexing run for a given user runs a quick shallow pass, so that queries return some results quickly after indexing starts, avoiding user frustration 2013-04-18 14:29:32 +02:00
Jean-Francois Dockes
caf5d105df Add support for indexing symlinks as such. Use inode/directory instead of application/x-fsdirectory as directory mime type. Fix stat/lstat usage in places (wipedir) 2013-04-17 18:34:02 +02:00
Jean-Francois Dockes
a74d885309 use st_ctime always to detect file changes 2013-03-31 08:36:11 +02:00
Jean-Francois Dockes
3736c00a3b more small fnmatch() fixes 2013-03-27 16:56:54 +01:00
Jean-Francois Dockes
4a870f984d Add parameter to chdir on recollindex startup, /tmp by default. Closes issue #129 2013-03-07 11:05:54 +01:00
Jean-Francois Dockes
3c80e51940 simplified temp file handling for compressed documents and, for querying, implemented caching for last file uncompressed 2013-03-06 18:52:57 +01:00
Jean-Francois Dockes
d3631b5ddf cleaned up processing of metadata from diverse origins (doc,extattrs,localfields) 2013-01-29 14:33:57 +01:00
Jean-Francois Dockes
e8afc66d8f fix compile issue with idxthreads enabled 2013-01-28 09:45:33 +01:00
Jean-Francois Dockes
6c4cdbb5d4 test "mime_type" extended attribute before trying other type recognition methods 2013-01-23 11:55:15 +01:00
Jean-Francois Dockes
4272939c3d add possibility to import tags from external app, e.g. tmsu 2013-01-22 18:30:18 +01:00
Jean-Francois Dockes
cbc269abb1 define new searchdataclausepath to replace the old dir: filtering mechanism. ORing dirs now works 2013-01-05 16:21:30 +01:00
Jean-Francois Dockes
49ce79d6a6 build with lib moved to recoll subdir works for debian 2012-12-30 16:11:51 +01:00
Jean-Francois Dockes
d2f7f11715 Use dynamic lib for shared recoll code 2012-12-29 14:27:01 +01:00
Jean-Francois Dockes
7ee94dbf3e traces 2012-12-28 16:31:10 +01:00
Jean-Francois Dockes
a0e273ddd4 Drain the job pipeline in fsindexer::purgefiles like was done in fsindexer::indexfiles as there is no warranty that the latter will be called before a db::close for each iteration of the monitor loop 2012-12-15 09:21:50 +01:00
Jean-Francois Dockes
72927b3fc8 additional cleanup after switch from .beagle to .recollweb 2012-12-14 13:20:09 +01:00