Jean-Francois Dockes
e6c0ca403d
recollindex: do not retry files which previously failed to be indexed, except if they were changed since, or option -k is set
2015-04-23 10:37:37 +02:00
Jean-Francois Dockes
12d1de2e9e
Try to match as long a suffix as possible when determining MIME type. This will allow .tar.gz files to be indexed directly instead of being decompressed to a temp file first.
2015-04-16 19:52:04 +02:00
Jean-Francois Dockes
d10b557e4c
Use special case for indexing empty files
2015-04-16 19:49:40 +02:00
Jean-Francois Dockes
ce4405a666
make sure that CPPFLAGS CXXFLAGS and LDFLAGS are used during the build
2015-03-01 12:43:09 +01:00
Jean-Francois Dockes
811e02bf87
Revert change in indexfiles which was preventing the real time indexer from processing the web queue
2015-02-16 19:35:24 +01:00
Jean-Francois Dockes
727c055c86
web queue: fix cache resizing utility + bug in indexer which would skip oldest entry
2015-02-15 16:08:36 +01:00
Jean-Francois Dockes
cd892ee1a7
usemtime config variable needs to be global
2014-12-09 15:58:23 +01:00
Jean-Francois Dockes
d630cbbaec
Delete RCL_USE_XATTR configure/compile time variable, it was not
...
useful. Add configuration variable to use mtime instead of ctime for update
detection. Useful on a system where xattrs would be modified but not
indexed, to avoid excessive reindexing.
2014-12-09 11:15:17 +01:00
Jean-Francois Dockes
9346c4db9c
fsindexer: avoid generating an empty bogus document when getting a subdoc error after the base doc has been seen
2014-10-29 08:18:20 +01:00
Jean-Francois Dockes
bba7875a10
Defining a topdir under a skippedPath is allowed and usefully works with normal indexing. This change fixes a bug which prevented it from working with the real-time monitor and recollindex -i
2014-10-10 10:17:57 +02:00
Jean-Francois Dockes
368ebf2f86
All docs get containerfilename field, indexed exclusively with prefix (only a field search containerfilename:someterm can match).
2014-07-23 19:30:41 +02:00
Jean-Francois Dockes
46e924f3ea
Allow configuration of the last-resort external command used to identify mime types
2014-07-19 16:15:43 +02:00
Jean-Francois Dockes
088f1f9a9f
try out modification to use container file name for display, but not search, purposes when the subdocument file name is not set
2014-07-16 17:05:26 +02:00
Jean-Francois Dockes
bf9c813ec9
Web history indexing: fmtime was not saved to the cache, resulting in empty dates in queries
2014-07-08 15:06:47 +02:00
Jean-Francois Dockes
cc052a31f8
don't add the top container file name to the subdocs terms
2014-06-12 14:17:18 +02:00
Jean-Francois Dockes
e867f855ad
get rid of numerous probably inocuous valgrind/helgrind messages by ensuring that actual string copies are passed between threads, without refcount/shared data magic
2014-05-05 19:01:58 +02:00
Jean-Francois Dockes
ed7a970ee6
recollindex: allow passing -e -i for erasing index data before rebuilding
2014-05-01 08:04:44 +02:00
Jean-Francois Dockes
7c9b13d64e
recollindex -i or -e: do not process the web queue
2014-04-30 11:12:08 +02:00
Jean-Francois Dockes
00f9e37a1d
message
2013-11-18 09:03:32 +01:00
Jean-Francois Dockes
a75d4c24ff
fix multiple typos in commit 25c58c7/8ce2914
2013-10-29 08:36:14 +01:00
medoc
621c075820
look for the expedited indexing request file in all possible user configuration directories
2013-10-21 08:47:42 +02:00
Jean-Francois Dockes
6ddb0836c1
comment
2013-10-08 10:29:22 +02:00
Jean-Francois Dockes
56a56500c1
Handle partial indexing of document restricted to metadata from extended attributes
2013-10-04 10:57:11 +02:00
Jean-Francois Dockes
ccf175068b
allow disabling the modif accumulation wait by creating a magic file
2013-09-30 18:43:15 +02:00
Jean-Francois Dockes
f49264a3f5
message
2013-09-27 13:04:23 +02:00
Jean-Francois Dockes
9cb5371820
get rid of the OFFTPC nonsense and cast to long long use lld to print off_t
2013-09-26 14:35:50 +02:00
medoc
641acd3d68
move the execution of external metadata-gathering commands from fsindexer to internfile for consistency of handling with filter-generated metadata
2013-09-06 11:51:00 +02:00
Jean-Francois Dockes
e515b9a9f9
Ignore empty mime_type extended attribute returned by bug in CIFS linux kernel module (samba mounts)
2013-06-17 11:52:04 +02:00
Jean-Francois Dockes
e77c510dfe
Store original working directory before changing, for later turning user args into absolute paths
2013-06-06 14:05:26 +02:00
Jean-Francois Dockes
92331bfcbc
added recursive reindex option to recollindex command line
2013-04-29 08:50:54 +02:00
Jean-Francois Dockes
555487ed98
none
2013-04-29 08:13:51 +02:00
Jean-Francois Dockes
62ca9549a3
fixed a few connections in subdocs aux reslist
2013-04-25 14:21:24 +02:00
Jean-Francois Dockes
2b80c77c23
Add possibility to display a list of sub-documents for a given result
2013-04-24 16:33:53 +02:00
Jean-Francois Dockes
906e58feff
added code to purge obsolete messages when a compound document (esp. mbox) is shortened and a partial update is performed (no general purge). Else the orphan docs remained in the index potentially forever (needed actual reindex of the file by a full pass to go away)
2013-04-22 11:32:49 +02:00
Jean-Francois Dockes
f05cae7344
arrange so that a default first indexing run for a given user runs a quick shallow pass, so that queries return some results quickly after indexing starts, avoiding user frustration
2013-04-18 14:29:32 +02:00
Jean-Francois Dockes
caf5d105df
Add support for indexing symlinks as such. Use inode/directory instead of application/x-fsdirectory as directory mime type. Fix stat/lstat usage in places (wipedir)
2013-04-17 18:34:02 +02:00
Jean-Francois Dockes
a74d885309
use st_ctime always to detect file changes
2013-03-31 08:36:11 +02:00
Jean-Francois Dockes
3736c00a3b
more small fnmatch() fixes
2013-03-27 16:56:54 +01:00
Jean-Francois Dockes
4a870f984d
Add parameter to chdir on recollindex startup, /tmp by default. Closes issue #129
2013-03-07 11:05:54 +01:00
Jean-Francois Dockes
3c80e51940
simplified temp file handling for compressed documents and, for querying, implemented caching for last file uncompressed
2013-03-06 18:52:57 +01:00
Jean-Francois Dockes
d3631b5ddf
cleaned up processing of metadata from diverse origins (doc,extattrs,localfields)
2013-01-29 14:33:57 +01:00
Jean-Francois Dockes
e8afc66d8f
fix compile issue with idxthreads enabled
2013-01-28 09:45:33 +01:00
Jean-Francois Dockes
6c4cdbb5d4
test "mime_type" extended attribute before trying other type recognition methods
2013-01-23 11:55:15 +01:00
Jean-Francois Dockes
4272939c3d
add possibility to import tags from external app, e.g. tmsu
2013-01-22 18:30:18 +01:00
Jean-Francois Dockes
cbc269abb1
define new searchdataclausepath to replace the old dir: filtering mechanism. ORing dirs now works
2013-01-05 16:21:30 +01:00
Jean-Francois Dockes
49ce79d6a6
build with lib moved to recoll subdir works for debian
2012-12-30 16:11:51 +01:00
Jean-Francois Dockes
d2f7f11715
Use dynamic lib for shared recoll code
2012-12-29 14:27:01 +01:00
Jean-Francois Dockes
7ee94dbf3e
traces
2012-12-28 16:31:10 +01:00
Jean-Francois Dockes
a0e273ddd4
Drain the job pipeline in fsindexer::purgefiles like was done in fsindexer::indexfiles as there is no warranty that the latter will be called before a db::close for each iteration of the monitor loop
2012-12-15 09:21:50 +01:00
Jean-Francois Dockes
72927b3fc8
additional cleanup after switch from .beagle to .recollweb
2012-12-14 13:20:09 +01:00