57 Commits

Author SHA1 Message Date
Jean-Francois Dockes
088f1f9a9f try out modification to use container file name for display, but not search, purposes when the subdocument file name is not set 2014-07-16 17:05:26 +02:00
Jean-Francois Dockes
cc052a31f8 don't add the top container file name to the subdocs terms 2014-06-12 14:17:18 +02:00
Jean-Francois Dockes
e867f855ad get rid of numerous probably innocuous valgrind/helgrind messages by ensuring that actual string copies are passed between threads, without refcount/shared data magic 2014-05-05 19:01:58 +02:00
Jean-Francois Dockes
56a56500c1 Handle partial indexing of document restricted to metadata from extended attributes 2013-10-04 10:57:11 +02:00
Jean-Francois Dockes
f49264a3f5 message 2013-09-27 13:04:23 +02:00
Jean-Francois Dockes
9cb5371820 get rid of the OFFTPC nonsense and cast to long long use lld to print off_t 2013-09-26 14:35:50 +02:00
medoc
641acd3d68 move the execution of external metadata-gathering commands from fsindexer to internfile for consistency of handling with filter-generated metadata 2013-09-06 11:51:00 +02:00
Jean-Francois Dockes
2b80c77c23 Add possibility to display a list of sub-documents for a given result 2013-04-24 16:33:53 +02:00
Jean-Francois Dockes
906e58feff added code to purge obsolete messages when a compound document (esp. mbox) is shortened and a partial update is performed (no general purge). Else the orphan docs remained in the index potentially forever (needed actual reindex of the file by a full pass to go away) 2013-04-22 11:32:49 +02:00
Jean-Francois Dockes
f05cae7344 arrange so that a default first indexing run for a given user runs a quick shallow pass, so that queries return some results quickly after indexing starts, avoiding user frustration 2013-04-18 14:29:32 +02:00
Jean-Francois Dockes
a74d885309 use st_ctime always to detect file changes 2013-03-31 08:36:11 +02:00
Jean-Francois Dockes
3736c00a3b more small fnmatch() fixes 2013-03-27 16:56:54 +01:00
Jean-Francois Dockes
3c80e51940 simplified temp file handling for compressed documents and, for querying, implemented caching for last file uncompressed 2013-03-06 18:52:57 +01:00
Jean-Francois Dockes
d3631b5ddf cleaned up processing of metadata from diverse origins (doc,extattrs,localfields) 2013-01-29 14:33:57 +01:00
Jean-Francois Dockes
e8afc66d8f fix compile issue with idxthreads enabled 2013-01-28 09:45:33 +01:00
Jean-Francois Dockes
4272939c3d add possibility to import tags from external app, e.g. tmsu 2013-01-22 18:30:18 +01:00
Jean-Francois Dockes
7ee94dbf3e traces 2012-12-28 16:31:10 +01:00
Jean-Francois Dockes
a0e273ddd4 Drain the job pipeline in fsindexer::purgefiles like was done in fsindexer::indexfiles as there is no warranty that the latter will be called before a db::close for each iteration of the monitor loop 2012-12-15 09:21:50 +01:00
Jean-Francois Dockes
72927b3fc8 additional cleanup after switch from .beagle to .recollweb 2012-12-14 13:20:09 +01:00
Jean-Francois Dockes
4afde68f9e more config isolation. Still crashing 2012-11-30 16:45:02 +01:00
Jean-Francois Dockes
9146929f5e implemented configuration for thread structure. Still crashes, but not often... 2012-11-29 17:26:48 +01:00
Jean-Francois Dockes
f252050247 ensure that indexing threads operations are transparent to high level callers such as the real time monitor 2012-11-26 15:40:31 +01:00
Jean-Francois Dockes
9079258f91 Observe followSymlinks in indexfiles() 2012-11-26 09:14:51 +01:00
Jean-Francois Dockes
cc79ee3f73 threads: added recoll_threadinit calls to block signals 2012-11-23 13:36:05 +01:00
Jean-Francois Dockes
ee7d0f2ee7 1st parallel multithreaded version of indexing which can do my home without crashing... Let's checkpoint 2012-11-01 11:19:48 +01:00
Jean-Francois Dockes
e0bc65bfdd small mods innocuous or auxiliary to case/diac sensitivity but which can live in main branch 2012-09-13 12:25:01 +02:00
Jean-Francois Dockes
ee9dbda9fc comments doc and formatting 2012-08-24 10:26:16 +02:00
Jean-Francois Dockes
8b34610dde Cleaned up file name handling. Fixes that file names were sometimes indexed split, sometimes not. They now always are both, with different prefixes. Forces reindex 2012-04-13 09:18:08 +02:00
Jean-Francois Dockes
9f402d33cb got rid of unused csguess module 2012-04-06 15:14:01 +02:00
Jean-Francois Dockes
85166c93b2 Changed the way we handle document sizes. The fbytes field should now be in most cases the most "natural" document size. pcbytes holds the top external container size and dbytes the text size 2012-03-07 15:39:30 +01:00
Jean-Francois Dockes
9bc2fc8958 Experimented with multithreading the indexing pipeline. Left undef'd as 15%-30% improvement of indexing time does not seem worth the complexity 2012-02-21 17:09:02 +01:00
Jean-Francois Dockes
fccc0ac09c Get rid of the GUI indexing thread. All indexing now done by recollindex, with start/stop from the GUI 2012-02-07 17:07:07 +01:00
Jean-Francois Dockes
516863b5d6 GUI: perform up to date check before previewing a subdoc. This is for example to avoid showing the wrong message if a mail folder has been compacted 2012-01-20 17:48:55 +01:00
Jean-Francois Dockes
38e0957962 const string cleanup 2011-10-01 16:39:38 +02:00
Jean-Francois Dockes
e0aa67f0dc let dir go through indexfiles() (name will be indexed, non recursive) 2011-09-30 08:44:50 +02:00
Jean-Francois Dockes
88685d2e64 search/index: fixed a number of bad conversions to properly deal with text documents bigger than 2GB 2011-07-12 08:28:09 -07:00
Jean-Francois Dockes
b6c73ecdeb debug: improve consistency of log messages about up to date/processed files 2011-06-04 10:18:46 +02:00
Jean-Francois Dockes
478fb84ec4 index: add recollindex -f option to ignore skippedPaths/Names when using recollindex -i. Allows arbitrary (non-recoll) path handling for skipped subtree 2011-05-15 22:19:55 +02:00
Jean-Francois Dockes
55f124725f Fix problems that occurred when multiple threads were trying to read/convert files at the same time (i.e. indexing and previewing threads in the GUI calling internfile()). Either get rid of or lock-protect all shared data, eliminate misc possible initialization conflicts by using static initializers. Hopefully closes issue #51 2011-04-28 10:58:33 +02:00
Jean-Francois Dockes
b28eaf23fb Got rid of all the old RCS id strings 2011-04-27 08:22:17 +02:00
Jean-Francois Dockes
f4c1c3678d indexing: an error on an archive member could crash or block the indexing because of the unclean way the ipath was passed in/out of internfile(). Closes issue #55 2011-04-25 16:41:43 +02:00
Jean-Francois Dockes
52fa332239 1.15.2 2011-02-14 20:30:34 +01:00
Jean-Francois Dockes
6c03417195 Move localfields parsing code from fsindexer to rclconfig for possible reuse 2010-11-22 15:56:14 +01:00
Jean-Francois Dockes
061ffda545 checked/changed all sprintf calls 2010-11-15 11:57:39 +01:00
Jean-Francois Dockes
6aeef686fc Created class to simplify temp directory management 2010-07-24 12:08:06 +02:00
Jean-Francois Dockes
846bec8a73 fix english indexation -> indexing 2010-07-20 09:48:20 +02:00
Jean-Francois Dockes
1b8ce2fc72 restore indexallfilenames==0 function 2010-07-05 15:26:33 +02:00
Jean-Francois Dockes
d0e56e361f slightly improve status reporting during indexing 2010-05-04 17:49:40 +02:00
dockes
1ab7ea0558 use : as separator in localfields value before parsing as confsimple 2009-12-14 09:46:55 +00:00
dockes
ef46cd8087 make very sure ~/.beagle is in the skippedPaths 2009-11-16 16:11:14 +00:00