353 Commits

Author SHA1 Message Date
Jean-Francois Dockes
e7a86aee47 Repaired problem in commit 577fd9d3a634 which broke case-sensitive indexing term prefix wrapping 2014-07-24 15:29:19 +02:00
Jean-Francois Dockes
19da7020d6 Add new kind of aliases for field names, to be used only at query time 2014-07-23 16:52:08 +02:00
Jean-Francois Dockes
578511c3e2 Add flag qualifying field to be indexed exclusively with prefix 2014-07-23 15:28:16 +02:00
Jean-Francois Dockes
088f1f9a9f try out modification to use container file name for display, but not search, purposes when the subdocument file name is not set 2014-07-16 17:05:26 +02:00
Jean-Francois Dockes
50ee2838dc Use a single Xapian Database object for update and query operations while indexing instead of separately open read/write ones. Separate objects seem to trigger Xapian issues 2014-06-07 11:44:04 +02:00
Jean-Francois Dockes
6574ff514a doc 2014-05-15 15:21:08 +02:00
Jean-Francois Dockes
e867f855ad get rid of numerous probably innocuous valgrind/helgrind messages by ensuring that actual string copies are passed between threads, without refcount/shared data magic 2014-05-05 19:01:58 +02:00
Jean-Francois Dockes
deb4cc8d12 mt indexing: do not pass copies of Xapian::Document between threads: the reference counting is not mt-safe. Replace with pointers 2014-05-05 10:59:50 +02:00
Jean-Francois Dockes
e1b746bcfd restructured needUpdate() 2014-05-06 08:26:38 +02:00
Jean-Francois Dockes
60d3ba11ac add parameter to set max stored length of metadata fields. Previously fixed at 150. Fixes issue #178 2014-03-26 18:43:49 +01:00
Jean-Francois Dockes
8cde199b84 backslashes in abstract could fudge the format of the db data record... 2013-11-07 15:00:43 +01:00
Jean-Francois Dockes
56a56500c1 Handle partial indexing of document restricted to metadata from extended attributes 2013-10-04 10:57:11 +02:00
medoc
841b8cff03 flush the update queue before set_metadata 2013-08-28 11:10:40 +02:00
Jean-Francois Dockes
a1b7018cfd Fix problems which occurred when using functions like open-parents with multiple indexes containing identical paths (udis) 2013-05-25 11:26:57 +02:00
Jean-Francois Dockes
e6402efbfb always set the "haspages" flags when reading doc from index so that the Snippets link can be set even if no abstract is computed 2013-05-22 13:24:31 +02:00
Jean-Francois Dockes
dd6be25f13 fixed bug in path translation feature + confusing messages about reindexing missing files 2013-05-14 08:12:33 +02:00
Jean-Francois Dockes
1805c367ea check the stripchars options when adding an external index 2013-04-29 15:34:17 +02:00
Jean-Francois Dockes
b4c7efe490 Added (unifdefd) code to detect garbage data like undecoded base64 by looking at word length stats 2013-04-27 08:29:55 +02:00
Jean-Francois Dockes
2b80c77c23 Add possibility to display a list of sub-documents for a given result 2013-04-24 16:33:53 +02:00
Jean-Francois Dockes
9aad81c387 code moved around for clarity, no real changes 2013-04-23 15:59:06 +02:00
Jean-Francois Dockes
906e58feff added code to purge obsolete messages when a compound document (esp. mbox) is shortened and a partial update is performed (no general purge). Else the orphan docs remained in the index potentially forever (needed actual reindex of the file by a full pass to go away) 2013-04-22 11:32:49 +02:00
Jean-Francois Dockes
b88c28b856 be consistent in adding sig in value and doc record 2013-04-21 17:45:00 +02:00
Jean-Francois Dockes
f05cae7344 arrange so that a default first indexing run for a given user runs a quick shallow pass, so that queries return some results quickly after indexing starts, avoiding user frustration 2013-04-18 14:29:32 +02:00
Jean-Francois Dockes
48b3aa26c6 don't de-dup empty docs 2013-04-17 09:57:57 +02:00
Jean-Francois Dockes
cf2c0b74db use add_boolean_term instead of add_term where it makes sense 2013-04-16 16:01:06 +02:00
Jean-Francois Dockes
8a1ea35fea add terms for doc md5s 2013-04-16 15:56:49 +02:00
Jean-Francois Dockes
321978bfea replace atol with atoll to fix result size display bug. Also use atoll for time conversions, 2032 is coming... 2013-04-04 20:01:31 +02:00
Jean-Francois Dockes
426481e3af allow wild chars in dir clauses 2013-04-01 16:01:21 +02:00
Jean-Francois Dockes
838ea6d623 got rid of STRIPCHARS compile-time variable to make the code more readable, and corresponding configure option. + make sure that CONFDIR from environment gets translated to absolute path 2013-03-27 17:38:11 +01:00
Jean-Francois Dockes
6a4a385657 added basic facility for result path translation 2013-03-26 10:42:46 +01:00
Jean-Francois Dockes
d3631b5ddf cleaned up processing of metadata from diverse origins (doc,extattrs,localfields) 2013-01-29 14:33:57 +01:00
Jean-Francois Dockes
fe2f32fc38 split the term expansion code out of rcldb.cpp 2013-01-15 10:06:24 +01:00
Jean-Francois Dockes
d9e6030b66 reorganized the term expansion code so that the term explorer works fully with case and diac sensitivity options 2013-01-14 18:06:48 +01:00
Jean-Francois Dockes
f8280c88ca small fixups and compilation issues 2013-01-14 09:57:04 +01:00
Jean-Francois Dockes
29d4d959a5 Add db stats option to term explorer 2013-01-10 15:12:43 +01:00
Jean-Francois Dockes
69aa57d182 wild/reg spe chars defs moved to cstr 2013-01-04 10:06:08 +01:00
Jean-Francois Dockes
edb1bb5eb1 protect the xrdb from multiple accesses from needupdate tests and write thread 2013-01-01 16:20:06 +01:00
Jean-Francois Dockes
1384cd3d05 m_havewriteq was not always initialized, resulting in stray flush call 2012-12-28 16:34:15 +01:00
Jean-Francois Dockes
4c950fd6f8 better stats in workqueue 2012-12-03 19:11:59 +01:00
Jean-Francois Dockes
5ebc277114 fix no threads compile 2012-12-02 14:47:06 +01:00
Jean-Francois Dockes
4afde68f9e more config isolation. Still crashing 2012-11-30 16:45:02 +01:00
Jean-Francois Dockes
3a33a68e2d Use PTMutexLocker in workqueue, and make it restartable 2012-11-30 07:35:03 +01:00
Jean-Francois Dockes
9146929f5e implemented configuration for thread structure. Still crashes, but not often... 2012-11-29 17:26:48 +01:00
Jean-Francois Dockes
f252050247 ensure that indexing threads operations are transparent to high level callers such as the real time monitor 2012-11-26 15:40:31 +01:00
Jean-Francois Dockes
9ba04fc9c7 timing and traces 2012-11-26 09:16:06 +01:00
Jean-Francois Dockes
cc79ee3f73 threads: added recoll_threadinit calls to block signals 2012-11-23 13:36:05 +01:00
Jean-Francois Dockes
657c65d438 Prevent error caused by trying to add a posting for an empty term (created by unac on really weird data) 2012-11-16 17:41:14 +01:00
Jean-Francois Dockes
cd53c0a536 Multithreaded indexing seems not to crash anymore thanks to locked existence map 2012-11-02 21:43:51 +01:00
Jean-Francois Dockes
ee7d0f2ee7 1st parallel multithreaded version of indexing which can do my home without crashing... Let's checkpoint 2012-11-01 11:19:48 +01:00
Jean-Francois Dockes
9ccb862666 fix prefix stripping in maxYearSpan 2012-10-16 13:36:54 +02:00