214 Commits

Author SHA1 Message Date
Jean-Francois Dockes
5fd31172f5 New text to terms processing pipelines: results identical to 1.16 when used with empty stopfile 2011-10-07 07:53:49 +02:00
Jean-Francois Dockes
eda494153e simplify calls to isStop 2011-10-05 17:25:35 +02:00
Jean-Francois Dockes
acb297c9df comments + move the position jump to text_to_words 2011-10-04 16:33:44 +02:00
Jean-Francois Dockes
4ced9bee49 add termDocCnt method 2011-10-04 08:04:17 +02:00
Jean-Francois Dockes
38e0957962 const string cleanup 2011-10-01 16:39:38 +02:00
Jean-Francois Dockes
383468e2fc bump doc create/update messages updates to loginfo so that indexing progress can be monitored with less noise 2011-09-30 08:47:39 +02:00
Jean-Francois Dockes
424e4173ba threading cleanup: add mutex protection around moronic change to transcode. Add mutex to equiv issue in unac. Rename const strings everywhere to cstr_xx to ease future detection of potentially problematic static variables. Most probably close issue #65 2011-09-28 15:01:14 +02:00
Jean-Francois Dockes
e0d211d602 none 2011-09-20 17:16:41 +02:00
Jean-Francois Dockes
ee0d602ab3 Implement anchored searches: terms to be found at a maximum distance of the start or end of the text 2011-09-20 16:42:56 +02:00
Jean-Francois Dockes
c5ff0cdf52 Control memory usage when deleting documents: use idxflushmb as when adding/updating 2011-09-07 19:11:11 +02:00
Jean-Francois Dockes
a380873029 suppress some sources of spurious ellipsises in abstracts 2011-08-24 14:51:59 +02:00
Jean-Francois Dockes
d3fc258d85 avoid generating empty abstract field 2011-08-19 09:20:11 +02:00
"Jean-Francois Dockes ext:(%22)
ebbcc115a8 Allow setting a weight increase for field terms 2011-07-22 16:43:39 +02:00
"Jean-Francois Dockes ext:(%22)
48e86c99b5 GUI restable: fix sorting by file and doc size 2011-07-20 10:44:04 +02:00
Jean-Francois Dockes
469c544915 GUI: allow setting the snippet separator inside abstract (now a real html ellipsis by default) 2011-07-07 11:11:02 +02:00
Jean-Francois Dockes
b6c73ecdeb debug: improve consistency of log messages about up to date/processed files 2011-06-04 10:18:46 +02:00
Jean-Francois Dockes
08a65f5cfc experiment with xapian spell support (not ready yet) + take care of some static init issues showing up on the mac 2011-05-10 10:15:15 +02:00
Jean-Francois Dockes
84d59f18a0 GUI: when opening the index, discriminate errors on the main index from errors on external ones, to avoid starting the initial indexing dialog in the latter case 2011-04-29 16:16:04 +02:00
Jean-Francois Dockes
a4d1689581 try to be more responsive to user interrupts: do not build the aux databases after an interruption, and check for an interruption during the purge pass 2011-04-28 12:27:06 +02:00
Jean-Francois Dockes
55f124725f Fix problems that occurred when multiple threads were trying to read/convert files at the same time (ie: indexing and previewing threads in the GUI calling internfile()). Either get rid of or lock-protect all shared data, eliminate misc initialization possible conflicts by using static initializers. Hopefuly closes issue #51 2011-04-28 10:58:33 +02:00
Jean-Francois Dockes
01f24fa5fd cleaning up static variables 2011-04-27 09:09:01 +02:00
Jean-Francois Dockes
b28eaf23fb Got rid of all the old RCS id strings 2011-04-27 08:22:17 +02:00
Jean-Francois Dockes
963d7c50fd suppressed some overly repeated log messages 2011-03-11 11:49:54 +01:00
Jean-Francois Dockes
26929e9fb9 index: fixed the fix for path elts too long... 2011-02-14 20:30:26 +01:00
Jean-Francois Dockes
bf39719ac3 Indexing: need to truncate pathologically long path elements (would cause add_document error) 2011-02-13 10:07:25 +01:00
Jean-Francois Dockes
93fb51d59b query: add duplication indicator to relevancy rating 2011-01-17 16:04:07 +01:00
Jean-Francois Dockes
85b36d3c34 filename search fields: generate an AND of OR lists out of wildcard expansion instead of a global OR which did not make much sense 2011-01-13 11:47:35 +01:00
Jean-Francois Dockes
0a6063542f Gui: misc event/signals cleanups. No functional changes 2010-12-22 18:07:18 +01:00
Jean-Francois Dockes
45c08165f5 log message format 2010-12-21 10:34:02 +01:00
Jean-Francois Dockes
c79410da94 Move sort/filtering code out of reslist 2010-12-18 15:45:12 +01:00
Jean-Francois Dockes
61348a7731 GUI: got rid of the sort parameters dialog and sort by mime type, replaced by 2 arrows in toolbar for sorting by date, ascending or descending 2010-12-17 13:18:13 +01:00
Jean-Francois Dockes
21c6025ba7 Use a xapian phrase search on the split path for filtering on directory location (much faster than the current method) 2010-12-16 15:53:40 +01:00
Jean-Francois Dockes
7123ba530c When ipath is not a number, interpret it as a file path and index the file name (this is currently for zip and chm files) 2010-11-23 15:18:43 +01:00
Jean-Francois Dockes
061ffda545 checked/changed all sprintf calls 2010-11-15 11:57:39 +01:00
"Jean-Francois Dockes ext:(%22)
7ef7e16ac2 missing include time.h, thks to the anon. reporter on bitbucket. Closes issue #31 2010-11-01 19:37:16 +01:00
Jean-Francois Dockes
ceb996c8fb Implement date: date range filter/searches. Remove restriction on pure negative queries 2010-09-11 12:07:53 +02:00
Jean-Francois Dockes
7f71eaa8e5 fixed transcoded simple file name could not be a stored field, so undisplayable in results 2010-09-09 08:24:31 +02:00
Jean-Francois Dockes
387b66e165 Errors when opening additional indexes for a query are now fatal. They could easily go unnoticed before 2010-07-04 19:28:15 +02:00
Jean-Francois Dockes
b699bbe78f Write data version number at once into empty index. closes #8 2010-05-11 15:02:53 +02:00
Jean-Francois Dockes
4006825961 display more complete stats in spell window 2010-05-08 10:38:13 +02:00
Jean-Francois Dockes
4b75f88955 fix includes for xapian 1.2 compatibility (stays compatible with 1.0) 2010-05-04 15:37:41 +02:00
Jean-Francois Dockes
8b2b00bc72 cosmetics: use derived class for actual splitter instead of callback 2010-02-02 15:33:52 +01:00
dockes
bab030f846 Term expansion: handle field issues inside rcldb::termmatch, ensuring that we take the field name into account for all expansions. Ensures that File Name searches and filename: query language searches work the same, + overall better consistency 2009-12-07 13:27:57 +00:00
dockes
c05deda615 loglevel 2009-11-27 13:11:07 +00:00
dockes
f554960b9b suggest alternate spellings if no results 2009-11-26 14:03:02 +00:00
dockes
217b7018d6 use only match terms to build doc abstract, not all query terms (might save a little effort) 2009-11-26 07:17:40 +00:00
dockes
35f32a18a8 set udi in meta from getDoc(udi) 2009-11-23 16:11:37 +00:00
dockes
dd6acb07cc mh_mail: use truncate_to_word to avoid cutting an utf8 char. rcldb: logdeb text_to_word errors 2009-11-18 10:26:47 +00:00
dockes
6ef7b546f2 beaglequeue indexFiles 2009-11-14 08:21:45 +00:00
dockes
645af1aa55 removed the useless keep_updated flag 2009-11-10 17:32:13 +00:00