Jean-Francois Dockes
21adaca229
Add parameter to truncate all document text to specified length
2018-04-08 10:54:09 +02:00
Jean-Francois Dockes
3d4fd3c62e
When storing doc text, always use a metadata entry. Get rid of the code to
...
store it in the data record. Make storing the default. Add "fetchtext"
parameter to getDoc() to fetch and store the text in doc.text. Make this
accessible from Python. Misc comments and indents.
2018-01-25 13:20:02 +01:00
Jean-Francois Dockes
3d22f2c436
Merge branch 'RECOLL_1_23_MAINT'
2018-01-08 14:30:30 +01:00
Jean-Francois Dockes
e39b27e920
Dir filtering: would miss results if the filtered-for directory was partly upper-case and had a lower-case quasi-homonym
2018-01-08 14:19:48 +01:00
Jean-Francois Dockes
2c76a70c0e
Abstracts: storing raw doc text in user metadata records
2018-01-06 11:38:24 +01:00
Jean-Francois Dockes
b4493ed9e1
Snippets generation: add method for generating from doc stored text. Still needs refining, esp. for phrase/near
2017-12-30 08:43:14 +01:00
Jean-Francois Dockes
503edd5a92
Add function to list failed files to the term explorer
2017-09-03 10:46:11 +02:00
Jean-Francois Dockes
f853f39ef3
Partially revert change treating Katakana as words, going back to n-grams. Did not work well because of separator-less compounds mostly
2017-04-25 10:20:38 +02:00
Jean-Francois Dockes
9661a4431e
wen
2017-04-18 14:39:12 +02:00
Jean-Francois Dockes
bde991c08a
got rid of off_t
2017-02-28 20:36:01 +01:00
Jean-Francois Dockes
0b0385e459
got rid of the STD_SHARED_XX std/tr1 defines
2016-07-13 15:12:25 +02:00
Jean-Francois Dockes
53f57e75c6
rcldb syngroups: allow setting synonyms source after construction
2015-11-21 12:48:09 +01:00
Jean-Francois Dockes
e7a669b668
Support multi-word synonyms and add modifier to turn-off synonyms expansion
2015-08-23 12:15:52 +02:00
Jean-Francois Dockes
d8e8ce6ee2
Implement single-term query-time synonyms
2015-08-22 15:11:07 +02:00
Jean-Francois Dockes
4d1f679eac
Use std[::tr1]::shared_ptr instead of local RefCntr by default
2015-08-09 13:54:24 +02:00
Jean-Francois Dockes
c8d34dc8ba
Prevent highligting of bogus terms in results (prevent path elts, negative queries or internal stuff)
2015-08-08 21:56:45 +02:00
Jean-Francois Dockes
e6c0ca403d
recollindex: do not retry files which previously failed to be indexed, except if they were changed since, or option -k is set
2015-04-23 10:37:37 +02:00
Jean-Francois Dockes
19da7020d6
Add new kind of aliases for field names, to be used only at query time
2014-07-23 16:52:08 +02:00
Jean-Francois Dockes
60d3ba11ac
add parameter to set max stored length of metadata fields. Previously fixed at 150. Fixes issue #178
2014-03-26 18:43:49 +01:00
Jean-Francois Dockes
4e8aa756cb
comment
2014-03-26 18:02:59 +01:00
Jean-Francois Dockes
56a56500c1
Handle partial indexing of document restricted to metadata from extended attributes
2013-10-04 10:57:11 +02:00
Jean-Francois Dockes
334e16bee5
comments
2013-09-29 18:29:53 +02:00
Jean-Francois Dockes
a1b7018cfd
Fix problems which occurred when using functions like open-parents with multiple indexes containing identical paths (udis)
2013-05-25 11:26:57 +02:00
Jean-Francois Dockes
1805c367ea
check the stripchars options when adding an external index
2013-04-29 15:34:17 +02:00
Jean-Francois Dockes
2b80c77c23
Add possibility to display a list of sub-documents for a given result
2013-04-24 16:33:53 +02:00
Jean-Francois Dockes
906e58feff
added code to purge obsolete messages when a compound document (esp. mbox) is shortened and a partial update is performed (no general purge). Else the orphan docs remained in the index potentially forever (needed actual reindex of the file by a full pass to go away)
2013-04-22 11:32:49 +02:00
Jean-Francois Dockes
f05cae7344
arrange so that a default first indexing run for a given user runs a quick shallow pass, so that queries return some results quickly after indexing starts, avoiding user frustration
2013-04-18 14:29:32 +02:00
Jean-Francois Dockes
de82f13ddc
add functions and interface to display the duplicates of a result document
2013-04-17 09:36:46 +02:00
Jean-Francois Dockes
aaa98a1156
make sure that expansions of * done to retrieve all years or all mime types from index are done against the appropriate field section of the main index, not the whole synexpand one
2013-04-13 16:34:09 +02:00
Jean-Francois Dockes
838ea6d623
got rid of STRIPCHARS compile-time variable to make the code more readable, and corresponding configure option. + make sure that CONFDIR from environment gets translated to absolute path
2013-03-27 17:38:11 +01:00
Jean-Francois Dockes
fe2f32fc38
split the term expansion code out of rcldb.cpp
2013-01-15 10:06:24 +01:00
Jean-Francois Dockes
d9e6030b66
reorganized the term expansion code so that the term explorer works fully with case and diac sensitivity options
2013-01-14 18:06:48 +01:00
Jean-Francois Dockes
29d4d959a5
Add db stats option to term explorer
2013-01-10 15:12:43 +01:00
Jean-Francois Dockes
edb1bb5eb1
protect the xrdb from multiple accesses from needupdate tests and write thread
2013-01-01 16:20:06 +01:00
Jean-Francois Dockes
4afde68f9e
more config isolation. Still crashing
2012-11-30 16:45:02 +01:00
Jean-Francois Dockes
9146929f5e
implemented configuration for thread structure. Still crashes, but not often...
2012-11-29 17:26:48 +01:00
Jean-Francois Dockes
f252050247
ensure that indexing threads operations are transparent to high level callers such as the real time monitor
2012-11-26 15:40:31 +01:00
Jean-Francois Dockes
cb167cab0a
none
2012-11-18 11:55:54 +01:00
Jean-Francois Dockes
cd53c0a536
Multithreaded indexing seems not to crash anymore thanks to locked existence map
2012-11-02 21:43:51 +01:00
Jean-Francois Dockes
3d59c6933a
comments
2012-11-02 11:37:23 +01:00
Jean-Francois Dockes
ee7d0f2ee7
1st parallel multithreaded version of indexing which can do my home without crashing... Let's checkpoint
2012-11-01 11:19:48 +01:00
Jean-Francois Dockes
bfeb681574
mimetype T prefix was mishandled for a raw index
2012-10-13 11:08:53 +02:00
Jean-Francois Dockes
c9f6612c10
implemented proper limitation and error reporting in case of truncation for term and query expansions
2012-10-05 12:36:19 +02:00
Jean-Francois Dockes
bfd111ecaa
removed list size truncature on filename expansion
2012-10-05 09:19:42 +02:00
Jean-Francois Dockes
c589419267
Abstracts: improve the way we group terms for quality computation
2012-10-03 11:17:16 +02:00
Jean-Francois Dockes
3f331ebb3e
fix glitch caused by udi prefix change
2012-10-03 08:05:39 +02:00
Jean-Francois Dockes
4a17bac9e3
improve message printed for aspell dictionary creation error
2012-10-02 16:26:18 +02:00
"Jean-Francois Dockes ext:(%22)
af2d031e50
moved snippets generation code from db to query object
2012-09-26 12:13:40 +02:00
"Jean-Francois Dockes ext:(%22)
52bc9f4aa3
merged the case/diac sensitivity code back into trunk
2012-09-25 19:20:24 +02:00
"Jean-Francois Dockes ext:(%22)
ab32062fcc
Separate count and context for snippets in the snippets popup from the default values for the result list
2012-09-23 18:19:43 +02:00