Jean-Francois Dockes
|
eb58602b5d
|
indent
|
2020-09-28 13:59:53 +02:00 |
|
Jean-Francois Dockes
|
641abf358b
|
autoconfig version
|
2020-09-14 16:14:43 +01:00 |
|
Jean-Francois Dockes
|
16a9d8eba8
|
fix span trimming loop when underscoreasletter is set
|
2020-09-13 17:53:59 +02:00 |
|
Jean-Francois Dockes
|
df09d65a4e
|
add underscoreasletter config variable to process _ as a letter
|
2020-09-13 15:40:28 +02:00 |
|
Jean-Francois Dockes
|
c1ef2187d3
|
Fixed LOG calls obsolescence issues preventing build with staticverbosity 7
|
2020-09-06 14:59:00 +01:00 |
|
Jean-Francois Dockes
|
58a568fb14
|
bumped version
|
2020-09-05 09:32:14 +02:00 |
|
Jean-Francois Dockes
|
5ea2f7cc64
|
recollindex: make sure that the computed lock file path is the same in all cases. Take the lock in a number of forgotten cases
|
2020-08-31 11:28:44 +02:00 |
|
Jean-Francois Dockes
|
9b0ec434b4
|
Small Windows warning fix
|
2020-08-22 09:28:46 +01:00 |
|
Jean-Francois Dockes
|
3f1dfa564c
|
Restore nonumbers number indexing exclusion function
|
2020-08-22 10:07:58 +02:00 |
|
Jean-Francois Dockes
|
1b9dafbbe1
|
bumped version
|
2020-08-15 13:58:22 +02:00 |
|
Jean-Francois Dockes
|
09ad94f3b7
|
removed obsolete test mains Makefiles
|
2020-08-06 11:46:11 +02:00 |
|
Jean-Francois Dockes
|
610e3282c3
|
Fix previous fix about locating lockfile in XDG_RUNTIME_DIR: would always compute the same lock name
|
2020-07-21 10:47:10 +02:00 |
|
Jean-Francois Dockes
|
7771e0669e
|
Bump version to 1.27.3 for small accumulated fixes
|
2020-06-27 14:30:22 +02:00 |
|
Jean-Francois Dockes
|
89375f5e80
|
If XDG_RUNTIME_DIR is set, locate index.pid in it. Thanks to Madhu for providing this fix
|
2020-06-23 09:03:07 +02:00 |
|
Jean-Francois Dockes
|
2d96fea11e
|
Windows: Bad define for localtime_r resulted in wrong date terms generated and date field search failures
|
2020-06-09 14:33:35 +01:00 |
|
Jean-Francois Dockes
|
f3858a7e3a
|
limit max size of korean single-word span
|
2020-05-31 09:57:58 +02:00 |
|
Jean-Francois Dockes
|
560041cab9
|
cleared out errant tabs
|
2020-05-30 15:54:49 +02:00 |
|
Jean-Francois Dockes
|
8d84127059
|
none
|
2020-05-25 09:28:21 +02:00 |
|
Jean-Francois Dockes
|
8ac74ca8f5
|
log levels
|
2020-05-24 14:39:06 +02:00 |
|
Jean-Francois Dockes
|
a5bab94ae3
|
korean splitter: break on digits
|
2020-05-24 14:02:23 +02:00 |
|
Jean-Francois Dockes
|
fc981e3733
|
new variation on the korean splitter. Index both the space-less spans whole and the mecab split output
|
2020-05-22 16:48:05 +02:00 |
|
Jean-Francois Dockes
|
4c39034f5d
|
small tweaks to facilitate the mac homebrew build
|
2020-05-21 09:41:58 +02:00 |
|
Jean-Francois Dockes
|
e61ec4b7af
|
autoconf malloc.h, and clear old c++ conf tests
|
2020-05-20 18:50:43 +02:00 |
|
Jean-Francois Dockes
|
ea2db676ed
|
korean: reactivate option to generate both noun,jx and noun+jx
|
2020-05-19 09:23:03 +02:00 |
|
Jean-Francois Dockes
|
97f3212f80
|
korean splitter: disable the noun+jx emitting thing
|
2020-05-14 09:23:09 +02:00 |
|
Jean-Francois Dockes
|
2f45ceb1dc
|
protect conf_post against double inclusion
|
2020-05-11 07:23:44 +01:00 |
|
Jean-Francois Dockes
|
0379d4fd61
|
bumped version to 1.27.1
|
2020-05-11 07:45:50 +02:00 |
|
Jean-Francois Dockes
|
d58fec0b81
|
korean: for now don't filter tags, until it is better understood what should be done
|
2020-05-11 07:33:54 +02:00 |
|
Jean-Francois Dockes
|
48d4678770
|
experiment: Korean when Noun then JX emit both Noun and Noun+JX
|
2020-04-25 14:19:54 +02:00 |
|
Jean-Francois Dockes
|
2f794be314
|
Fix Windows gcc build. Needs some def to get w7+ windows api
|
2020-04-25 11:41:37 +02:00 |
|
Jean-Francois Dockes
|
07e3387fc1
|
Avoid calling isalpha() with big ints, may crash, depending on version
|
2020-04-25 11:19:52 +02:00 |
|
Jean-Francois Dockes
|
39c152bada
|
Fixed MSVC warnings, all innocuous
|
2020-04-17 14:26:40 +01:00 |
|
Jean-Francois Dockes
|
12ebb7ac6e
|
Windows: deal with non-ASCII user login, non-ascii paths in confdir etc.
|
2020-04-15 14:03:04 +01:00 |
|
Jean-Francois Dockes
|
9565663f09
|
textsplit: create isNGRAMMED() method to replace isCJK() and let the latter actually return what it says
|
2020-04-14 09:27:26 +02:00 |
|
Jean-Francois Dockes
|
eb53b598d6
|
Textsplit: lost char at korean->ascii transition
|
2020-04-10 14:54:13 +01:00 |
|
Jean-Francois Dockes
|
ec7379f837
|
textsplitko: start cmd as python kosplitter.py
|
2020-04-10 14:34:50 +01:00 |
|
Jean-Francois Dockes
|
de246349da
|
textsplit: use more regular test for ISHANGUL. CJK: do not ignore whitespace, break on alphabetic non cjk character
|
2020-04-10 14:28:14 +02:00 |
|
Jean-Francois Dockes
|
6999284c42
|
indent and decls
|
2020-04-05 13:46:47 +01:00 |
|
Jean-Francois Dockes
|
a468406e17
|
windows/qtcreator msvc adjustments
|
2020-04-04 14:00:39 +01:00 |
|
Jean-Francois Dockes
|
7656d1b2ef
|
Merge branch 'master' of https://framagit.org/medoc90/recoll
|
2020-04-03 07:34:41 +01:00 |
|
Jean-Francois Dockes
|
b0fb7612ee
|
some msvc changes
|
2020-04-03 07:33:27 +01:00 |
|
Jean-Francois Dockes
|
afcacf63c0
|
Fix page handling in Korean splitter, bug would shift the byte positions, with bad consequences for snippets
|
2020-03-31 16:11:37 +02:00 |
|
Jean-Francois Dockes
|
7de66aae60
|
Korean splitter: suppress some ctl chars from Komoran input. Better compute pages
|
2020-03-26 18:44:59 +01:00 |
|
Jean-Francois Dockes
|
9b3a5fac12
|
Merge branch 'kopostag'
|
2020-03-26 14:03:17 +01:00 |
|
Jean-Francois Dockes
|
f755505e98
|
bumped version
|
2020-03-26 11:02:37 +01:00 |
|
Jean-Francois Dockes
|
1afc606718
|
textsplit: break on it.error() not only it.eof(). Seems to make a difference in rare cases? Add Komoran support but this one often fails
|
2020-03-26 09:31:19 +01:00 |
|
Jean-Francois Dockes
|
b677171fa8
|
GUI: Experimental: create a list of MIME types (compiled in for now: hwp) for which we prefer to use stored text for preview because extraction is slow
|
2020-03-25 18:13:00 +01:00 |
|
Jean-Francois Dockes
|
97e89c408a
|
korean splitter: only break korean stretch on non-korean alphabetic (e.g. not numbers or punctuation)
|
2020-03-25 16:57:42 +01:00 |
|
Jean-Francois Dockes
|
207bfec93e
|
korean splitter: restart the python/java splitter from time to time because it leaks memory
|
2020-03-24 11:27:10 +01:00 |
|
Jean-Francois Dockes
|
a323472876
|
typo in textsplitko would prevent use of Mecab
|
2020-03-24 08:50:24 +01:00 |
|