Jean-Francois Dockes | 16a9d8eba8 | fix span trimming loop when underscoreasletter is set | 2020-09-13 17:53:59 +02:00
Jean-Francois Dockes | 60e9949663 | texsplit test driver: add options for korean tagger | 2020-05-06 15:27:27 +02:00
Jean-Francois Dockes | 8d92b9debd | trtextsplit: add option for max term length | 2019-09-13 13:01:35 +02:00
Jean-Francois Dockes | 41c9ea92c7 | add test driver for hldata:matchGroup + some help from textsplit | 2019-07-21 19:13:24 +02:00
Jean-Francois Dockes | 6b058e9758 | Regularise processing of hangul characters (there was a mixup of cjk/regular processing), and add a build-time option to either use cjk/ngram or regular term splitting for them | 2019-07-21 19:09:51 +02:00
Jean-Francois Dockes | 2c337caf94 | setup directory for small test and trials programs | 2019-02-01 16:56:15 +01:00