Jean-Francois Dockes
|
73f2836317
|
korean splitter: add inactive option to split on white space before calling the tagger
|
2020-05-19 09:22:16 +02:00 |
|
Jean-Francois Dockes
|
b63cc1b712
|
Korean splitter script: use python-mecab-ko if possible, else konlpy
|
2020-04-10 14:27:06 +02:00 |
|
Jean-Francois Dockes
|
e8194dea9d
|
comment
|
2020-04-08 09:51:37 +02:00 |
|
Jean-Francois Dockes
|
1afc606718
|
textsplit: break on it.error() not only it.eof(). Seems to make a difference in rare cases? Add Komoran support but this one often fails
|
2020-03-26 09:31:19 +01:00 |
|
Jean-Francois Dockes
|
9719177c82
|
Korean external splitter: add some support for Mecab
|
2020-03-23 16:20:32 +01:00 |
|
Jean-Francois Dockes
|
c9667b5ba7
|
Korean text: sort-of-working version, in need of validation
|
2020-03-22 15:49:24 +01:00 |
|
Jean-Francois Dockes
|
384e3a1087
|
korean textsplit with extern help from konlpy, first step
|
2020-03-22 10:09:50 +01:00 |
|