Do not process hangul as words, but as ngrams. Same issues as with Katakana: word separation too hard

This commit is contained in:
Jean-Francois Dockes 2019-07-05 17:57:00 +02:00
parent 4ad8a08030
commit 00eb803f5d

View File

@ -44,8 +44,8 @@
// ngrams
#undef KATAKANA_AS_WORDS
// Same for Korean syllabic
#define HANGUL_AS_WORDS
// Same for Korean syllabic, and same problem, not used.
#undef HANGUL_AS_WORDS
using namespace std;