tris/recoll
5,942 Commits 4 Branches 0 Tags

11 Commits

Author | SHA1 | Message | Date
Jean-Francois Dockes | 06cd2bfd87 | unac: exclude Tamil, move to Unicode 14.0.0, modernize autoxx, fix C build | 2022-09-24 09:14:51 +02:00
Jean-Francois Dockes | 4fdfe04ce5 | independantly->independently | 2019-12-02 10:46:46 +01:00
Jean-Francois Dockes | b6eb3589ba | do not unaccent Bengali characters (process like the Hindi ones) | 2014-07-16 12:47:30 +02:00
medoc | 698affcfc8 | Dont strip diacritics from Hindi Devanagari characters, they are determinant to word meaning | 2013-10-26 18:56:25 +02:00
Jean-Francois Dockes | 913dffc597 | added code for unac to perform pure case-folding | 2012-08-27 12:40:57 +02:00
Jean-Francois Dockes | 0d24b5620b | Make unac suppress combining accents found in input. Input in decomposed form was previously not unaccented | 2011-11-04 21:06:48 +01:00
dockes | 0fc81d26b6 | new unac approach for japanese: dont decompose at all | 2009-01-06 18:40:41 +00:00
dockes | 36919ab728 | no going out of the basic plane! | 2008-12-18 11:58:13 +00:00
dockes | 869d75ee03 | use unicode 5.1.0 + dont unaccent katakana/hiragana. Main change in unicode is that letters ae and o with stroke dont decompose anymore into a+e and o+e we may actually want to restore this if it proves a problem | 2008-12-18 11:04:47 +00:00
dockes | 00b954c4ef | implemented additional case-folding | 2006-01-06 13:10:08 +00:00
dockes | b396d2c39f | initial import | 2006-01-06 13:08:12 +00:00