12 Commits

Author SHA1 Message Date
Jean-Francois Dockes
c1fad4afc7 Replaced pthread with std:: thread and mutex 2016-07-12 18:08:21 +02:00
Jean-Francois Dockes
1d9047df1a fix linux build of windows branch
--HG--
branch : WINDOWSPORT
2015-08-30 15:50:17 +02:00
Jean-Francois Dockes
b6eb3589ba do not unaccent Bengali characters (process like the Hindi ones) 2014-07-16 12:47:30 +02:00
medoc
55782670c4 Dont strip diacritics from Hindi Devanagari characters, they are determinant to word meaning 2013-10-26 18:56:51 +02:00
Jean-Francois Dockes
913dffc597 added code for unac to perform pure case-folding 2012-08-27 12:40:57 +02:00
Jean-Francois Dockes
a4c17941b1 Added a configuration parameter to set specific unaccenting/lowercasing for some characters to be handled differently than would result from using the Unicode database. Exemple: "a with ring above" could be set to be preserved by a Swedish locutor 2012-04-09 12:42:23 +02:00
Jean-Francois Dockes
0d24b5620b Make unac suppress combining accents found in input. Input in decomposed form was previously not unaccented 2011-11-04 21:06:48 +01:00
Jean-Francois Dockes
424e4173ba threading cleanup: add mutex protection around moronic change to transcode. Add mutex to equiv issue in unac. Rename const strings everywhere to cstr_xx to ease future detection of potentially problematic static variables. Most probably close issue #65 2011-09-28 15:01:14 +02:00
dockes
8c627af212 new unac approach for japanese: dont decompose at all 2008-12-21 13:17:44 +00:00
dockes
0821f0cc29 dont unaccent japanese + fix bug in unac/split ordering in searchdata 2008-12-19 09:44:39 +00:00
dockes
33f54536ed integrated case-folding into unac for better performance 2006-01-06 13:19:38 +00:00
dockes
ab473faa8c unac 1.7.0 2004-12-17 15:04:34 +00:00