18 Commits

Author SHA1 Message Date
freddii
89c7efe682 fixed typos 2021-02-04 17:12:22 +01:00
Jean-Francois Dockes
cd46ba62fc multiword synonyms: fix position wrong by 1, phrase prox to adjacent term failed 2021-01-15 15:42:09 +01:00
Jean-Francois Dockes
aa2f0bfd73 implemented multi-word terms indexing for phrase/prox search on multiword synonyms 2021-01-15 14:13:08 +01:00
Jean-Francois Dockes
5dd8774b3c whitespace and indents only 2020-04-14 09:25:13 +02:00
Jean-Francois Dockes
6999284c42 indent and decls 2020-04-05 13:46:47 +01:00
Jean-Francois Dockes
0cbc46732f Fixed the FSF address 2019-03-04 11:19:14 +01:00
Jean-Francois Dockes
fca5e19504 Remove 'prolonged sound mark' from the end of katakana words 2017-04-21 12:13:10 +02:00
Jean-Francois Dockes
8f77b987ca fixed a few forgotten log statements 2016-07-13 07:47:56 +02:00
Jean-Francois Dockes
50b9194885 add missing std:: qualifiers 2016-06-20 15:35:33 +02:00
Jean-Francois Dockes
04cd868950 Handle the case where unac produces whitespace, which may occur with letter-less accents 2015-08-13 18:22:09 +02:00
Jean-Francois Dockes
94b94593e3 comments and indent 2015-06-09 19:34:15 +02:00
Jean-Francois Dockes
657c65d438 Prevent error caused by trying to add a posting for an empty term (created by unac on really weird data) 2012-11-16 17:41:14 +01:00
Jean-Francois Dockes
913dffc597 added code for unac to perform pure case-folding 2012-08-27 12:40:57 +02:00
Jean-Francois Dockes
ee9dbda9fc comments doc and formatting 2012-08-24 10:26:16 +02:00
Jean-Francois Dockes
0ebfc496d8 add capability to remember page breaks generated by, e.g. pdftotext, and use them to start an external viewer on a match page 2012-08-21 15:03:02 +02:00
Jean-Francois Dockes
0860b559ee get rid of a few garbage terms during indexing. Set a threshold for conversion errors after which we discard the doc. Stabilize the new termproc pipeline but no commongrams for now 2011-10-12 17:55:58 +02:00
Jean-Francois Dockes
4a7ff398b2 comments 2011-10-07 08:05:36 +02:00
Jean-Francois Dockes
5fd31172f5 New text to terms processing pipelines: results identical to 1.16 when used with empty stopfile 2011-10-07 07:53:49 +02:00