34 Commits

Author SHA1 Message Date
Jean-Francois Dockes
eed31f9ef1 html index: throw an exception after parsing in all cases so that the same code path is always used. The previous approach sometimes resulted in a bad charset used for preview 2012-01-25 17:33:41 +01:00
Jean-Francois Dockes
49554e42c2 Factorized common text transcoding code in separate module 2011-10-20 17:53:42 +02:00
Jean-Francois Dockes
38e0957962 const string cleanup 2011-10-01 16:39:38 +02:00
Jean-Francois Dockes
b28eaf23fb Got rid of all the old RCS id strings 2011-04-27 08:22:17 +02:00
Jean-Francois Dockes
320a869d6e Indexing filters: somewhat clarified and unified some charset-related parameters 2011-02-01 15:04:49 +01:00
dockes
bf3ac8e053 small amd64 fixes: 64 bits size_type, signed chars 2009-12-13 16:13:59 +00:00
dockes
229645a0e2 added optional extended file attributes support 2009-01-21 13:55:12 +00:00
dockes
f57d4a91f9 compute md5 checksums for all docs and optionally collapse duplicates in results 2009-01-09 14:56:36 +00:00
dockes
016bd4226e save transcoded html for preview 2008-10-03 06:17:46 +00:00
dockes
a2659b48e4 renamed the html charset values to stick to omega usage 2007-06-19 12:17:07 +00:00
dockes
0c74bd6e36 added open-ended field name handling 2007-06-19 08:36:24 +00:00
dockes
c5ebe00247 improve transcode error printing 2007-05-30 12:31:19 +00:00
dockes
8e51cf42d3 let email attachments inherit date and author from parent message 2007-05-22 08:33:03 +00:00
dockes
1d683ad411 added field/prefixes for author and title + command line query language 2007-01-17 13:53:41 +00:00
dockes
229eb0de78 test data indexing result same terms as 1.6.3 2006-12-15 16:33:15 +00:00
dockes
33c95ef1ba Dijon filters 1st step: mostly working needs check and optim 2006-12-15 12:40:24 +00:00
dockes
be485e8059 allow indexing individual files. Fix pb with preview and charsets (local defcharset ignored) 2005-12-14 11:00:48 +00:00
dockes
0122545ece process text from html files without a </body> tag 2005-12-08 08:44:14 +00:00
dockes
c8e18ccc81 previous html fix didnt work 2005-12-06 09:40:18 +00:00
dockes
d2b54d6af2 fix nasty html parse bug introduced in 1.0.9 2005-12-06 08:35:48 +00:00
dockes
ae8ff5abb3 *** empty log message *** 2005-11-24 07:16:16 +00:00
dockes
44fb0eb359 improve charset name comparison 2005-11-23 10:16:28 +00:00
dockes
ad67a6cbb7 mimemap processing recentered in rclconfig. Handle directory-local suffix to mime-type definitions. Implement gaim log handling 2005-11-21 14:31:24 +00:00
dockes
6cba3b65c1 restructuring on mimehandler files 2005-11-18 13:23:46 +00:00
dockes
baa0ff491b renamed MimeHandler::worker to mkDoc + comments for doxygen 2005-11-08 21:02:55 +00:00
dockes
50b927f65c *** empty log message *** 2005-04-04 13:18:47 +00:00
dockes
04b279dcd5 mail handling 1st working version 2005-03-31 10:04:07 +00:00
dockes
d392d317bb mail ckpt 2005-03-25 09:40:28 +00:00
dockes
1f8fbc0d39 *** empty log message *** 2005-02-04 09:39:44 +00:00
dockes
d0aaf92220 added external filters and pdf handling 2005-02-01 17:20:06 +00:00
dockes
6d35f5430c merged modifs from xapian/omega 0.8.5 2005-01-28 09:37:37 +00:00
dockes
370032740c xapian 0.8.3 2005-01-28 08:46:27 +00:00
dockes
b9bb21f118 sort of indexes html 2005-01-26 13:03:02 +00:00
dockes
0b18276947 ckpt 2005-01-26 11:47:27 +00:00