192 Commits

Author SHA1 Message Date
Jean-Francois Dockes
292859a3ac Index: improve processing/rejection for binary files disguising as scripts (ie: shar archives). Use "internal text/plain" instead of "exec rcltext" for script files so that normal text/plain processing is done (max size, splits). Reject text if more than 25% iconv errors 2011-03-01 08:39:30 +01:00
Jean-Francois Dockes
93a761785a mh_execm: send/receive charset-related parms (no filter use them for now) 2011-02-01 19:16:32 +01:00
Jean-Francois Dockes
320a869d6e Indexing filters: somewhat clarified and unified some charset-related parameters 2011-02-01 15:04:49 +01:00
Jean-Francois Dockes
91e740074e mh_execm: removed incorrect subdocerrors handling leftover from previous change 2011-01-31 09:31:35 +01:00
Jean-Francois Dockes
9b26100e6a comment 2011-01-29 16:18:37 +01:00
Jean-Francois Dockes
d80f4478fc Support thunderbird naked "^From $" separators 2011-01-11 18:36:40 +01:00
Jean-Francois Dockes
fccc9a590f mimehandler: accept additional parameter from config after internal for using different mime type 2011-01-08 19:22:09 +01:00
Jean-Francois Dockes
6ebc4b4fad fix r2093 which broke indexallfilenames 2010-12-15 15:45:24 +01:00
Jean-Francois Dockes
52e845a9fb debug traces: add is_unknown() method to filters to help with pointing out unhandled mime types 2010-12-14 18:21:39 +01:00
Jean-Francois Dockes
084740cd2b simplified the mbox-reading code 2010-11-30 15:21:44 +01:00
Jean-Francois Dockes
629e62e2b8 mbox: test driver improved 2010-11-30 11:35:21 +01:00
Jean-Francois Dockes
2f837a89b3 fix thunderbird hack breakage for 1,14,4 2010-11-29 22:43:41 +01:00
Jean-Francois Dockes
34151006fe Index: add call to get rid of filter subprocesses at end of indexing (for the GUI thread) 2010-11-23 19:35:44 +01:00
Jean-Francois Dockes
f10e14658f add quirks flag to handle bad thunderbird mbox formats 2010-11-22 16:25:25 +01:00
Jean-Francois Dockes
f4aa7fcec4 properly differentiate between subdoc or fatal error in execm protocol 2010-11-22 15:50:14 +01:00
Jean-Francois Dockes
21e16bb8a2 comments 2010-11-22 15:49:09 +01:00
Jean-Francois Dockes
061ffda545 checked/changed all sprintf calls 2010-11-15 11:57:39 +01:00
Jean-Francois Dockes
5b8bb28c5d define renameormove() function and use it in internfile 2010-09-27 12:59:42 +02:00
Jean-Francois Dockes
0fa92899f9 gcc44 compile 2010-09-23 19:05:11 +02:00
Jean-Francois Dockes
e8671c83be mh_execm: get rid of child process in case of error, for a clean start on next request (handle rclzip crash on encrypted files) 2010-09-23 18:32:52 +02:00
Jean-Francois Dockes
ad4f24923f uncompress file before starting external viewer except if in the nouncompforviewmts list 2010-09-20 10:35:26 +02:00
Jean-Francois Dockes
e327ea0b4e make stringtotokens use vector 2010-09-13 14:17:22 +02:00
Jean-Francois Dockes
716b75dc45 Send file mime type as parameter to help rclaudio 2010-09-12 19:29:49 +02:00
Jean-Francois Dockes
53ec6fefc2 comment 2010-09-12 19:19:34 +02:00
Jean-Francois Dockes
8279d0f425 adjusted log messages 2010-09-10 19:02:04 +02:00
"Jean-Francois Dockes ext:(%22)
6aeef686fc Created class to simplify temp directory management 2010-07-24 12:08:06 +02:00
"Jean-Francois Dockes ext:(%22)
e5f41aeb05 Add large file support 2010-07-16 17:08:07 +02:00
Jean-Francois Dockes
c7c3f2e269 html files would still be indexed even when filtered out by indexedmimetypes because the handler cache was searched before using the config data 2010-07-09 09:21:07 +02:00
Jean-Francois Dockes
e6d5f72886 added the possibility to extract arbitrary mail headers and use them as document fields. This forced an incompatible change in the format of the [stored] section inside the "fields" config file 2010-07-06 17:16:36 +02:00
Jean-Francois Dockes
1b8ce2fc72 restore indexallfilenames==0 function 2010-07-05 15:26:33 +02:00
Jean-Francois Dockes
162cb7fe98 Ensure that GUI "Open" works with firefox/beagle cache data: use the backend-agnostic internfile interface in iDocToFile 2010-06-09 16:34:59 +02:00
Jean-Francois Dockes
b87a23bfca separated out the cache access part from beaglequeueindexer. this avoids having to link the pure query programs with indexing code 2010-02-05 12:46:41 +01:00
dockes
bf3ac8e053 small amd64 fixes: 64 bits size_type, signed chars 2009-12-13 16:13:59 +00:00
dockes
93d0aa3fee add <pre> tag to text/plain translated into qt html to preserve indentation. Removes need for rcltext (which did just this). Allow specifying any text/xxx as internal (allows having specific editor but no filter) 2009-11-30 10:04:27 +00:00
dockes
e7b2bc4b46 new glibc missing includes 2009-11-28 09:15:46 +00:00
dockes
a029de8be9 set defaults usedesktoprefs, maxtext 20mb pagesz 1000k webcache 40m 2009-11-28 08:14:05 +00:00
dockes
457e7d2e17 converted iostream to stdio because of mysterious read errors at the last position in the offsets file 2009-11-28 06:39:06 +00:00
dockes
c78a3bb567 add cnf(maildefcharset) to set specific mail default charset (mainly for readpst extracts which are utf-8 but have no charset set) 2009-11-27 13:23:13 +00:00
dockes
17bcce5b98 include sys/stat 2009-11-27 13:08:50 +00:00
dockes
2ad0326ed7 implemented a cache for mbox message header offsets 2009-11-27 12:41:05 +00:00
dockes
d5c80cdf16 Support From "bla bla" (quoted) From lines 2009-11-27 07:07:38 +00:00
dockes
ef7c432841 update test driver 2009-11-27 07:00:21 +00:00
dockes
4cac0f56b5 in FileInterner::FileInterner(Rcl::Doc) (query), declare the BeagleQueue static so that the cache persists between FileInterner objects 2009-11-23 17:37:59 +00:00
dockes
631d564123 comments 2009-11-23 17:36:44 +00:00
dockes
e57408bf7c allow setting attrs on mimeview defs, factorize some code with mhExecFactory 2009-11-21 13:36:56 +00:00
dockes
1cd62ace41 use a confsimple to parse the additional filter attributes 2009-11-21 11:18:02 +00:00
dockes
8118002942 handle REFILTERROR in execm 2009-11-18 12:33:05 +00:00
dockes
dd6acb07cc mh_mail: use truncate_to_word to avoid cutting an utf8 char. rcldb: logdeb text_to_word errors 2009-11-18 10:26:47 +00:00
dockes
120cf441ba Lack of error checking after have_document() in preview case could lead to looping, and cancellation was not checked to make things worse 2009-11-16 16:18:04 +00:00
dockes
7d18c22142 reason msg 2009-11-16 16:10:31 +00:00