45 Commits

Author SHA1 Message Date
Jean-Francois Dockes
19a4b2a287 Do not filter out text/html when it results from a conversion, even if excluded by indexedmimetypes/excludedmimetypes 2017-06-08 10:09:05 +02:00
Jean-Francois Dockes
5863d29e49 debug function 2017-05-12 10:12:48 +02:00
Jean-Francois Dockes
b55f4b3b0a add nomd5types parameter to set file types for which dedup is not that useful and computation is expensive (e.g. audio files). Replace "call parent" misfeature with call to virtual in MimeHandler constructor. Fix log calls indent 2017-02-02 18:09:00 +01:00
Jean-Francois Dockes
90bae886c2 increased max attributes value to 200 2017-01-28 10:01:59 +01:00
Jean-Francois Dockes
217eb388e2 log formats 2017-01-28 10:00:07 +01:00
Jean-Francois Dockes
f6a999de84 logging now uses c++ streams 2016-07-12 09:41:04 +02:00
Jean-Francois Dockes
b9e672abda Allow execm input handlers to set arbitrary data fields 2016-07-11 18:13:39 +02:00
Jean-Francois Dockes
627da5a39b Handler timeout should not interrupt the whole indexing pass 2016-04-14 15:48:01 +02:00
Jean-Francois Dockes
a783ab17dc mh_execm: compute file md5 before activating filter to avoid concurrent open issues on Windows 2015-12-02 10:30:04 +01:00
Jean-Francois Dockes
ffc7d04bdf fix timeouts for mh_execm filters 2015-10-08 10:10:04 +02:00
Jean-Francois Dockes
330c7fc30d Python filters beginning to work, still issues.
--HG--
branch : WINDOWSPORT
2015-09-11 16:16:16 +02:00
Jean-Francois Dockes
06f43c573e 1st successful use of mh_exec (direct antiword exec)
--HG--
branch : WINDOWSPORT
2015-09-11 11:26:53 +02:00
Jean-Francois Dockes
e59bf51445 Impose memory usage limit on external filters. Fixes issue #259 2015-07-18 19:30:49 +02:00
Jean-Francois Dockes
9ba0b3e8bc Replaced RSA md5 code with public domain OpenBSD/debian dpkg version 2015-03-01 14:28:01 +01:00
Jean-Francois Dockes
77cba9d0b3 export RECOLL_CONFDIR to filters 2013-06-06 13:39:48 +02:00
Jean-Francois Dockes
43ee96ae1a comments 2013-04-16 15:54:54 +02:00
"Jean-Francois Dockes ext:(%22)
860521be88 internfile: do not compute md5 when in preview mode 2013-04-09 12:40:46 +02:00
Jean-Francois Dockes
d4edbbaedb rclepub: use elt ids instead of hrefs + debug traces 2012-10-11 15:35:15 +02:00
Jean-Francois Dockes
f624d3b10e doc 2012-10-06 21:04:03 +02:00
Jean-Francois Dockes
2fc294a9c6 factored out common charset handling code in exec and execm, cleaned up charset and textplain handling in mh_mail 2012-10-06 12:14:04 +02:00
Jean-Francois Dockes
29fe1e4927 implemented maxmemberkb limit for multidoc (e.g. archive) members 2012-10-06 09:05:35 +02:00
Jean-Francois Dockes
ec7b40a52e cosmetics: list -> vector in more places 2012-04-11 19:58:08 +02:00
Jean-Francois Dockes
78bd8d63da use vector instead of list for execmd arg list 2012-04-11 15:36:49 +02:00
Jean-Francois Dockes
638d468796 clarified the use of string keys inside the Filter metaData array 2012-03-07 10:13:46 +01:00
Jean-Francois Dockes
a5af2b93bd "md5"->cstr_md5 2012-02-25 10:41:27 +01:00
Jean-Francois Dockes
49554e42c2 Factorized common text transcoding code in separate module 2011-10-20 17:53:42 +02:00
Jean-Francois Dockes
f544b28b4a Transcode mh_execm text/plain output like we do for mh_exec. Adjust handling of transcoding errors. These changes should fix most cases of non-utf8 text making it to unac/index 2011-10-20 14:00:38 +02:00
Jean-Francois Dockes
38e0957962 const string cleanup 2011-10-01 16:39:38 +02:00
Jean-Francois Dockes
b28eaf23fb Got rid of all the old RCS id strings 2011-04-27 08:22:17 +02:00
Jean-Francois Dockes
f4c1c3678d indexing: an error on an archive member could crash or block the indexing because of the unclean way the ipath was passed in/out of internfile(). Closes issue #55 2011-04-25 16:41:43 +02:00
Jean-Francois Dockes
7eb182f53c index: escape colon characters inside ipaths. This could potentially happen with the zip (ie: zipped maildir) and chm filters 2011-03-12 12:03:39 +01:00
Jean-Francois Dockes
e1a20aa810 got rid of accesses to global config through getMainConfig() 2011-03-02 13:47:07 +01:00
Jean-Francois Dockes
93a761785a mh_execm: send/receive charset-related parms (no filter use them for now) 2011-02-01 19:16:32 +01:00
Jean-Francois Dockes
320a869d6e Indexing filters: somewhat clarified and unified some charset-related parameters 2011-02-01 15:04:49 +01:00
Jean-Francois Dockes
91e740074e mh_execm: removed incorrect subdocerrors handling leftover from previous change 2011-01-31 09:31:35 +01:00
Jean-Francois Dockes
f4aa7fcec4 properly differentiate between subdoc or fatal error in execm protocol 2010-11-22 15:50:14 +01:00
Jean-Francois Dockes
e8671c83be mh_execm: get rid of child process in case of error, for a clean start on next request (handle rclzip crash on encrypted files) 2010-09-23 18:32:52 +02:00
Jean-Francois Dockes
e327ea0b4e make stringtotokens use vector 2010-09-13 14:17:22 +02:00
Jean-Francois Dockes
716b75dc45 Send file mime type as parameter to help rclaudio 2010-09-12 19:29:49 +02:00
dockes
e7b2bc4b46 new glibc missing includes 2009-11-28 09:15:46 +00:00
dockes
8118002942 handle REFILTERROR in execm 2009-11-18 12:33:05 +00:00
dockes
ebd10680f8 dont use 0-sized doc to mean eof now 2009-11-13 08:15:19 +00:00
dockes
7f5f5cf761 set mimetype for the non-ipath case 2009-11-11 18:07:01 +00:00
dockes
ce0078081a execm first working zip version 2009-10-14 06:21:44 +00:00
dockes
c8a88029f7 execm persistent filters 2009-10-09 13:58:32 +00:00