205 Commits

Author SHA1 Message Date
Jean-Francois Dockes
c7a241d26e htmlparse: merged some updates from xapian 1.2.6 2011-06-24 10:41:54 +02:00
Jean-Francois Dockes
67ad817e52 internfile: revert 2314:17098b627784 which was unneeded and wrong 2011-06-22 17:49:51 +02:00
Jean-Francois Dockes
ce44c0a875 preview: use the index idea of the mime type after decompression instead of re-running mimetype(). This will fix preview for compressed man pages (which were identified as text/troff after decompression because they were not under man/) 2011-06-22 16:09:55 +02:00
Jean-Francois Dockes
ba5e0c41b4 index: fixed the way we process some mime type aliases, which resulted in accumulating handlers in the handler cache 2011-06-21 19:18:55 +02:00
Jean-Francois Dockes
631121e24e internfile: keep around temp file for possible caller use 2011-05-09 07:00:34 +02:00
Jean-Francois Dockes
c45cdd7561 common data locking: remove deadlock in mbox cache locking 2011-04-28 14:28:19 +02:00
Jean-Francois Dockes
55f124725f Fix problems that occurred when multiple threads were trying to read/convert files at the same time (e.g. indexing and previewing threads in the GUI calling internfile()). Either get rid of or lock-protect all shared data, eliminate miscellaneous possible initialization conflicts by using static initializers. Hopefully closes issue #51 2011-04-28 10:58:33 +02:00
Jean-Francois Dockes
b28eaf23fb Got rid of all the old RCS id strings 2011-04-27 08:22:17 +02:00
Jean-Francois Dockes
2d8e57ee4f Gui preview, internfile: handle case where target doc of a compound ipath still needs further translation (is not text or html) 2011-04-26 08:26:09 +02:00
Jean-Francois Dockes
f4c1c3678d indexing: an error on an archive member could crash or block the indexing because of the unclean way the ipath was passed in/out of internfile(). Closes issue #55 2011-04-25 16:41:43 +02:00
Jean-Francois Dockes
52fda2a075 GUI: lock handler cache against multiple thread access 2011-04-24 08:47:27 +02:00
Jean-Francois Dockes
7eb182f53c index: escape colon characters inside ipaths. This could potentially happen with the zip (e.g. zipped maildir) and chm filters 2011-03-12 12:03:39 +01:00
Jean-Francois Dockes
e1a20aa810 got rid of accesses to global config through getMainConfig() 2011-03-02 13:47:07 +01:00
Jean-Francois Dockes
292859a3ac Index: improve processing/rejection for binary files disguised as scripts (e.g. shar archives). Use "internal text/plain" instead of "exec rcltext" for script files so that normal text/plain processing is done (max size, splits). Reject text if more than 25% iconv errors 2011-03-01 08:39:30 +01:00
Jean-Francois Dockes
93a761785a mh_execm: send/receive charset-related parms (no filter uses them for now) 2011-02-01 19:16:32 +01:00
Jean-Francois Dockes
320a869d6e Indexing filters: somewhat clarified and unified some charset-related parameters 2011-02-01 15:04:49 +01:00
Jean-Francois Dockes
91e740074e mh_execm: removed incorrect subdocerrors handling leftover from previous change 2011-01-31 09:31:35 +01:00
Jean-Francois Dockes
9b26100e6a comment 2011-01-29 16:18:37 +01:00
Jean-Francois Dockes
d80f4478fc Support thunderbird naked "^From $" separators 2011-01-11 18:36:40 +01:00
Jean-Francois Dockes
fccc9a590f mimehandler: accept additional parameter from config after internal for using different mime type 2011-01-08 19:22:09 +01:00
Jean-Francois Dockes
6ebc4b4fad fix r2093 which broke indexallfilenames 2010-12-15 15:45:24 +01:00
Jean-Francois Dockes
52e845a9fb debug traces: add is_unknown() method to filters to help with pointing out unhandled mime types 2010-12-14 18:21:39 +01:00
Jean-Francois Dockes
084740cd2b simplified the mbox-reading code 2010-11-30 15:21:44 +01:00
Jean-Francois Dockes
629e62e2b8 mbox: test driver improved 2010-11-30 11:35:21 +01:00
Jean-Francois Dockes
2f837a89b3 fix thunderbird hack breakage for 1,14,4 2010-11-29 22:43:41 +01:00
Jean-Francois Dockes
34151006fe Index: add call to get rid of filter subprocesses at end of indexing (for the GUI thread) 2010-11-23 19:35:44 +01:00
Jean-Francois Dockes
f10e14658f add quirks flag to handle bad thunderbird mbox formats 2010-11-22 16:25:25 +01:00
Jean-Francois Dockes
f4aa7fcec4 properly differentiate between subdoc or fatal error in execm protocol 2010-11-22 15:50:14 +01:00
Jean-Francois Dockes
21e16bb8a2 comments 2010-11-22 15:49:09 +01:00
Jean-Francois Dockes
061ffda545 checked/changed all sprintf calls 2010-11-15 11:57:39 +01:00
Jean-Francois Dockes
5b8bb28c5d define renameormove() function and use it in internfile 2010-09-27 12:59:42 +02:00
Jean-Francois Dockes
0fa92899f9 gcc44 compile 2010-09-23 19:05:11 +02:00
Jean-Francois Dockes
e8671c83be mh_execm: get rid of child process in case of error, for a clean start on next request (handle rclzip crash on encrypted files) 2010-09-23 18:32:52 +02:00
Jean-Francois Dockes
ad4f24923f uncompress file before starting external viewer except if in the nouncompforviewmts list 2010-09-20 10:35:26 +02:00
Jean-Francois Dockes
e327ea0b4e make stringtotokens use vector 2010-09-13 14:17:22 +02:00
Jean-Francois Dockes
716b75dc45 Send file mime type as parameter to help rclaudio 2010-09-12 19:29:49 +02:00
Jean-Francois Dockes
53ec6fefc2 comment 2010-09-12 19:19:34 +02:00
Jean-Francois Dockes
8279d0f425 adjusted log messages 2010-09-10 19:02:04 +02:00
Jean-Francois Dockes
6aeef686fc Created class to simplify temp directory management 2010-07-24 12:08:06 +02:00
Jean-Francois Dockes
e5f41aeb05 Add large file support 2010-07-16 17:08:07 +02:00
Jean-Francois Dockes
c7c3f2e269 html files would still be indexed even when filtered out by indexedmimetypes because the handler cache was searched before using the config data 2010-07-09 09:21:07 +02:00
Jean-Francois Dockes
e6d5f72886 added the possibility to extract arbitrary mail headers and use them as document fields. This forced an incompatible change in the format of the [stored] section inside the "fields" config file 2010-07-06 17:16:36 +02:00
Jean-Francois Dockes
1b8ce2fc72 restore indexallfilenames==0 function 2010-07-05 15:26:33 +02:00
Jean-Francois Dockes
162cb7fe98 Ensure that GUI "Open" works with firefox/beagle cache data: use the backend-agnostic internfile interface in iDocToFile 2010-06-09 16:34:59 +02:00
Jean-Francois Dockes
b87a23bfca separated out the cache access part from beaglequeueindexer. this avoids having to link the pure query programs with indexing code 2010-02-05 12:46:41 +01:00
dockes
bf3ac8e053 small amd64 fixes: 64 bits size_type, signed chars 2009-12-13 16:13:59 +00:00
dockes
93d0aa3fee add <pre> tag to text/plain translated into qt html to preserve indentation. Removes need for rcltext (which did just this). Allow specifying any text/xxx as internal (allows having specific editor but no filter) 2009-11-30 10:04:27 +00:00
dockes
e7b2bc4b46 new glibc missing includes 2009-11-28 09:15:46 +00:00
dockes
a029de8be9 set defaults usedesktoprefs, maxtext 20mb pagesz 1000k webcache 40m 2009-11-28 08:14:05 +00:00
dockes
457e7d2e17 converted iostream to stdio because of mysterious read errors at the last position in the offsets file 2009-11-28 06:39:06 +00:00