118 Commits

Author SHA1 Message Date
Jean-Francois Dockes
0840daf20e Avoid replacing (instead of concatenating) the current author field value with the internal one when the document is a top-level one. This allows metadata from metadatacmds to be used 2015-08-06 08:08:36 +02:00
Jean-Francois Dockes
4d35cbabfb Also index non-html files from the web queue and fix the Open operation for them 2015-07-24 16:30:13 +02:00
Jean-Francois Dockes
d630cbbaec Delete RCL_USE_XATTR configure/compile time variable, it was not
useful. Add configuration variable to use mtime instead of ctime for update
detection. Useful on a system where xattrs would be modified but not
indexed, to avoid excessive reindexing.
2014-12-09 11:15:17 +01:00
Jean-Francois Dockes
4ac34cb134 Off by one error in maximum embedding depth test caused overflow of FileInterner m_tmpflgs temp flags array and possibly bus error depending on arch (only seen on 32 bits arch) 2014-05-15 15:15:01 +02:00
Jean-Francois Dockes
9487a0cffa Code for reaping xattrs and cmd metadata did not need to be implemented as internfile members and can be used in other contexts 2013-10-03 09:38:35 +02:00
Jean-Francois Dockes
ebe9b44a2c fix metadatacmds multifield modif, didnt set anything at all... 2013-09-27 13:04:05 +02:00
Jean-Francois Dockes
3fbcbc8c2b allow multiple field output from metadatacmds entry beginning with rclmulti. Add noxattrfields config variable to allow disabling extended attributes usage 2013-09-27 12:07:32 +02:00
medoc
641acd3d68 move the execution of external metadata-gathering commands from fsindexer to internfile for consistency of handling with filter-generated metadata 2013-09-06 11:51:00 +02:00
Jean-Francois Dockes
243ac82526 missing return statement... 2013-05-26 15:25:16 +02:00
Jean-Francois Dockes
a1b7018cfd Fix problems which occurred when using functions like open-parents with multiple indexes containing identical paths (udis) 2013-05-25 11:26:57 +02:00
Jean-Francois Dockes
167c8a4286 fix minor issues in multisave and popup menus 2013-04-28 16:58:05 +02:00
Jean-Francois Dockes
a7728ceb91 changed the mime handler cache key (was the mime type), to avoid having multiple copies of the same filter when applied to different mime types. This reduces a lot the number of processes during indexing, with no impact on performance 2013-04-25 18:18:48 +02:00
Jean-Francois Dockes
2b80c77c23 Add possibility to display a list of sub-documents for a given result 2013-04-24 16:33:53 +02:00
Jean-Francois Dockes
3c80e51940 simplified temp file handling for compressed documents and, for querying, implemented caching for last file uncompressed 2013-03-06 18:52:57 +01:00
Jean-Francois Dockes
50135e3428 process extended attributes by default 2013-02-19 16:12:24 +01:00
Jean-Francois Dockes
d3631b5ddf cleaned up processing of metadata from diverse origins (doc,extattrs,localfields) 2013-01-29 14:33:57 +01:00
Jean-Francois Dockes
d2f7f11715 Use dynamic lib for shared recoll code 2012-12-29 14:27:01 +01:00
Jean-Francois Dockes
2d5c2a8058 split the iDocToFile method into static and member parts for use from python module 2012-12-20 11:15:10 +01:00
Jean-Francois Dockes
5fc8f240fe from 1.18 branch: Adjust things for using the new Firefox plugin: remove visible Beagle references + fix 1.18 web queue indexing bugs 2012-11-01 11:30:39 +01:00
Jean-Francois Dockes
ee7d0f2ee7 1st parallel multithreaded version of indexing which can do my home without crashing... Let's checkpoint 2012-11-01 11:19:48 +01:00
Jean-Francois Dockes
b8963db4b1 cleaned up the missing helper storage class 2012-10-28 16:43:19 +01:00
Jean-Francois Dockes
95ef518ec7 the missing filter detection code was broken 2012-10-23 19:40:51 +02:00
Jean-Francois Dockes
5add2e2384 Arrange so we can now open the parent of a document (e.g. chm file instead of temp copy of html page inside chm), even when the parent is itself embedded in an archive 2012-10-12 16:54:52 +02:00
Jean-Francois Dockes
8e1ed842d2 message 2012-10-09 14:52:32 +02:00
Jean-Francois Dockes
1329265b7b check for empty file name in internfile, else gets stuck later because empty fn is interpreted as read stdin in md5 2012-10-05 16:42:13 +02:00
"Jean-Francois Dockes ext:(%22)
2870274f80 slightly simplified temp file handling 2012-08-21 08:35:39 +02:00
Jean-Francois Dockes
643f4d56bb internals: virtualized the doc fetcher interface 2012-06-05 07:16:11 +02:00
Jean-Francois Dockes
8b34610dde Cleaned up file name handling. Fixes that file names were sometimes indexed split, sometimes not. They now always are both, with different prefixes. Forces reindex 2012-04-13 09:18:08 +02:00
Jean-Francois Dockes
ec7b40a52e cosmetics: list -> vector in more places 2012-04-11 19:58:08 +02:00
Jean-Francois Dockes
78bd8d63da use vector instead of list for execmd arg list 2012-04-11 15:36:49 +02:00
Jean-Francois Dockes
85166c93b2 Changed the way we handle document sizes. The fbytes field should now be in most cases the most "natural" document size. pcbytes holds the top external container size and dbytes the text size 2012-03-07 15:39:30 +01:00
Jean-Francois Dockes
638d468796 clarified the use of string keys inside the Filter metaData array 2012-03-07 10:13:46 +01:00
Jean-Francois Dockes
516863b5d6 GUI: perform up to date check before previewing a subdoc. This is for example to avoid showing the wrong message if a mail folder has been compacted 2012-01-20 17:48:55 +01:00
Jean-Francois Dockes
036937e8bf added getmeta() method to Rcl::Doc and use in misc places 2012-01-20 14:48:50 +01:00
Jean-Francois Dockes
1931595637 GUI: added menu entry to show all the mime types actually indexed (by content) 2011-11-25 19:47:56 +01:00
Jean-Francois Dockes
38e0957962 const string cleanup 2011-10-01 16:39:38 +02:00
Jean-Francois Dockes
487b623faf log 2011-10-01 09:31:38 +02:00
Jean-Francois Dockes
424e4173ba threading cleanup: add mutex protection around moronic change to transcode. Add mutex to equiv issue in unac. Rename const strings everywhere to cstr_xx to ease future detection of potentially problematic static variables. Most probably close issue #65 2011-09-28 15:01:14 +02:00
"Jean-Francois Dockes ext:(%22)
6b04fe7f2c The record for an attachment for which conversion failed (ie: image without exiftool) would erase the message's record because its ipath was not updated 2011-07-16 11:53:54 +02:00
Jean-Francois Dockes
67ad817e52 internfile: revert 2314:17098b627784 which was unneeded and wrong 2011-06-22 17:49:51 +02:00
Jean-Francois Dockes
ce44c0a875 preview: use the index idea of the mime type after decompression instead or re-running mimetype(). This will fix preview for compressed man pages (which were identified as text/troff after decomp because not under man/ 2011-06-22 16:09:55 +02:00
Jean-Francois Dockes
631121e24e internfile: keep around temp file for possible caller use 2011-05-09 07:00:34 +02:00
Jean-Francois Dockes
55f124725f Fix problems that occurred when multiple threads were trying to read/convert files at the same time (ie: indexing and previewing threads in the GUI calling internfile()). Either get rid of or lock-protect all shared data, eliminate misc initialization possible conflicts by using static initializers. Hopefuly closes issue #51 2011-04-28 10:58:33 +02:00
Jean-Francois Dockes
b28eaf23fb Got rid of all the old RCS id strings 2011-04-27 08:22:17 +02:00
Jean-Francois Dockes
2d8e57ee4f Gui preview, internfile: handle case where target doc of a compound ipath still needs further translation (is not text or html) 2011-04-26 08:26:09 +02:00
Jean-Francois Dockes
f4c1c3678d indexing: an error on an archive member could crash or block the indexing because of the unclean way the ipath was passed in/out of internfile(). Closes issue #55 2011-04-25 16:41:43 +02:00
Jean-Francois Dockes
7eb182f53c index: escape colon characters inside ipaths. This could potentially happen with the zip (ie: zipped maildir) and chm filters 2011-03-12 12:03:39 +01:00
Jean-Francois Dockes
e1a20aa810 got rid of accesses to global config through getMainConfig() 2011-03-02 13:47:07 +01:00
Jean-Francois Dockes
320a869d6e Indexing filters: somewhat clarified and unified some charset-related parameters 2011-02-01 15:04:49 +01:00
Jean-Francois Dockes
6ebc4b4fad fix r2093 which broke indexallfilenames 2010-12-15 15:45:24 +01:00