42 Commits

Author SHA1 Message Date
Jean-Francois Dockes
7aa4edf91b Merged utility file versions with other packages 2016-03-22 13:35:35 +01:00
Jean-Francois Dockes
a4fd4ee5be moved code around to make smallut and pathut less recoll-specific and reusable. No actual changes 2016-03-21 12:55:31 +01:00
Jean-Francois Dockes
ff15f8fb1c Centralize stat calls to ensure consistency of time fields on windows 2016-01-08 11:23:10 +01:00
Jean-Francois Dockes
d942242047 replace all %lld instances 2015-10-03 17:25:17 +02:00
Jean-Francois Dockes
62f4f7bd81 more safexx posix stuff 2015-08-22 08:37:58 +02:00
Jean-Francois Dockes
c6e228b7c6 Prepared windows port by removing a number of spurious reference to unix-specific interfaces, and using some xapian posix adaptor includes 2015-08-19 14:41:10 +02:00
Jean-Francois Dockes
4d35cbabfb Also index non-html files from the web queue and fix the Open operation for them 2015-07-24 16:30:13 +02:00
Jean-Francois Dockes
727c055c86 web queue: fix cache resizing utility + bug in indexer which would skip oldest entry 2015-02-15 16:08:36 +01:00
Jean-Francois Dockes
bf9c813ec9 Web history indexing: fmtime was not saved to the cache, resulting in empty dates in queries 2014-07-08 15:06:47 +02:00
Jean-Francois Dockes
9cb5371820 get rid of the OFFTPC nonsense and cast to long long use lld to print off_t 2013-09-26 14:35:50 +02:00
Jean-Francois Dockes
3c80e51940 simplified temp file handling for compressed documents and, for querying, implemented caching for last file uncompressed 2013-03-06 18:52:57 +01:00
Jean-Francois Dockes
72927b3fc8 additional cleanup after switch from .beagle to .recollweb 2012-12-14 13:20:09 +01:00
Jean-Francois Dockes
4afde68f9e more config isolation. Still crashing 2012-11-30 16:45:02 +01:00
Jean-Francois Dockes
5fc8f240fe from 1.18 branch: Adjust things for using the new Firefox plugin: remove visible Beagle references + fix 1.18 web queue indexing bugs 2012-11-01 11:30:39 +01:00
Jean-Francois Dockes
ee7d0f2ee7 1st parallel multithreaded version of indexing which can do my home without crashing... Let's checkpoint 2012-11-01 11:19:48 +01:00
Jean-Francois Dockes
172fc68f3f Fixed tilde expansion for beaglequeuedir 2012-10-29 15:17:43 +01:00
Jean-Francois Dockes
c030a15780 Remove improper assertion use from beagle cache handling code 2012-09-13 09:44:47 +02:00
Jean-Francois Dockes
ec7b40a52e cosmetics: list -> vector in more places 2012-04-11 19:58:08 +02:00
Jean-Francois Dockes
85166c93b2 Changed the way we handle document sizes. The fbytes field should now be in most cases the most "natural" document size. pcbytes holds the top external container size and dbytes the text size 2012-03-07 15:39:30 +01:00
Jean-Francois Dockes
638d468796 clarified the use of string keys inside the Filter metaData array 2012-03-07 10:13:46 +01:00
Jean-Francois Dockes
38e0957962 const string cleanup 2011-10-01 16:39:38 +02:00
Jean-Francois Dockes
424e4173ba threading cleanup: add mutex protection around moronic change to transcode. Add mutex to equiv issue in unac. Rename const strings everywhere to cstr_xx to ease future detection of potentially problematic static variables. Most probably close issue #65 2011-09-28 15:01:14 +02:00
Jean-Francois Dockes
88685d2e64 search/index: fixed a number of bad conversions to properly deal with text documents bigger than 2GB 2011-07-12 08:28:09 -07:00
Jean-Francois Dockes
b28eaf23fb Got rid of all the old RCS id strings 2011-04-27 08:22:17 +02:00
Jean-Francois Dockes
f4c1c3678d indexing: an error on an archive member could crash or block the indexing because of the unclean way the ipath was passed in/out of internfile(). Closes issue #55 2011-04-25 16:41:43 +02:00
Jean-Francois Dockes
98257f7066 Missing includes for gcc-4.4.5. Closes #34 2010-11-17 07:18:48 +01:00
Jean-Francois Dockes
061ffda545 checked/changed all sprintf calls 2010-11-15 11:57:39 +01:00
Jean-Francois Dockes
6aeef686fc Created class to simplify temp directory management 2010-07-24 12:08:06 +02:00
Jean-Francois Dockes
d0e56e361f slightly improve status reporting during indexing 2010-05-04 17:49:40 +02:00
Jean-Francois Dockes
b87a23bfca separated out the cache access part from beaglequeueindexer. this avoids having to link the pure query programs with indexing code 2010-02-05 12:46:41 +01:00
dockes
a029de8be9 set defaults usedesktoprefs, maxtext 20mb pagesz 1000k webcache 40m 2009-11-28 08:14:05 +00:00
dockes
380d98ff88 store beagle fields before interning the file 2009-11-23 19:51:14 +00:00
dockes
3c40b0bb1e Beaglequeue: simplify index from cache now that udi entries are unique in cache 2009-11-23 16:10:38 +00:00
dockes
7f856f482b only keep the latest entry for a given udi in the cache 2009-11-22 17:27:34 +00:00
dockes
f5637445d6 beaglequeue fully functional, small fixes remaining? 2009-11-18 08:24:06 +00:00
dockes
0a9537fef6 ok with compression 2009-11-18 07:57:47 +00:00
dockes
e636480622 catch cancel exceptions cast by internfile() 2009-11-15 16:41:05 +00:00
dockes
061aa959c6 monitor the beagle queue 2009-11-14 10:29:29 +00:00
dockes
6ef7b546f2 beaglequeue indexFiles 2009-11-14 08:21:45 +00:00
dockes
bbba826c06 1st beagle version with index/preview working 2009-11-13 13:29:34 +00:00
dockes
4503971dd0 integrate beaglequeueindexer for indexing. Work remains for indexfiles() at least 2009-11-13 09:07:18 +00:00
dockes
0fe1574439 2009-11-10 18:11:35 +00:00