162 Commits

Author SHA1 Message Date
Jean-Francois Dockes
816980a1c4 implemented advanced search history feature 2012-10-16 13:37:56 +02:00
Jean-Francois Dockes
5add2e2384 Arrange so we can now open the parent of a document (e.g. chm file instead of temp copy of html page inside chm), even when the parent is itself embedded in an archive 2012-10-12 16:54:52 +02:00
Jean-Francois Dockes
c7a35a176c none 2012-10-12 13:35:21 +02:00
Jean-Francois Dockes
7fcb7c9bf7 ensure chm file can be renamed 2012-10-12 13:34:56 +02:00
Jean-Francois Dockes
d4edbbaedb rclepub: use elt ids instead of hrefs + debug traces 2012-10-11 15:35:15 +02:00
Jean-Francois Dockes
7c18d74541 add epub viewer and set rclaptg meta tag for chm and info 2012-10-11 14:03:30 +02:00
Jean-Francois Dockes
7037e1ca38 fix 8bit file name processing 2012-10-06 12:00:05 +02:00
Jean-Francois Dockes
ff2e12f149 glitch in maxmemberkb handling 2012-10-06 11:59:48 +02:00
Jean-Francois Dockes
29fe1e4927 implemented maxmemberkb limit for multidoc (e.g. archive) members 2012-10-06 09:05:35 +02:00
Jean-Francois Dockes
5b3cb69ee9 let rcldvi and rclps emit ^L page markers for use with %p and evince 2012-10-04 09:49:03 +02:00
Jean-Francois Dockes
b321b0babb skip very big files (50M) in zip tar and rar extractors 2012-10-04 08:22:33 +02:00
Jean-Francois Dockes
2bb14cc6ff none 2012-10-04 08:21:54 +02:00
"Jean-Francois Dockes ext:(%22)
0ebfc496d8 add capability to remember page breaks generated by, e.g. pdftotext, and use them to start an external viewer on a match page 2012-08-21 15:03:02 +02:00
Jean-Francois Dockes
df91cff95f rclsoff: modified to correctly handle exported google docs. Also improves handling regular libreoffice files: spaces were eaten around <span> tags 2012-05-28 09:45:08 +02:00
Jean-Francois Dockes
97ad15c42c Added contributed rcltar filter 2012-05-25 17:04:22 +02:00
Jean-Francois Dockes
eeaf564d4e Handle non-standard file name suffixes during decompression. Recoll should now index arbitrary compressed XML formats. Closes issue #93 2012-05-21 11:50:09 +02:00
Jean-Francois Dockes
cbe7fd21cb rclxml 2012-05-19 09:23:24 +02:00
"Jean-Francois Dockes ext:(%22)
22655319e3 rcldia fix from the author 2012-04-21 20:48:44 +02:00
"Jean-Francois Dockes ext:(%22)
ae01899962 added contributed dia filter 2012-04-03 17:30:08 +02:00
"Jean-Francois Dockes ext:(%22)
544e687afe rclchm: add concatenating mode 2012-04-03 17:29:01 +02:00
"Jean-Francois Dockes ext:(%22)
5f9095b472 Fixed python filter html escaping 2012-04-03 16:46:16 +02:00
Jean-Francois Dockes
8074523a56 rclchm: decode internal urls 2012-03-27 18:51:27 +02:00
Jean-Francois Dockes
fde36ecccc Handle garbled unrtf http-equiv header causing pbs with html5 handler 2012-01-26 19:30:43 +01:00
Jean-Francois Dockes
4c382b00b3 comment 2012-01-23 21:52:46 +01:00
Jean-Francois Dockes
f0a5eb006c okular notes: remove bit of test code 2012-01-23 21:21:11 +01:00
Jean-Francois Dockes
17542969a5 new gnumeric and okular notes filters 2012-01-23 20:25:55 +01:00
Jean-Francois Dockes
dc3aa5d564 stopwords-based charset guessing: use merged dictionary for all words instead of one dictionary per language/charset. Very marginal speed improvement but somewhat cleaner 2012-01-20 14:45:34 +01:00
Jean-Francois Dockes
f9a6be302b karaoke charset guessing: added greek, updated some languages 2012-01-20 14:43:24 +01:00
Jean-Francois Dockes
6d651cf043 karaoke filter/language guesser: use sets to store common words 2012-01-04 16:16:29 +01:00
Jean-Francois Dockes
9aeda04ccb augment the number of test words 10->20, + comments 2012-01-03 21:17:11 +01:00
Jean-Francois Dockes
636b935904 rclchm: use posixpath not path when dealing with internal paths 2011-12-27 17:59:33 +01:00
Jean-Francois Dockes
502f7e783e chm filter: handle files lacking a topics node 2011-12-17 16:41:45 +01:00
Jean-Francois Dockes
5fa720f23d Typo in error-message printing line crashed rclexecm.py 2011-12-17 16:41:16 +01:00
Jean-Francois Dockes
2afc769c38 rclpython: catch exception caused by indentation error in doc 2011-11-28 17:47:02 +01:00
Jean-Francois Dockes
f9f424de42 removed filters replaced by rclaudio/mutagen 2011-11-24 11:59:42 +01:00
Jean-Francois Dockes
ea61e85b8f multi-doc filter: getnext error would cause uncaught exception because of access to uninitialized eof variable 2011-11-04 17:32:14 +01:00
Jean-Francois Dockes
152181123e rcllyx: fixed lyx version number test for lyx 2.0 2011-09-28 15:32:36 +02:00
Jean-Francois Dockes
2c2c0dadf2 comment 2011-09-20 07:35:58 +02:00
"Jean-Francois Dockes ext:(%22)
cdaeba390d rar need special handling for directory entries 2011-08-23 11:03:15 +02:00
Jean-Francois Dockes
4318891b48 added support for rar archives 2011-08-18 16:20:12 +02:00
Jean-Francois Dockes
b9be9e58d5 GUI: clicking an open link or menu entry inside the result table would start the external application 3 times. Closes issue #59 2011-05-14 09:56:58 +02:00
Jean-Francois Dockes
dd8f42253c Improve rcldoc filter and switch back to using it for indexing instead of direct antiword exec. This is slightly slower but it does catch a number of .doc files which would not be indexed otherwise 2011-05-10 09:03:13 +02:00
Jean-Francois Dockes
f1c651deeb comment 2011-05-08 22:20:32 +02:00
Jean-Francois Dockes
6dcf21b8e5 Fixed the man filter to get rid of groff temp files, and add a few possible extensions for man pages. Closes issue #56 2011-04-30 10:55:31 +02:00
Jean-Francois Dockes
d0cb158d26 index: support webarchive (.war) and mimehtml (.mhtml) formats 2011-03-26 17:29:04 +01:00
Jean-Francois Dockes
205fdde5a9 try to handle the special handling of utf-8 paths inside zipfile 2011-03-13 15:13:30 +01:00
Jean-Francois Dockes
8ad41f3fb1 index: add --nonet --novalid options to all xsltproc invocations to avoid 30S timeouts when accessing external dtds (ie: for svg files) 2011-03-01 08:59:05 +01:00
Jean-Francois Dockes
0634650282 karaoke: add russian cp1251 stopwords file for charset identification 2011-02-13 10:22:54 +01:00
Jean-Francois Dockes
a4241cff6a rclkar: renamed files for compat with install script 2011-01-31 20:23:56 +01:00
Jean-Francois Dockes
879225d687 Added language-based helper for classifying iso-8859-x encodings 2011-01-31 09:32:26 +01:00