32 Commits

Author SHA1 Message Date
Jean-Francois Dockes
8b3792026f Renamed a few extension-less python handlers with a .py extension for consistency 2022-01-14 12:12:22 +01:00
Jean-Francois Dockes
728129e5ce Text splitter: move apos and dash character conversions to unac_except_trans.
This was complicated and caused problems with highlight areas position computations in
plaintorich. Also, simplify the code for processing some dangling characters.
2021-11-02 14:32:38 +01:00
Jean-Francois Dockes
e771a24148 fix pdfattach test 2020-08-20 11:29:12 +02:00
Jean-Francois Dockes
1d7868d93c tests: create the index in the temp directory instead of src tree 2020-08-20 09:15:05 +02:00
Jean-Francois Dockes
061e06a711 deb packaging version bump 2020-08-20 09:14:40 +02:00
Jean-Francois Dockes
1f0296a873 adjust the test config for the new ocr 2020-02-28 11:13:56 +01:00
Jean-Francois Dockes
b43d1b3287 pdf xmp: pdfextrametafix: add method which takes the xml elt as arg instead of the text content 2019-11-14 18:19:33 +01:00
Jean-Francois Dockes
2e801812fe rclpdf: restore pdfextrametafix function and add test 2019-09-04 09:38:11 +02:00
Jean-Francois Dockes
45043b816f add onlyNames config variable for filtering file names 2019-06-17 08:28:14 +02:00
Jean-Francois Dockes
bec40e9a31 test: fix small issue in config introduced by previous change 2019-06-13 16:15:47 +02:00
Jean-Francois Dockes
5ff1a92a51 pdf: ocr: small fixes, plus make pdfocr redefinable in subdirs 2019-06-13 09:47:25 +02:00
Jean-Francois Dockes
4c205e44e0 tests: test the xmp metadata extraction 2019-06-12 19:22:30 +02:00
Jean-Francois Dockes
9c608ec177 added test for excluding text/html 2017-06-08 10:17:42 +02:00
Jean-Francois Dockes
66270c6270 New tests for new noContentSuffixes+- and skippedNames+- variables 2017-02-22 16:06:45 +01:00
Jean-Francois Dockes
3cda808ac4 skip long timeout file while running test set 2016-01-29 13:41:06 +01:00
Jean-Francois Dockes
f344e8fedd first pass at converting the filters for python 2/3 compat 2015-11-06 16:49:03 +01:00
Jean-Francois Dockes
ac453b4ad0 rclpurple: fix for current log format 2014-10-01 11:37:52 +02:00
Jean-Francois Dockes
412a5e6f78 none 2014-06-10 17:40:56 +02:00
Jean-Francois Dockes
030e576cdb add excludedmimetypes configuration variable 2014-05-02 10:07:26 +02:00
Jean-Francois Dockes
56a56500c1 Handle partial indexing of document restricted to metadata from extended attributes 2013-10-04 10:57:11 +02:00
Jean-Francois Dockes
d3631b5ddf cleaned up processing of metadata from diverse origins (doc,extattrs,localfields) 2013-01-29 14:33:57 +01:00
Jean-Francois Dockes
68955d9427 define non default unac_except_trans for tests 2012-10-16 13:35:35 +02:00
Jean-Francois Dockes
d0a1545fff fix a few tests to better run in an utf-8 locale 2012-10-06 15:49:07 +02:00
Jean-Francois Dockes
d29719e0f1 small test fixups 2012-10-06 12:11:51 +02:00
Jean-Francois Dockes
9b273d94e8 ensure that recoll configured with indexStripChars=1 runs as compiled with -DRCL_INDEX_STRIPCHARS
--HG--
branch : CASEDIACSENS
2012-09-15 15:16:20 +02:00
Jean-Francois Dockes
97ad15c42c Added contributed rcltar filter 2012-05-25 17:04:22 +02:00
Jean-Francois Dockes
a4c17941b1 Added a configuration parameter to set specific unaccenting/lowercasing for some characters to be handled differently than would result from using the Unicode database. Exemple: "a with ring above" could be set to be preserved by a Swedish locutor 2012-04-09 12:42:23 +02:00
Jean-Francois Dockes
c53ca49f07 test: html5 meta charset 2012-01-26 19:31:06 +01:00
Jean-Francois Dockes
94989747ba test: okular notes 2012-01-23 21:19:02 +01:00
Jean-Francois Dockes
f1f6d0cf07 rerooted test results 2011-08-24 09:37:02 +02:00
"Jean-Francois Dockes ext:(%22)
38d5f9a2d9 rerooted test results 2011-08-23 10:29:19 +02:00
"Jean-Francois Dockes ext:(%22)
bd25305cee put test config under vc 2011-08-22 10:14:16 +02:00