Jean-Francois Dockes
8b3792026f
Renamed a few extension-less python handlers with a .py extension for consistency
2022-01-14 12:12:22 +01:00
Jean-Francois Dockes
728129e5ce
Text splitter: move apos and dash character conversions to unac_except_trans.
...
This was complicated and caused problems with highlight areas position computations in
plaintorich. Also, simplify the code for processing some dangling characters.
2021-11-02 14:32:38 +01:00
Jean-Francois Dockes
e771a24148
fix pdfattach test
2020-08-20 11:29:12 +02:00
Jean-Francois Dockes
1d7868d93c
tests: create the index in the temp directory instead of src tree
2020-08-20 09:15:05 +02:00
Jean-Francois Dockes
061e06a711
deb packaging version bump
2020-08-20 09:14:40 +02:00
Jean-Francois Dockes
1f0296a873
adjust the test config for the new ocr
2020-02-28 11:13:56 +01:00
Jean-Francois Dockes
b43d1b3287
pdf xmp: pdfextrametafix: add method which takes the xml elt as arg instead of the text content
2019-11-14 18:19:33 +01:00
Jean-Francois Dockes
2e801812fe
rclpdf: restore pdfextrametafix function and add test
2019-09-04 09:38:11 +02:00
Jean-Francois Dockes
45043b816f
add onlyNames config variable for filtering file names
2019-06-17 08:28:14 +02:00
Jean-Francois Dockes
bec40e9a31
test: fix small issue in config introduced by previous change
2019-06-13 16:15:47 +02:00
Jean-Francois Dockes
5ff1a92a51
pdf: ocr: small fixes, plus make pdfocr redefinable in subdirs
2019-06-13 09:47:25 +02:00
Jean-Francois Dockes
4c205e44e0
tests: test the xmp metadata extraction
2019-06-12 19:22:30 +02:00
Jean-Francois Dockes
9c608ec177
added test for excluding text/html
2017-06-08 10:17:42 +02:00
Jean-Francois Dockes
66270c6270
New tests for new noContentSuffixes+- and skippedNames+- variables
2017-02-22 16:06:45 +01:00
Jean-Francois Dockes
3cda808ac4
skip long timeout file while running test set
2016-01-29 13:41:06 +01:00
Jean-Francois Dockes
f344e8fedd
first pass at converting the filters for python 2/3 compat
2015-11-06 16:49:03 +01:00
Jean-Francois Dockes
ac453b4ad0
rclpurple: fix for current log format
2014-10-01 11:37:52 +02:00
Jean-Francois Dockes
412a5e6f78
none
2014-06-10 17:40:56 +02:00
Jean-Francois Dockes
030e576cdb
add excludedmimetypes configuration variable
2014-05-02 10:07:26 +02:00
Jean-Francois Dockes
56a56500c1
Handle partial indexing of document restricted to metadata from extended attributes
2013-10-04 10:57:11 +02:00
Jean-Francois Dockes
d3631b5ddf
cleaned up processing of metadata from diverse origins (doc,extattrs,localfields)
2013-01-29 14:33:57 +01:00
Jean-Francois Dockes
68955d9427
define non default unac_except_trans for tests
2012-10-16 13:35:35 +02:00
Jean-Francois Dockes
d0a1545fff
fix a few tests to better run in an utf-8 locale
2012-10-06 15:49:07 +02:00
Jean-Francois Dockes
d29719e0f1
small test fixups
2012-10-06 12:11:51 +02:00
Jean-Francois Dockes
9b273d94e8
ensure that recoll configured with indexStripChars=1 runs as compiled with -DRCL_INDEX_STRIPCHARS
...
--HG--
branch : CASEDIACSENS
2012-09-15 15:16:20 +02:00
Jean-Francois Dockes
97ad15c42c
Added contributed rcltar filter
2012-05-25 17:04:22 +02:00
Jean-Francois Dockes
a4c17941b1
Added a configuration parameter to set specific unaccenting/lowercasing for some characters to be handled differently than would result from using the Unicode database. Exemple: "a with ring above" could be set to be preserved by a Swedish locutor
2012-04-09 12:42:23 +02:00
Jean-Francois Dockes
c53ca49f07
test: html5 meta charset
2012-01-26 19:31:06 +01:00
Jean-Francois Dockes
94989747ba
test: okular notes
2012-01-23 21:19:02 +01:00
Jean-Francois Dockes
f1f6d0cf07
rerooted test results
2011-08-24 09:37:02 +02:00
"Jean-Francois Dockes ext:(%22)
38d5f9a2d9
rerooted test results
2011-08-23 10:29:19 +02:00
"Jean-Francois Dockes ext:(%22)
bd25305cee
put test config under vc
2011-08-22 10:14:16 +02:00