12 Commits

Author SHA1 Message Date
Jean-Francois Dockes
bbeaebf632 textsplit: process unicode apostrophes and right quotation mark as ascii single quote 2019-02-01 16:10:51 +01:00
Jean-Francois Dockes
f897f087aa HTML: do not concatenate text found before body tag with the title. Fixes issue #125 2013-01-12 14:06:40 +01:00
Jean-Francois Dockes
e6191b51a8 Html: Just ignore opening and closing <body> and <html> tags. Current browsers show text before or after the body and ignore multiple body tags. Not pushed to 1.17 maint because of possible disruption. Closes issue #92 2012-05-16 10:07:09 +02:00
Jean-Francois Dockes
a8f124f637 added test cases 2012-03-20 11:17:41 +01:00
Jean-Francois Dockes
c53ca49f07 test: html5 meta charset 2012-01-26 19:31:06 +01:00
Jean-Francois Dockes
f1f6d0cf07 rerooted test results 2011-08-24 09:37:02 +02:00
"Jean-Francois Dockes ext:(%22)
38d5f9a2d9 rerooted test results 2011-08-23 10:29:19 +02:00
Jean-Francois Dockes
36a97cb8aa test: added html field extraction test 2011-06-24 11:08:12 +02:00
Jean-Francois Dockes
8fe524bd7f add html charset test 2011-06-24 10:40:29 +02:00
dockes
85426a6d91 remove recoll query text from compared test outputs 2009-01-27 11:19:48 +00:00
dockes
3a4301268f 1.10+small changes in dataset 2007-11-13 18:39:56 +00:00
dockes
183e01e34a *** empty log message *** 2007-02-14 15:02:32 +00:00