Jean-Francois Dockes
080db9eba3
none
2020-08-17 13:08:04 +02:00
Jean-Francois Dockes
cca69cbd31
Windows: text/plain: check if already utf-8 before trying transcode
2020-08-15 11:16:10 +01:00
Jean-Francois Dockes
d9c1a9648c
Windows msvc: rename dirent.h->msvc_dirent.h. mh_text: fix mimeconf-win and warning
2020-08-15 10:12:36 +01:00
Jean-Francois Dockes
15924ce037
Process text/plain subdocuments like .txt files (paging big ones, etc.)
2020-08-15 10:20:48 +02:00
Jean-Francois Dockes
13333e6512
use common method when concatenating multiple values for a metadata element. Use a comma as separator
2020-08-11 11:39:22 +02:00
Jean-Francois Dockes
fd0cf698a1
indents
2020-08-10 16:52:41 +02:00
Jean-Francois Dockes
09ad94f3b7
removed obsolete test mains Makefiles
2020-08-06 11:46:11 +02:00
Jean-Francois Dockes
5cfd95226d
Fix pdf duplicate detection which was broken by setting the md5 attribute twice, for the doc and for the html content. May have affected some other doc types
2020-07-27 09:08:47 +02:00
Jean-Francois Dockes
24b947500f
tabs
2020-05-30 15:59:46 +02:00
Jean-Francois Dockes
560041cab9
cleared out errant tabs
2020-05-30 15:54:49 +02:00
Jean-Francois Dockes
e61ec4b7af
autoconf malloc.h, and clear old c++ conf tests
2020-05-20 18:50:43 +02:00
Jean-Francois Dockes
2f794be314
Fix Windows gcc build. Needs some def to get w7+ windows api
2020-04-25 11:41:37 +02:00
Jean-Francois Dockes
126ac47dba
tabs and indents
2020-04-24 13:45:41 +02:00
Jean-Francois Dockes
19149020e3
internal xslt: support multiple archive members containing metadata or body text. Apply to indexing openxml foot/endnotes
2020-04-22 14:04:20 +02:00
Jean-Francois Dockes
39c152bada
Fixed MSVC warnings, all inocuous
2020-04-17 14:26:40 +01:00
Jean-Francois Dockes
34d23589a2
remove struct stat from interfaces
2020-03-31 11:17:07 +02:00
Jean-Francois Dockes
d1c99e83d7
log message level
2020-03-27 17:18:41 +01:00
Jean-Francois Dockes
90dd64fc61
Have RclExecM inherit the shared CmdTalk now that the latter is used anyway for the korean splitter. Main diff: cmdtalk strips the colon from param names and does not lowercase them
2020-03-27 11:07:51 +01:00
Jean-Francois Dockes
25a5f3a7e0
Explicitly test for malloc_trim() in configure
2020-02-25 16:45:29 +01:00
Jean-Francois Dockes
e5af1651fa
html/xml meta: avoid appending a value that is already present in the string
2020-01-30 08:37:46 +01:00
Jean-Francois Dockes
1ffd54f834
no malloc_trim on windows
2020-01-11 16:49:04 +01:00
Jean-Francois Dockes
9e05c167e9
Add a malloc_trim() call when deallocating the parser in mh_xslt. Seems to help with memory fragmentation, reason a bit mysterious
2020-01-05 17:35:46 +01:00
Jean-Francois Dockes
f6302b5760
Forgotten log verbosity raise in mh_mbox
2019-11-29 10:50:22 +01:00
Jean-Francois Dockes
1b59790b02
mimehandler: tabs change only
2019-11-24 11:00:43 +01:00
Jean-Francois Dockes
c11cac2868
orthograph, mostly in comments, also man pages
2019-10-18 09:13:10 +02:00
Jean-Francois Dockes
05354373e1
added parameter mboxmaxmsgmbs to adjust max mbox entry size (dlft 100 MB)
2019-10-03 10:15:58 +02:00
Jean-Francois Dockes
dd5cf808b5
mbox: trim verbosity
2019-09-23 09:26:19 +02:00
Jean-Francois Dockes
85a3291fd7
mbox: bad return from function would result in wrong (always first) message being previewed if the cache did not exist
2019-08-27 14:42:14 +02:00
Jean-Francois Dockes
54b8bea942
mbox: new std::stream-based implementation of offset cache ok on windows..
2019-08-08 13:44:41 +02:00
Jean-Francois Dockes
ad5beb43c2
mh_mbox: fix on Windows.
...
The 2 main issues were that the files were fopen'd in text mode, and that
the std-based SimpleRegexp implementation was wrong (match instead of search).
Also, make sure that we are using 64 bits interfaces.
2019-08-07 17:07:06 +02:00
Jean-Francois Dockes
9a68157907
mh_mbox: improve diags for win debugging
2019-08-07 13:32:23 +02:00
Jean-Francois Dockes
049ba1e7e4
Windows: build with UNICODE, get rid of all TCHAR/TEXT(), use explicit
...
xxA() interfaces and wchar_t in some places. Add a static cleanup retry
method to TempFile, called after clearing the MimeHandler cache (killing
the subprocesses which might hold an open file).
2019-07-21 16:23:16 +02:00
Jean-Francois Dockes
be214c4a5a
Take advantage of text storage when possible to display preview data for an unaccessible document
2019-06-16 11:49:18 +02:00
Jean-Francois Dockes
dca18bc585
Try to give possible explanations when opening a preview fails
2019-06-15 19:21:52 +02:00
Jean-Francois Dockes
37e203d535
mh_text: log message when skipping file with size over max
2019-05-17 09:32:46 +02:00
Jean-Francois Dockes
33e1847b26
suppress misc warnings on fedora and macosx
2019-04-28 15:39:15 +02:00
Jean-Francois Dockes
35d2d5bf49
Fixed a number of recollinit invocations. Most in dead/test code
2019-03-21 15:28:02 +01:00
Jean-Francois Dockes
1cf8327525
internfile: let the constructor succeed even on uncompression error, so that the doc record is created and retry choices can be done for other runs
2019-03-19 16:41:01 +01:00
Jean-Francois Dockes
bfa786dfeb
internfile: do not process set_document_xx errors. Wait until the next_document() call so that the file names or ipaths are indexed
2019-03-12 14:55:31 +01:00
Jean-Francois Dockes
0cbc46732f
Fixed the FSF address
2019-03-04 11:19:14 +01:00
Jean-Francois Dockes
e3c5f51519
Make sure we dont grow the ipath with each consecutive error
2019-02-19 20:54:57 +01:00
Jean-Francois Dockes
478739f1e7
uncomp: better error message.
2019-01-25 15:18:50 +01:00
Jean-Francois Dockes
f6eacd5949
mh_html: print explanation for read errors
2019-01-23 14:50:51 +01:00
Jean-Francois Dockes
3be55f4ad4
Internal xsltproc: small Windows adjustments
2019-01-08 14:38:30 +01:00
Jean-Francois Dockes
f6f4d8426a
comments + really compute md5 on uncompressed data
2018-12-28 10:32:01 +01:00
Jean-Francois Dockes
586ff90dc0
internal xslt: openoffice zip format working
2018-12-27 16:20:12 +01:00
Jean-Francois Dockes
00c0c5168b
internal xslt working for single-sheet (abw). Still leaking memory?
2018-12-25 10:57:26 +01:00
Jean-Francois Dockes
abc45bc156
internfile: transfer metadata from the last extracted (file-like) stage to the final document
2018-11-30 11:55:30 +01:00
Jean-Francois Dockes
495bd66bf5
mh_text: use c++11 for init
2018-11-22 17:46:06 +01:00
Jean-Francois Dockes
b4dfa40cbf
mh_mail: use rfc2047 on additional headers requested by config. comments and small cleanups
2018-11-22 17:44:33 +01:00