Jean-Francois Dockes
|
cedff8ce7c
|
rclchm: python3 modifications
|
2018-04-08 10:53:15 +02:00 |
|
Jean-Francois Dockes
|
93ac830079
|
All format handlers compatible with python3 except chm
|
2018-03-09 15:25:11 +01:00 |
|
Jean-Francois Dockes
|
7f49de5d97
|
rcldoc.py: port to python3. We by default exec antiword directly anyway
|
2018-03-08 20:38:51 +01:00 |
|
Jean-Francois Dockes
|
b8fa3005dd
|
rclkar/python3: small simplifications
|
2018-03-08 20:37:34 +01:00 |
|
Jean-Francois Dockes
|
d9afcdf8a3
|
Modified xls and ppt filter to be compatible with python3
|
2018-03-08 15:51:12 +01:00 |
|
Jean-Francois Dockes
|
c56c1d6f46
|
rclchm: very small change in support of py3, but there are lots of issue in python-chm itself
|
2018-02-22 15:54:33 +01:00 |
|
Jean-Francois Dockes
|
5c80488465
|
imported midi.py module after python3 port, stripped most of the write part.
|
2018-02-22 09:50:45 +01:00 |
|
Jean-Francois Dockes
|
e72bac02c3
|
imported python-midi 0.2.1 @4:783219460045 (py3 port)
|
2018-02-22 09:34:44 +01:00 |
|
Jean-Francois Dockes
|
fead7bb491
|
ported rclkar to python3
|
2018-02-22 09:30:42 +01:00 |
|
Jean-Francois Dockes
|
dc0241d53a
|
comments and messages
|
2018-02-09 18:15:20 +01:00 |
|
Jean-Francois Dockes
|
9d10bd857f
|
none
|
2018-02-09 18:15:02 +01:00 |
|
Jean-Francois Dockes
|
116318f1f5
|
Added small script to process bibtex files
|
2018-02-09 09:30:41 +01:00 |
|
Jean-Francois Dockes
|
0b8988cd64
|
Fix Windows PDF indexing. The successful test for poppler/pdftotext was not acknowledged and pdf indexing always failed
|
2018-01-19 13:15:51 +01:00 |
|
Jean-Francois Dockes
|
b99372d379
|
Merge branch 'RECOLL_1_23_MAINT'
|
2018-01-05 17:56:44 +01:00 |
|
Jean-Francois Dockes
|
413c710f34
|
rclchm, rclepub: define config variables chmcatenate and epubcatenate to specify that the files should be indexed as a whole instead of as individual chapters
|
2018-01-05 17:56:19 +01:00 |
|
Jean-Francois Dockes
|
216c69ff2d
|
comment
|
2017-12-08 13:16:26 +01:00 |
|
Jean-Francois Dockes
|
7346105dcb
|
rclaudio: properly process unicode tags
|
2017-12-03 19:01:50 +01:00 |
|
Jean-Francois Dockes
|
bbb30d3351
|
rclaudio: properly parse mp4 trkn = (x,y)
|
2017-12-03 17:57:37 +01:00 |
|
Jean-Francois Dockes
|
5afe1aa631
|
Add and interface a script to move the files generated by the WebExtensions new browser extension into the web input queue
|
2017-11-24 15:30:27 +01:00 |
|
Jean-Francois Dockes
|
cd44aa33e1
|
added adaptor script for new browser plugin
|
2017-11-24 11:10:45 +01:00 |
|
Jean-Francois Dockes
|
f8ce677e65
|
rclimg: remove perl option -w
|
2017-07-10 22:51:29 +02:00 |
|
Jean-Francois Dockes
|
d5732e6a74
|
allow perl to not be /usr/bin/perl
|
2017-06-30 15:25:52 +02:00 |
|
Jean-Francois Dockes
|
123d5b36ad
|
pdf: add and document MetaFixer::wrapup() method
|
2017-05-17 08:32:23 +02:00 |
|
Jean-Francois Dockes
|
ef9e7a935b
|
PDF XMP: move field editing code to external script, document
|
2017-05-17 06:57:52 +02:00 |
|
Jean-Francois Dockes
|
9e046187da
|
pdf xmp metadata: handle the case where the x:xmpmeta node is omitted and the XML root is rdf:RDF
|
2017-05-16 03:20:57 +02:00 |
|
Jean-Francois Dockes
|
6f44dce466
|
pdf: Added field-fixing method for Xml metadata
|
2017-05-15 14:04:55 +02:00 |
|
Jean-Francois Dockes
|
ccc0398155
|
Handle a unicode conversion issue. Avoid returning None as document for an empty document
|
2017-05-15 12:35:59 +02:00 |
|
Jean-Francois Dockes
|
d87d410f11
|
pdf: added capability to extract metadata from XML packet
|
2017-05-12 10:27:12 +02:00 |
|
Jean-Francois Dockes
|
d6a1f2a7f4
|
rclaudio: process additional tags
|
2017-04-25 10:16:48 +02:00 |
|
Jean-Francois Dockes
|
06e8424048
|
Changed input handler shebang lines to use explicit python2 instead of python. Cant switch to python3 because of msodump anyway
|
2017-04-09 04:09:02 +02:00 |
|
Jean-Francois Dockes
|
3e141cb2d5
|
support odf flat xml formats
|
2017-03-07 18:29:31 +01:00 |
|
Jean-Francois Dockes
|
4de12c11b7
|
odf file metadata was not properly processed
|
2017-03-07 18:28:23 +01:00 |
|
Jean-Francois Dockes
|
d35c2a557a
|
Process a few non-standard tag names found in the wild + check for embedded images
|
2017-02-27 17:15:15 +01:00 |
|
Jean-Francois Dockes
|
d891488687
|
Get rid of using the "Easy" wrapper and process the original tags instead
|
2017-02-19 12:38:18 +01:00 |
|
Jean-Francois Dockes
|
28bf7ff93c
|
rclaudio: let mutagen create the right object type. Extract more fields. Use the setfield() method instead of html meta tags. Needs the recent increase in max field count in mh_execm
|
2017-02-02 18:05:35 +01:00 |
|
Jean-Francois Dockes
|
7567025ad3
|
added "all in one" rclepub1 filter (no individual indexing of chapters)
|
2016-12-05 15:19:02 +01:00 |
|
Jean-Francois Dockes
|
d6b230043c
|
Check for newer pdftotext version to avoid double HTML escaping. fixes issue #318
|
2016-08-05 08:51:34 +02:00 |
|
Jean-Francois Dockes
|
b9e672abda
|
Allow execm input handlers to set arbitrary data fields
|
2016-07-11 18:13:39 +02:00 |
|
Jean-Francois Dockes
|
236900ee2a
|
comments
|
2016-05-23 19:16:31 +02:00 |
|
Jean-Francois Dockes
|
b2bd67cee8
|
added bogus minimum sample execm handler, indexing text lines as docs
|
2016-05-23 18:59:00 +02:00 |
|
Jean-Francois Dockes
|
b421f86f72
|
renamed rclmpdf.py to more normal rclpdf.py
|
2016-04-11 13:59:07 +02:00 |
|
Jean-Francois Dockes
|
4830e35a1b
|
pdf: add config variables to control if we attempt attachment extraction and ocr
|
2016-04-11 13:57:58 +02:00 |
|
Jean-Francois Dockes
|
74088bdada
|
doc
|
2016-04-09 20:01:48 +02:00 |
|
Jean-Francois Dockes
|
b995cfb4e8
|
added module for simplified interface to libxmp
|
2016-04-08 11:37:23 +02:00 |
|
Jean-Francois Dockes
|
031cdf9761
|
converted rcldjvu to python
|
2016-04-08 10:24:52 +02:00 |
|
Jean-Francois Dockes
|
95bd49b420
|
Restore PDF OCR capability from shell version of rclpdf script
|
2016-04-08 09:00:23 +02:00 |
|
Jean-Francois Dockes
|
92bb5bfc43
|
xls filter: catch HTML files disguising as XLS
|
2016-02-26 09:35:23 +01:00 |
|
Jean-Francois Dockes
|
b4c1fd033a
|
effect-less typo
|
2016-02-26 08:45:07 +01:00 |
|
Jean-Francois Dockes
|
d115bcfaa2
|
rclmpdf.py: p2/3 compat
|
2015-11-21 12:46:58 +01:00 |
|
Jean-Francois Dockes
|
5776c4bc20
|
rclinfo: remove trace message
|
2015-11-21 12:46:28 +01:00 |
|