296 Commits

Author SHA1 Message Date
Jean-Francois Dockes
f0bedb2201 rclaudio: more fixes: complicated because the different file type handlers (e.g. flac, mp3) return data in different types. 2018-06-21 12:30:43 +02:00
Jean-Francois Dockes
a012b831fa rclaudio: more py3 string/bytes types issues 2018-06-21 10:55:01 +02:00
Jean-Francois Dockes
9bb50ccdd5 Renamed bundled pychm to recollchm to ease coexistence with possible system version 2018-06-12 19:29:37 +02:00
Jean-Francois Dockes
61e471a0e2 use rclbasehandler in more filters 2018-06-04 15:49:21 +02:00
Jean-Francois Dockes
0d24cc35da factorize boilerplate in simple filters 2018-06-04 15:08:06 +02:00
Jean-Francois Dockes
211ea8010c more filter cleanup: factorize code in the vanilla xslt ones, move a few more to python. 2018-06-04 13:30:09 +02:00
Jean-Francois Dockes
2a45f7fef6 filters cleanup continued: remove unused rclps, translate rclabw to python 2018-06-04 10:53:31 +02:00
Jean-Francois Dockes
9bb65fd970 get rid of rclwpd which was not used (wpd2html has been executed directly for who knows how many years) 2018-06-04 10:35:48 +02:00
Jean-Francois Dockes
603369b8f6 windows rcluncomp: leave python version unspecified for now 2018-06-04 10:31:23 +02:00
Jean-Francois Dockes
9ce4f4fcbe fix typo in error message 2018-06-04 10:30:34 +02:00
Jean-Francois Dockes
f3d3f5b0bf rclzip: add useSkippedNames variable to also use the base skippedNames 2018-06-04 09:05:23 +02:00
Jean-Francois Dockes
52d3bfa54f Change the shebang line from python2 to python3 for all scripts 2018-06-01 14:55:10 +02:00
Jean-Francois Dockes
29e63aeda1 CHM handler: bundle pychm for Python3 2018-06-01 14:52:12 +02:00
Jean-Francois Dockes
7b5f701b1d rclxslt: avoid spurious exception when the input (openoffice doc) is empty 2018-05-23 11:34:39 +02:00
Jean-Francois Dockes
4b950384e0 fix mode 2018-04-10 13:46:24 +02:00
Jean-Francois Dockes
cedff8ce7c rclchm: python3 modifications 2018-04-08 10:53:15 +02:00
Jean-Francois Dockes
93ac830079 All format handlers compatible with python3 except chm 2018-03-09 15:25:11 +01:00
Jean-Francois Dockes
7f49de5d97 rcldoc.py: port to python3. By default we exec antiword directly anyway 2018-03-08 20:38:51 +01:00
Jean-Francois Dockes
b8fa3005dd rclkar/python3: small simplifications 2018-03-08 20:37:34 +01:00
Jean-Francois Dockes
d9afcdf8a3 Modified xls and ppt filter to be compatible with python3 2018-03-08 15:51:12 +01:00
Jean-Francois Dockes
c56c1d6f46 rclchm: very small change in support of py3, but there are lots of issues in python-chm itself 2018-02-22 15:54:33 +01:00
Jean-Francois Dockes
5c80488465 imported midi.py module after python3 port, stripped most of the write part. 2018-02-22 09:50:45 +01:00
Jean-Francois Dockes
e72bac02c3 imported python-midi 0.2.1 @4:783219460045 (py3 port) 2018-02-22 09:34:44 +01:00
Jean-Francois Dockes
fead7bb491 ported rclkar to python3 2018-02-22 09:30:42 +01:00
Jean-Francois Dockes
dc0241d53a comments and messages 2018-02-09 18:15:20 +01:00
Jean-Francois Dockes
9d10bd857f none 2018-02-09 18:15:02 +01:00
Jean-Francois Dockes
116318f1f5 Added small script to process bibtex files 2018-02-09 09:30:41 +01:00
Jean-Francois Dockes
0b8988cd64 Fix Windows PDF indexing. The successful test for poppler/pdftotext was not acknowledged and pdf indexing always failed 2018-01-19 13:15:51 +01:00
Jean-Francois Dockes
b99372d379 Merge branch 'RECOLL_1_23_MAINT' 2018-01-05 17:56:44 +01:00
Jean-Francois Dockes
413c710f34 rclchm, rclepub: define config variables chmcatenate and epubcatenate to specify that the files should be indexed as a whole instead of as individual chapters 2018-01-05 17:56:19 +01:00
Jean-Francois Dockes
216c69ff2d comment 2017-12-08 13:16:26 +01:00
Jean-Francois Dockes
7346105dcb rclaudio: properly process unicode tags 2017-12-03 19:01:50 +01:00
Jean-Francois Dockes
bbb30d3351 rclaudio: properly parse mp4 trkn = (x,y) 2017-12-03 17:57:37 +01:00
Jean-Francois Dockes
5afe1aa631 Add and interface a script to move the files generated by the new WebExtensions browser extension into the web input queue 2017-11-24 15:30:27 +01:00
Jean-Francois Dockes
cd44aa33e1 added adaptor script for new browser plugin 2017-11-24 11:10:45 +01:00
Jean-Francois Dockes
f8ce677e65 rclimg: remove perl option -w 2017-07-10 22:51:29 +02:00
Jean-Francois Dockes
d5732e6a74 allow perl to not be /usr/bin/perl 2017-06-30 15:25:52 +02:00
Jean-Francois Dockes
123d5b36ad pdf: add and document MetaFixer::wrapup() method 2017-05-17 08:32:23 +02:00
Jean-Francois Dockes
ef9e7a935b PDF XMP: move field editing code to external script, document 2017-05-17 06:57:52 +02:00
Jean-Francois Dockes
9e046187da pdf xmp metadata: handle the case where the x:xmpmeta node is omitted and the XML root is rdf:RDF 2017-05-16 03:20:57 +02:00
Jean-Francois Dockes
6f44dce466 pdf: Added field-fixing method for Xml metadata 2017-05-15 14:04:55 +02:00
Jean-Francois Dockes
ccc0398155 Handle a unicode conversion issue. Avoid returning None as document for an empty document 2017-05-15 12:35:59 +02:00
Jean-Francois Dockes
d87d410f11 pdf: added capability to extract metadata from XML packet 2017-05-12 10:27:12 +02:00
Jean-Francois Dockes
d6a1f2a7f4 rclaudio: process additional tags 2017-04-25 10:16:48 +02:00
Jean-Francois Dockes
06e8424048 Changed input handler shebang lines to use explicit python2 instead of python. Can't switch to python3 because of msodump anyway 2017-04-09 04:09:02 +02:00
Jean-Francois Dockes
3e141cb2d5 support odf flat xml formats 2017-03-07 18:29:31 +01:00
Jean-Francois Dockes
4de12c11b7 odf file metadata was not properly processed 2017-03-07 18:28:23 +01:00
Jean-Francois Dockes
d35c2a557a Process a few non-standard tag names found in the wild + check for embedded images 2017-02-27 17:15:15 +01:00
Jean-Francois Dockes
d891488687 Get rid of using the "Easy" wrapper and process the original tags instead 2017-02-19 12:38:18 +01:00
Jean-Francois Dockes
28bf7ff93c rclaudio: let mutagen create the right object type. Extract more fields. Use the setfield() method instead of html meta tags. Needs the recent increase in max field count in mh_execm 2017-02-02 18:05:35 +01:00