10 Commits

Author SHA1 Message Date
Jean-Francois Dockes
8358742132 get things to build on centos7.5 (cosmetic changes) 2018-09-02 18:47:03 +02:00
Jean-Francois Dockes
84abb8ac04 Fix regex used for cleaning up snippets 2018-04-12 12:25:05 +02:00
Jean-Francois Dockes
e4e5ee35d6 cleanup repeated punctuation in snippets 2018-04-10 13:07:27 +02:00
Jean-Francois Dockes
3d4fd3c62e When storing doc text, always use a metadata entry. Get rid of the code to store it in the data record. Make storing the default. Add "fetchtext" parameter to getDoc() to fetch and store the text in doc.text. Make this accessible from Python. Misc comments and indents. 2018-01-25 13:20:02 +01:00
Jean-Francois Dockes
2c76a70c0e Abstracts: storing raw doc text in user metadata records 2018-01-06 11:38:24 +01:00
Jean-Francois Dockes
57d9ece876 rclabsfromtext: do not add page numbers if there are no pages 2018-01-06 10:39:02 +01:00
Jean-Francois Dockes
a35de1ef1e snippets: fix to the group matching code 2018-01-03 15:53:04 +01:00
Jean-Francois Dockes
567401233a Building abstract/snippets from the doc text: process phrase/group terms 2018-01-03 15:28:46 +01:00
Jean-Francois Dockes
bb810f9ceb Changed new param name storerawtext->storedoctext. + comments 2018-01-02 19:23:12 +01:00
Jean-Francois Dockes
b4493ed9e1 Snippets generation: add method for generating from doc stored text. Still needs refining, esp. for phrase/near 2017-12-30 08:43:14 +01:00