From de9a06f04c90bf3921884a7dc9c7a6d0f64360b7 Mon Sep 17 00:00:00 2001 From: Jean-Francois Dockes Date: Tue, 6 Mar 2012 07:26:59 +0100 Subject: [PATCH] none --- website/download.html | 6 ++ website/rclidxfmt.html | 149 +++++++++++++++++++++++++++-------------- 2 files changed, 105 insertions(+), 50 deletions(-) diff --git a/website/download.html b/website/download.html index 0b2e57b8..eaa6a87b 100644 --- a/website/download.html +++ b/website/download.html @@ -372,6 +372,12 @@ I now use the OpenSUSE build service to create Recoll OpenSUSE packages.

Updated 1.16 translations that became available after the release:

+ +

A new Spanish translation for 1.16.2, thanks to JCP. + recoll_es.ts + recoll_es.qm +

+

The following are up to date in 1.16.2, but may be useful if you are running 1.16.1.

Czech, thanks to Pavel. diff --git a/website/rclidxfmt.html b/website/rclidxfmt.html index e1e3c461..0cbdfef4 100644 --- a/website/rclidxfmt.html +++ b/website/rclidxfmt.html @@ -19,20 +19,23 @@

Recoll index format details

-

A comparison of index formats for recoll 1.8 and omega - 1.0.1

+

A comparison of index formats for recoll 1.17 and omega + 1.0.1

Recoll terms are not stemmed before being stored. They are turned to all minuscule letters with no accents. An auxiliary database handles stem expansion. Omega stores both raw - terms and stemmed versions (with prefix Z)

+ terms (with prefix R) and stemmed versions (with prefix Z). + The xapian-side of the information here comes from the relevant + xapian-omega documentation + page. +

Special prefixed terms:

A comparison of prefixed term usage between Recoll and - omega/xapian. xapian-core in the Omega column means - that the prefix is not used by Omega, but mentionned as - allocated in the xapian prefix definition document.

+ omega/xapian.

@@ -40,63 +43,109 @@ - - + - - - - - + + + YYYYMMDD + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + - - - - - - - - - - - - - - - -
Tmime typeSame
AAuthorSame
PTruncated/hashed version of file path. For - single-document files, and for the file part of a - multi-document file. Used for up-to-date checks and for - retrieving a document by path. Path part of URL (no - hashing). Uses U for the equivalent - term used for up to date checks.
Qpathhash+ipath same + internal path for - documents inside multi-document files. Used to set the - existence flag for subdocs when a multi-document file is found - to be up to date, or for deleting all subdocs for a file, or - for retrieving a document by path+ipath. Compatible - with Q definition in xapian/termprefixes.txt: unique - identifier.None
BUnusedReserved
CUnusedReserved
Ddate: modification date of file, like - YYYYMMDDSameSame
EUnused. Recoll uses XEfile name extension folded to lowercase
FUnusedReserved
GUnusednewGroup / forum name
HUnusedhost name
IUnused"Can see"
JUnusedReserved
KKeywordSame
LUnusedISO language code
Mmonth: YYYYMMSame
NUnusedISO country code
OUnusedOwner
PUnusedPath part of URL
QUnique Id. fs backend: trunc-hashed path+ipath + Other backends may use a different unique id. + Unique Id
RUnusedRaw (unstemmed) term
SSubject/titleSame
Tmime typeSame
UUnusedFull Url of indexed + document. Truncated/hashed version of URL. Used for + duplicate checks.
VUnused"Can't see"
WUnusedOwner
XPrefix prefix for multichar prefixesSame
Yyear YYYYSame
ZUnusedStemmed term
XEFile name extension folded as lowercase + (omega uses E)Unused
XPPath elements (for phrase-based directory filtering) + Unused
XSFNutf8 lowercased/unaccented version of + file name. Used for specific file name searches. NOT SPLIT + (spaces as normal chars).None
XTORecipientNone
XXSTNot really a prefix: start of field + marker (for anchored phrase searches)None
XXNDNot really a prefix: end of field + marker (for anchored phrase searches)None
Mmonth: YYYYMMSame
Yyear YYYYSame
XSFNutf8 version of file name. Used for specific - file name searchesNone
UNoneUrl term. Truncated/hashed version - of URL. Used for duplicate checks.
SSubject/titlexapian-core
AAuthorxapian-core
KKeywordxapian-core
-

None of the "date" terms are currently used by recoll queries

Values

-

Recoll currently stores no document values.

-

Omega stores 2 values, for the md5 hash of the file, and the - last modification date (as unix time). The md5 value doesn't - appear to be currently used ?

+ + + + + + + + + + + + +
Value slotRecoll useOmega use
0UnusedUnix modification time
1MD5Same
2UnusedSize
10Signature: value to be checked for + up-to-dateness, ie mtime|size for the fs + backendUnused
+

Document data record format

+

Recoll has the same line based / prefixed data record format - as omega (name=value\n).

+ as omega (name=value\n). The Omega data below is quite out of + date.

@@ -141,7 +190,7 @@
Jean-Francois Dockes
-Last modified: Thu Jun 14 11:14:38 CEST 2007 +Last modified: Sat Feb 25 09:14:38 CEST 2012