shipping manifest cleanup
This commit is contained in:
parent
79119c9432
commit
7def227c19
@ -1,58 +0,0 @@
|
||||
= 2014-04-30: Notes about the hyphen-minus character '-':
|
||||
|
||||
Ascii hyphen-minus used to be glue, but stopped around version 1.18, then
|
||||
was re-instated in 1.20.
|
||||
|
||||
Having - as glue avoids generating phrase searches with bad performance.
|
||||
|
||||
== Dashes
|
||||
|
||||
There is a diversity of Unicode characters used mostly indistinctly (and
|
||||
independant of their correct intent) as dash/minus/hyphen (hyphen, n-dash,
|
||||
em-dash, etc.) in real-world texts.
|
||||
|
||||
The Unicode dashes are properly treated as word-breaking by the splitter,
|
||||
but it means that there will sometimes be a discrepancy between the
|
||||
character in the search (usually an ascii hyphen-minus), and the character
|
||||
in the text (which could be anything because of mis-use).
|
||||
|
||||
It does happen (incorrectly) that a dash is used in a text instead of an
|
||||
hyphen to join a compound word, resulting in no span constructed, and a
|
||||
minus in the question, generating a span search, resulting in missed
|
||||
match.
|
||||
|
||||
A possible solution consisting in changing all dash signs into minus signs
|
||||
at indexing time has been dismissed because this would introduce problems
|
||||
with *correct* uses of dashes (which should be treated as space). This
|
||||
would not be a major issue though because a matching search would probably
|
||||
use white space in this case, and single terms are also generated for the
|
||||
span.
|
||||
|
||||
There are auxiliary arguments:
|
||||
|
||||
- Treating all dash/hyphen/minus as whitespace (except at eol) makes for a
|
||||
smaller index.
|
||||
- Which is especially significant for raw indexes because of
|
||||
multiplicative effects ("jean francois" "Jean francois" "jean Francois"
|
||||
...)
|
||||
|
||||
== Hyphens
|
||||
|
||||
Hyphens have several distinct uses which should yield different treatment:
|
||||
|
||||
- Use with prefixes and suffixes: co-worker should probably be transformed
|
||||
into or supplemented by coworker
|
||||
- Use in compound words: American-football in "American-football player"
|
||||
should certainly not be collapsed.
|
||||
|
||||
If an hyphen-minus is present in the text in the first case, as will be
|
||||
current in practise, there is no way we can get it right anyway, except by
|
||||
using a language dictionary.
|
||||
|
||||
So, given that even a real hyphen needs an ambiguous treatment, we don't
|
||||
try and we just replace a Unicode hyphen (0x2010) with an ascii
|
||||
hyphen-minus while indexing. This has the best chance of matching what a
|
||||
user would type.
|
||||
|
||||
The current (1.20) recoll is unable to match coworker and co-worker. The
|
||||
best treatment for this would probably be synonym expansion at search time.
|
||||
@ -110,8 +110,7 @@ fi
|
||||
################################### Documentation
|
||||
###### Html doc
|
||||
RECOLLDOC=${RECOLLDOC:=doc/user}
|
||||
(cd $RECOLLDOC; sh xmlmake.sh) || exit 1
|
||||
rm -f $RECOLLDOC/usermanual-xml.html $RECOLLDOC/usermanual.xml
|
||||
(cd $RECOLLDOC; make) || exit 1
|
||||
|
||||
###### Text Doc
|
||||
chmod +w README INSTALL
|
||||
|
||||
@ -53,6 +53,7 @@ configure.ac
|
||||
desktop/
|
||||
desktop/hotrecoll.py
|
||||
desktop/recoll-searchgui.desktop
|
||||
desktop/recoll.appdata.xml
|
||||
desktop/recoll.png
|
||||
desktop/recoll.svg
|
||||
desktop/recoll.xcf
|
||||
@ -78,7 +79,6 @@ doc/prog/top.txt
|
||||
doc/user/
|
||||
doc/user/00README.txt
|
||||
doc/user/Makefile
|
||||
doc/user/Makefile.fbsd
|
||||
doc/user/RCL.INDEXING.CONFIG.html
|
||||
doc/user/RCL.INDEXING.EXTATTR.html
|
||||
doc/user/RCL.INDEXING.EXTTAGS.html
|
||||
@ -110,8 +110,7 @@ doc/user/docbook.css
|
||||
doc/user/index.html
|
||||
doc/user/usermanual-italian.html
|
||||
doc/user/usermanual.html
|
||||
doc/user/usermanual.sgml
|
||||
doc/user/xmlmake.sh
|
||||
doc/user/usermanual.xml
|
||||
filters/
|
||||
filters/injectcommon.sh
|
||||
filters/msodump.zip
|
||||
@ -276,6 +275,7 @@ python/recoll/recoll/__init__.py
|
||||
python/recoll/recoll/rclconfig.py
|
||||
python/recoll/setup.py.in
|
||||
python/samples/
|
||||
python/samples/docdups.py
|
||||
python/samples/mutt-recoll.py
|
||||
python/samples/rcldlkp.py
|
||||
python/samples/rclmbox.py
|
||||
@ -328,6 +328,8 @@ qtgui/i18n/recoll_it.qm
|
||||
qtgui/i18n/recoll_it.ts
|
||||
qtgui/i18n/recoll_lt.qm
|
||||
qtgui/i18n/recoll_lt.ts
|
||||
qtgui/i18n/recoll_pl.qm
|
||||
qtgui/i18n/recoll_pl.ts
|
||||
qtgui/i18n/recoll_ru.qm
|
||||
qtgui/i18n/recoll_ru.ts
|
||||
qtgui/i18n/recoll_tr.qm
|
||||
@ -509,6 +511,8 @@ unac/unac.h
|
||||
unac/unac_version.h
|
||||
utils/
|
||||
utils/Makefile
|
||||
utils/appformime.cpp
|
||||
utils/appformime.h
|
||||
utils/base64.cpp
|
||||
utils/base64.h
|
||||
utils/cancelcheck.h
|
||||
|
||||
@ -86,7 +86,8 @@ done
|
||||
|
||||
@NOQTMAKE@test -d ${datadir}/applications \
|
||||
@NOQTMAKE@ || mkdir -p ${datadir}/applications || exit 1
|
||||
|
||||
@NOQTMAKE@test -d ${datadir}/appdata \
|
||||
@NOQTMAKE@ || mkdir -p ${datadir}/appdata || exit 1
|
||||
|
||||
# Use the xdg utilies to install the desktop file and icon? Couldn't find
|
||||
# out how to get this to work sanely. So keep the old way
|
||||
@ -94,6 +95,7 @@ done
|
||||
#export PATH
|
||||
#xdg-desktop-menu install desktop/recoll-searchgui.desktop
|
||||
#xdg-icon-resource install --size 48 desktop/recoll.png
|
||||
@NOQTMAKE@${INSTALL} -m 0444 desktop/recoll.appdata.xml ${datadir}/appdata
|
||||
@NOQTMAKE@${INSTALL} -m 0444 desktop/recoll-searchgui.desktop ${datadir}/applications
|
||||
@NOQTMAKE@${INSTALL} -m 0444 desktop/recoll.png ${datadir}/icons/hicolor/48x48/apps
|
||||
@NOQTMAKE@${INSTALL} -m 0444 desktop/recoll.png ${datadir}/pixmaps/
|
||||
|
||||
Loading…
x
Reference in New Issue
Block a user