Home
last modified time | relevance | path

Searched hist:"93 a60ad223af9f649d62d2acbdffd22ce5ad1b97" (Results 1 – 2 of 2) sorted by relevance

/dokuwiki/inc/
H A Dfulltext.php93a60ad223af9f649d62d2acbdffd22ce5ad1b97 Sun Sep 25 17:54:51 UTC 2005 Andreas Gohr <andi@splitbrain.org> asian language support for the indexer #563

Asian languages do not use spaces to seperate words. The indexer however does
a word based lookup. Splitting for example Japanese texts into real words is
only possible with complicated natural language processing, something
completely out of scope for DokuWiki.

This patch solves the problem by treating all asian characters as single
words. When an asian word (consisting of multiple characters) is searched it
is treated as a phrase search, looking up each charcter by it self first,
then checking for the phrase in found documents.

darcs-hash:20050925175451-7ad00-933b33b51b5f2fa05e736c18b8db58a5fdbf41ce.gz

H A Dindexer.php93a60ad223af9f649d62d2acbdffd22ce5ad1b97 Sun Sep 25 17:54:51 UTC 2005 Andreas Gohr <andi@splitbrain.org> asian language support for the indexer #563

Asian languages do not use spaces to seperate words. The indexer however does
a word based lookup. Splitting for example Japanese texts into real words is
only possible with complicated natural language processing, something
completely out of scope for DokuWiki.

This patch solves the problem by treating all asian characters as single
words. When an asian word (consisting of multiple characters) is searched it
is treated as a phrase search, looking up each charcter by it self first,
then checking for the phrase in found documents.

darcs-hash:20050925175451-7ad00-933b33b51b5f2fa05e736c18b8db58a5fdbf41ce.gz