History log of /dokuwiki/lib/exe/indexer.php (Results 51 – 75 of 93)
Revision Date Author Comments
# cd3c835d 25-Jan-2009 Andreas Gohr <andi@splitbrain.org>

removed importoldchangelog and importoldindex plugins

Ignore-this: fb48b24cecb52541a728ba9c17597d8f
These one-shot plugins where used for upgrading older DokuWiki versions
and are no longer needed.

removed importoldchangelog and importoldindex plugins

Ignore-this: fb48b24cecb52541a728ba9c17597d8f
These one-shot plugins where used for upgrading older DokuWiki versions
and are no longer needed.
If you upgrade from a really old version you might want upgrade to
intermediate versions instead.

darcs-hash:20090125143050-7ad00-5ff7b2cd5f61c392e9e02e13eab947d045d60b04.gz

show more ...


# 99c8d7f2 18-Jan-2009 michael <michael@content-space.de>

Media changelog added

There is a new media changelog now, with the flag RECENTS_MEDIA_CHANGES media changes can be requested from the getRecents()-function or the new getRecentsSince()-function, tha

Media changelog added

There is a new media changelog now, with the flag RECENTS_MEDIA_CHANGES media changes can be requested from the getRecents()-function or the new getRecentsSince()-function, that returns all changes since a given timestamp and optionally before a given timestamp. The media upload and the XML-RPC-server have been changed to use these functions.

Additionally, the event MEDIA_UPLOAD_FINISH has been extended, it has a new $data-attribute (the 5th), that contains a boolean if the file does already exist and will be overwritten.

darcs-hash:20090118154345-074e0-5d9a90d269e86d8c6a156ecce5cf63115c827433.gz

show more ...


# 4894bdf5 23-Jun-2008 Andreas Gohr <andi@splitbrain.org>

fixed the sitemap submission URL for MS Live Search

darcs-hash:20080623175256-7ad00-4e6ec21196db228d47dbfede6294613567dbb762.gz


# 33a148e0 26-Feb-2008 Tom N Harris <tnharris@whoopdedo.org>

INDEXER_TASKS_RUN event for index-time hooks

The event INDEXER_TASKS_RUN is fired by lib/exe/indexer.php when a page is viewed. Plugins should only hook BEFORE the event if it is important for the t

INDEXER_TASKS_RUN event for index-time hooks

The event INDEXER_TASKS_RUN is fired by lib/exe/indexer.php when a page is viewed. Plugins should only hook BEFORE the event if it is important for the task to be run as often as possible. Otherwise, hook AFTER the even to be run only when other tasks have completed.

Plugin authors must call stopPropagation() and preventDefault() if any work is done. If your plugin does nothing, then you must allow the event to continue. Not following these rules may cause DokuWiki to exceed the PHP execution time limit.

darcs-hash:20080226011940-6942e-09291b73bab84a2c4445b1d1c4de8b3bba743243.gz

show more ...


# 4570e954 15-Oct-2007 Tom N Harris <tnharris@whoopdedo.org>

Fix border condition on recent change update

darcs-hash:20071015225711-6942e-4d540e23e3c2ab62e378b0b9bc3cb80041c79350.gz


# d0a27cb0 30-Sep-2007 Andreas Gohr <andi@splitbrain.org>

don't use fullpath() before initialized

darcs-hash:20070930201133-7ad00-a35a6c40f880116009efd9e50cb002bd75733369.gz


# 00976812 30-Sep-2007 Andreas Gohr <andi@splitbrain.org>

don't use realpath() anymore (FS#1261 and others)

The use of realpath() to clean up relative file names caused some
trouble in certain setups relying on symlinks or having restricitve
file structure

don't use realpath() anymore (FS#1261 and others)

The use of realpath() to clean up relative file names caused some
trouble in certain setups relying on symlinks or having restricitve
file structure setups.

This patch replaces all realpath() calls with a PHP only replacement
which should solve those problems.

darcs-hash:20070930184250-7ad00-512ff04c95f57fc9eaf104f80372237a3c94286f.gz

show more ...


# a0c5c349 19-Sep-2007 Tom N Harris <tnharris@whoopdedo.org>

Remove obsolete words from search index

Creates another index file 'pagewords.idx' for the words in each page.
Words that are deleted from a page can then be removed from the word index.
The indexer

Remove obsolete words from search index

Creates another index file 'pagewords.idx' for the words in each page.
Words that are deleted from a page can then be removed from the word index.
The indexer version is incremented to force rebuilding of the index.
Also, a minor flaw in the regexp for asian words is fixed.

darcs-hash:20070919194244-6942e-2e08157dcf4fdf166b35b36a0faf8a3dfb415ad9.gz

show more ...


# fcf93a34 03-Mar-2007 Andreas Gohr <andi@splitbrain.org>

improved writability check for sitemap FS#1093

darcs-hash:20070303192836-7ad00-fe821c42ba7541f58ab52b9d8d11b3241bc90b65.gz


# cde6a01b 08-Feb-2007 Andreas Gohr <andi@splitbrain.org>

workaround config for FS#852

On certain platforms the ignore_user_abort function does not work as expected, resulting
in a non working indexer webbug. Users with such a broken system (IIS+PHP as CGI

workaround config for FS#852

On certain platforms the ignore_user_abort function does not work as expected, resulting
in a non working indexer webbug. Users with such a broken system (IIS+PHP as CGI) can
enable this option to work around the problem (resulting in longer load times for the
webbug).

darcs-hash:20070208195145-7ad00-8fc14f9da535a70fa837066773e15a3926b077c7.gz

show more ...


# 613e963a 07-Dec-2006 Ben Coburn <btcoburn@silicodon.net>

string for constant fix

darcs-hash:20061207075815-05dcb-81fad7f4e40142e01f9f1aaa56f47fa51f978186.gz


# d5b23302 17-Nov-2006 Tom N Harris <tnharris@whoopdedo.org>

Indexer asian language fixes and speed-ups

Make Chinese and Japanese work better with the new indexer.
Some missing punctuation added to utf8_stripspecials.
Misc. other changes to make indexing fast

Indexer asian language fixes and speed-ups

Make Chinese and Japanese work better with the new indexer.
Some missing punctuation added to utf8_stripspecials.
Misc. other changes to make indexing faster. The indexes will expire on
backend upgrades, so you don't have to delete *.indexed

darcs-hash:20061117123032-6942e-774b38e08234928c49b37e40addba375acf67ac0.gz

show more ...


# 9046bb99 17-Nov-2006 Andreas Gohr <andi@splitbrain.org>

sitemapper update

The Google sitemap protocol was recently adopted by Yahoo and Microsoft and
made a common standard. This patch changes the XML namespace URL to the new
sitmaps.org site and ups the

sitemapper update

The Google sitemap protocol was recently adopted by Yahoo and Microsoft and
made a common standard. This patch changes the XML namespace URL to the new
sitmaps.org site and ups the version to 0.9

Pinging of Yahoo and Microsoft was added to the existing Google ping. The
microsoft ping currently fails with a "Bad format" error for unknown reason.
This will hopefully change when either Microsoft fixes their URL or releases
some documentation.

darcs-hash:20061117150030-7ad00-0fac1cba07926c3ffe687a8cbaf465e8de3abcd7.gz

show more ...


# 3fc667cf 13-Nov-2006 chris <chris@jalakai.co.uk>

fixes for stricter php5 typing (bug#978)

darcs-hash:20061113122645-9b6ab-e5f5be2e88eea7eb00643e6a5210086f46191c30.gz


# 579b0f7e 12-Nov-2006 TNHarris <telliamed@fastmail.us>

Word-Length Indexer

A modification to the indexer that sorts words based on length. This should make
searching a little bit more efficient. After the patch is applied, your old index
will be automat

Word-Length Indexer

A modification to the indexer that sorts words based on length. This should make
searching a little bit more efficient. After the patch is applied, your old index
will be automatically converted to the new format (when you visit a page). The
new index format is:

1. Index files are stored in savedir/index
2. Word lists are stored as wlen.idx. This used to be word.idx.
3. Word indexes are stored as ilen.idx. This used to be index.idx.
4. The page list, page.idx, is simply copied to the new location.

Any plugins you have, such as the blog plugin, that read the index files need to
be updated.

darcs-hash:20061112194900-2b9f0-a975498ccf0a1d39c6df73b79bcd028d5e81c389.gz

show more ...


# 27fbc761 24-Sep-2006 chris <chris@jalakai.co.uk>

update to previous changes cache patch

- fix potential array key collisions
- restore ability to keep a minimum number ($conf['recent']) of recent changes
irregardless of date of change

darcs-has

update to previous changes cache patch

- fix potential array key collisions
- restore ability to keep a minimum number ($conf['recent']) of recent changes
irregardless of date of change

darcs-hash:20060924162105-9b6ab-06350f04f9d9ac4c362f13787b682ef70887a1fc.gz

show more ...


# 07cb64f8 24-Sep-2006 Andreas Gohr <andi@splitbrain.org>

fix for sitemap creation with new compression option #919

darcs-hash:20060924100606-7ad00-7e0bc1fa7778669ac352f8d8994acbb7517323cd.gz


# 40b33eff 23-Sep-2006 chris <chris@jalakai.co.uk>

fix recent changes cache ordering

This patch fixes a bug in indexer.php which resulted in the order
of the recent changes cache being reversed each time it was trimmed.

It also adds sorting to both

fix recent changes cache ordering

This patch fixes a bug in indexer.php which resulted in the order
of the recent changes cache being reversed each time it was trimmed.

It also adds sorting to both getRecents() and runTrimRecentChanges()
as a defensive measure against the order of the file being corrupted.

darcs-hash:20060923235109-9b6ab-0f4062c1b02449cce9382426174cd22d71387e5a.gz

show more ...


# 847c2f3b 17-Sep-2006 Andreas Gohr <andi@splitbrain.org>

no gzipping in indexer.php

darcs-hash:20060917140718-7ad00-ab1b95974ad63966c770f787112bc7c2e285c394.gz


# d8186216 08-Sep-2006 Ben Coburn <btcoburn@silicodon.net>

suppress boring errors

Suppress any errors from set_time_limit,
unlink, and file_exists functions.
see: http://www.freelists.org/archives/dokuwiki/09-2006/msg00004.html

darcs-hash:20060908193433-05

suppress boring errors

Suppress any errors from set_time_limit,
unlink, and file_exists functions.
see: http://www.freelists.org/archives/dokuwiki/09-2006/msg00004.html

darcs-hash:20060908193433-05dcb-013617431870ab5bfb2ce8c6e99ba5af13493228.gz

show more ...


# 71726d78 30-Aug-2006 Ben Coburn <btcoburn@silicodon.net>

scalable changelog redesign

This patch provides a rewritten changelog system that is designed to run
efficiently on both small and large wikis. The patch includes a plugin to
convert changelogs from

scalable changelog redesign

This patch provides a rewritten changelog system that is designed to run
efficiently on both small and large wikis. The patch includes a plugin to
convert changelogs from the current format. The conversion is
non-destructive and happens automatically. For more information on the new
changelog format see "http://wiki.splitbrain.org/wiki:changelog".

Structure
In short the changelog is now stored in per-page changelog files, with a
recent changes cache. The recent changes cache is kept in
"/data/meta/_dokuwiki.changes" and trimmed daily. The per-page changelogs
are kept in "/data/meta/<ns>/<page_id>.changes" files. To preserve
revision information for revisions stored in the attic, the "*.changes"
files are not removed when their page is deleted. This allows the full
life-cycle of page creation, deletion, and reversion to be tracked.

Format
The changelog line format now uses a general "line type" field in place of
the special "minor" change syntax. There is also an extra field that can
be used to store arbitrary data associated with special line types. The
reverted line type (R) is a good example. There the extra field holds the
revision date used as the source for reverting the page. See the wiki for
the complete syntax description.

Code Notes
The changelog functions have been rewritten to load the whole file only if
it is small. For larger files, the function loads only the relevant
chunk(s). Parsed changelog lines are cached in memory to speed future
function calls.

getRevisionInfo
A binary search is used to locate the chunk expected to contain the
requested revision. The whole chunk is parsed, and adjacent lines are
optimistically cached to speed consecutive calls.

getRevisions
Reads the changelog file backwards (newest first) in chunks until the
requested number of lines have been read. Parsed changelog lines are
cached for subsequent calls to getRevisionInfo. Because revisions are read
from the changelog they are no longer guaranteed to exist in the attic.

(Note: Even with lines of arbitrary length getRevisionInfo and
getRevisions never split changelog lines while reading. This is done by
sliding the "file pointer" forward to the end of a line after each blind
seek.)

isMinor
Removed. To detect a minor edit check the type as follows:
$parsed_logline['type']

darcs-hash:20060830182753-05dcb-1c5ea17f581197a33732a8d11da223d809c03506.gz

show more ...


# fd0cab2e 01-Jul-2006 Andreas Gohr <andi@splitbrain.org>

check if ignore_user_abort was successful (maybe fix for #852)

darcs-hash:20060701120325-7ad00-07efe9cacd51043ad95d8d2d71d8680036721286.gz


# 0280d25b 29-May-2006 Andreas Gohr <andi@splitbrain.org>

fixed google sitemap pinging #815

darcs-hash:20060529183003-7ad00-de0e3acac75a9f94f6c27f67651eeabe40411d7a.gz


# 10eae714 26-May-2006 Andreas Gohr <andi@splitbrain.org>

fix for sitemap creation #813

darcs-hash:20060526223358-7ad00-2bdfd39a5dd8ca09101288834cc75e5e963afda5.gz


# 7421c3cc 11-May-2006 Andreas Gohr <andi@splitbrain.org>

more info is gathered on metaupdate in background indexer

The background indexer now gathers info on contributors and modification
dates from the changelog when adding the missing meta info.

A new

more info is gathered on metaupdate in background indexer

The background indexer now gathers info on contributors and modification
dates from the changelog when adding the missing meta info.

A new io_grep function was added which might be useful for other parts in
the Wiki as well.

darcs-hash:20060511191450-7ad00-baba1b48ea03b823c88a480862c612316f159b5a.gz

show more ...


1234