| #
9634d734 |
| 21-May-2025 |
Andreas Gohr <gohr@cosmocode.de> |
add option to always send full page context
|
| #
7be8078e |
| 15-Apr-2025 |
Andreas Gohr <andi@splitbrain.org> |
allow models to have a zero token limit
This allows for configuring completely unknown models. For these models no token limit is known and we will simply do not apply any. Instead we trust that the
allow models to have a zero token limit
This allows for configuring completely unknown models. For these models no token limit is known and we will simply do not apply any. Instead we trust that the model will be either large enough to handle our input or at least throw useful error messages.
show more ...
|
| #
ed47fd87 |
| 27-Mar-2025 |
Andreas Gohr <andi@splitbrain.org> |
new UI with option to chat about the current page
|
| #
666b8ea7 |
| 29-Jul-2024 |
Andreas Gohr <andi@splitbrain.org> |
added configurable custom prompt
This makes it easier to add to the existing internal prompt instead of completely overwriting it using the localization mechanism.
|
| #
8c08cb3f |
| 27-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
auto style fixes
|
| #
87090e4b |
| 27-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
ask the rephrased question only if it has more context
Otherwise it's usually better to let the LLM reevaluate the context.
|
| #
0de7e020 |
| 25-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
mechanisms to override things on command line
This should help with debugging/evaluating
|
| #
2071dced |
| 21-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
automatic stylefixes
|
| #
59a2a267 |
| 21-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
prefer prompted user messages over system prompts
This seems to be better supported across different model providers and seems to influence the model in a stronger way.
Prompt files have been renam
prefer prompted user messages over system prompts
This seems to be better supported across different model providers and seems to influence the model in a stronger way.
Prompt files have been renamed to avoid having them show up in the DokuWiki translate interface. Multilingual models are fine with prompting in english only.
show more ...
|
| #
c2b7a1f7 |
| 21-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
various refactoring and introduction of a simulate command
The new command makes it easier to run the same chat questions against multiple models and compare the results in a spreadsheet
|
| #
51aa8517 |
| 20-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
separate the rephrasing model from the chat model
Rephrasing can be done with faster, simpler models as there is not much reasoning needed.
|
| #
04afb84f |
| 19-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
correctly use storage setting
|
| #
dce0dee5 |
| 19-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
move model configuration into json files
This removes the use of individual classes for each model in favor of more general client classes that only get a model name passed. The meta info about pric
move model configuration into json files
This removes the use of individual classes for each model in favor of more general client classes that only get a model name passed. The meta info about price and token limits is configured in a json file
show more ...
|
| #
34a1c478 |
| 19-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
more refactoring on chat and embed model support
* differentiate between input and output tokens * make use of much larger input contexts
|
| #
d02b7935 |
| 18-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
first tries with Anthropic
|
| #
294a9eaf |
| 18-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
Use interfaces for Chat and Embedding classes
This way it's easier to have a base OpenAI class. This also moves much of the statistics and http handling into the base class making model implementati
Use interfaces for Chat and Embedding classes
This way it's easier to have a base OpenAI class. This also moves much of the statistics and http handling into the base class making model implementations even leaner
show more ...
|
| #
6a18e0f4 |
| 14-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
First start on refactoring the class hierarchy
This splits embedding models from chat completion models.
|
| #
cfaf6b32 |
| 07-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
tell the model to fall back to current language if it's unsure
The model defaulted to spanish if no sources were found
|
| #
e75dc39f |
| 14-Feb-2024 |
Andreas Gohr <andi@splitbrain.org> |
record the times of embed and maintenance runs
This makes it easier to debug when something with the cronjob goes wrong. Currently the data is only exposed in the cli info command.
We might want to
record the times of embed and maintenance runs
This makes it easier to debug when something with the cronjob goes wrong. Currently the data is only exposed in the cli info command.
We might want to use it somewhere in the UI to warn about outdated data
show more ...
|
| #
4c0099a8 |
| 08-Nov-2023 |
Andreas Gohr <andi@splitbrain.org> |
Added Qdrant Storage
|
| #
441edf84 |
| 08-Nov-2023 |
Andreas Gohr <andi@splitbrain.org> |
fixed overlong lines
|
| #
30b9cbc7 |
| 08-Nov-2023 |
splitbrain <splitbrain@users.noreply.github.com> |
Automatic code style fixes
|
| #
5e6dd16e |
| 08-Nov-2023 |
Andreas Gohr <andi@splitbrain.org> |
Added ChromaDB vector storage support
|
| #
f8d5ae01 |
| 13-Sep-2023 |
Andreas Gohr <andi@splitbrain.org> |
codesniffer cleanups
|
| #
7ebc7895 |
| 13-Sep-2023 |
splitbrain <splitbrain@users.noreply.github.com> |
Automatic code style fixes
|