| #
7775eee7 |
| 29-Oct-2025 |
Andreas Gohr <gohr@cosmocode.de> |
fix interface handling with generic model
|
| #
d72a84c5 |
| 11-Jun-2025 |
Andreas Gohr <gohr@cosmocode.de> |
move api endpoint init to AbstractModel
This should finally solve the chicken-egg issue. This also renames the ollama url parameter from baseurl to apiurl and sets a sensible default
|
| #
7c3b69cb |
| 11-Jun-2025 |
Andreas Gohr <gohr@cosmocode.de> |
refactor HTTP Client initialization
|
| #
7be8078e |
| 15-Apr-2025 |
Andreas Gohr <andi@splitbrain.org> |
allow models to have a zero token limit
This allows for configuring completely unknown models. For these models no token limit is known and we will simply do not apply any. Instead we trust that the
allow models to have a zero token limit
This allows for configuring completely unknown models. For these models no token limit is known and we will simply do not apply any. Instead we trust that the model will be either large enough to handle our input or at least throw useful error messages.
show more ...
|
| #
e3b34a2b |
| 15-Apr-2025 |
Andreas Gohr <andi@splitbrain.org> |
fix model full name
|
| #
2e22aefb |
| 15-Apr-2025 |
Andreas Gohr <andi@splitbrain.org> |
introduce a "Generic" model
This is basically an OpenAI implementation, which then can be reused by most other providers since they usually implement a somewhat compatible API. The API URL can be se
introduce a "Generic" model
This is basically an OpenAI implementation, which then can be reused by most other providers since they usually implement a somewhat compatible API. The API URL can be set manually so it should be easy to add new unsupported providers.
show more ...
|
| #
42b2c6e8 |
| 12-Mar-2025 |
Andreas Gohr <andi@splitbrain.org> |
add remote component to ask questions to the bot
The endpoint allows to override model and language settings on demand.
|
| #
4dd0657e |
| 06-Feb-2025 |
Andreas Gohr <andi@splitbrain.org> |
allow to set arbitrary models
We now initialize a model configuration even if we have no info in model.json using some default values for the token limits.
Models can implement the loadUnkonwModelI
allow to set arbitrary models
We now initialize a model configuration even if we have no info in model.json using some default values for the token limits.
Models can implement the loadUnkonwModelInfo() method to fetch the info from an API if such a thing exist. Implemented for gemini and ollama currently.
show more ...
|
| #
b446155b |
| 27-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
fix info output on used models
|
| #
2071dced |
| 21-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
automatic stylefixes
|
| #
c2b7a1f7 |
| 21-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
various refactoring and introduction of a simulate command
The new command makes it easier to run the same chat questions against multiple models and compare the results in a spreadsheet
|
| #
51aa8517 |
| 20-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
separate the rephrasing model from the chat model
Rephrasing can be done with faster, simpler models as there is not much reasoning needed.
|
| #
cfd76f4a |
| 19-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
added Mistral to the list of models
|
| #
dce0dee5 |
| 19-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
move model configuration into json files
This removes the use of individual classes for each model in favor of more general client classes that only get a model name passed. The meta info about pric
move model configuration into json files
This removes the use of individual classes for each model in favor of more general client classes that only get a model name passed. The meta info about price and token limits is configured in a json file
show more ...
|
| #
34a1c478 |
| 19-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
more refactoring on chat and embed model support
* differentiate between input and output tokens * make use of much larger input contexts
|
| #
d02b7935 |
| 18-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
first tries with Anthropic
|
| #
4373d2bf |
| 18-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
stricter interface inheritance
This ensures we have the apropriate methods when a class does not inherit from AbstractModel.
|
| #
294a9eaf |
| 18-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
Use interfaces for Chat and Embedding classes
This way it's easier to have a base OpenAI class. This also moves much of the statistics and http handling into the base class making model implementati
Use interfaces for Chat and Embedding classes
This way it's easier to have a base OpenAI class. This also moves much of the statistics and http handling into the base class making model implementations even leaner
show more ...
|
| #
6a18e0f4 |
| 14-Mar-2024 |
Andreas Gohr <andi@splitbrain.org> |
First start on refactoring the class hierarchy
This splits embedding models from chat completion models.
|
| #
7ebc7895 |
| 13-Sep-2023 |
splitbrain <splitbrain@users.noreply.github.com> |
Automatic code style fixes
|
| #
f6ef2e50 |
| 14-Jun-2023 |
Andreas Gohr <andi@splitbrain.org> |
refactoring to make models selectable
This makes it much easier to add new models. Models can now be selected via the configuration
|