| c6ae286a | 21-Jan-2026 |
Eduardo Mozart de Oliveira <2974895+eduardomozart@users.noreply.github.com> |
Update Gemini model configurations
Updated model configurations to include new Gemini versions and removed outdated ones. |
| 7775eee7 | 29-Oct-2025 |
Andreas Gohr <gohr@cosmocode.de> |
fix interface handling with generic model |
| 7bd2bac6 | 28-Oct-2025 |
Andreas Gohr <gohr@cosmocode.de> |
make ollama auth optional |
| d72a84c5 | 11-Jun-2025 |
Andreas Gohr <gohr@cosmocode.de> |
move api endpoint init to AbstractModel
This should finally solve the chicken-egg issue. This also renames the ollama url parameter from baseurl to apiurl and sets a sensible default |
| ac84f472 | 11-Jun-2025 |
Andreas Gohr <gohr@cosmocode.de> |
fix groq test and temperature issue |
| 332eea25 | 11-Jun-2025 |
Andreas Gohr <gohr@cosmocode.de> |
fixes based on tests, improve tests |
| 7c3b69cb | 11-Jun-2025 |
Andreas Gohr <gohr@cosmocode.de> |
refactor HTTP Client initialization |
| 7be8078e | 15-Apr-2025 |
Andreas Gohr <andi@splitbrain.org> |
allow models to have a zero token limit
This allows for configuring completely unknown models. For these models no token limit is known and we will simply do not apply any. Instead we trust that the
allow models to have a zero token limit
This allows for configuring completely unknown models. For these models no token limit is known and we will simply do not apply any. Instead we trust that the model will be either large enough to handle our input or at least throw useful error messages.
show more ...
|
| e3b34a2b | 15-Apr-2025 |
Andreas Gohr <andi@splitbrain.org> |
fix model full name |
| 2e22aefb | 15-Apr-2025 |
Andreas Gohr <andi@splitbrain.org> |
introduce a "Generic" model
This is basically an OpenAI implementation, which then can be reused by most other providers since they usually implement a somewhat compatible API. The API URL can be se
introduce a "Generic" model
This is basically an OpenAI implementation, which then can be reused by most other providers since they usually implement a somewhat compatible API. The API URL can be set manually so it should be easy to add new unsupported providers.
show more ...
|
| c7364d3d | 01-Apr-2025 |
Andreas Gohr <andi@splitbrain.org> |
rewrite roles for Gemini
Gemini expects the role model instead of assistant. Older models allowed both, the newer ones not. |
| 5835c70b | 27-Mar-2025 |
Andreas Gohr <andi@splitbrain.org> |
updated model data |
| 42b2c6e8 | 12-Mar-2025 |
Andreas Gohr <andi@splitbrain.org> |
add remote component to ask questions to the bot
The endpoint allows to override model and language settings on demand. |
| 163a36ec | 10-Mar-2025 |
Andreas Gohr <andi@splitbrain.org> |
Adjust OpenAI API calls. fixes #34
* max_token has been replaced by max_completion_tokens * temperature setting is not suported by reasoning models |
| d481c63c | 25-Feb-2025 |
Andreas Gohr <andi@splitbrain.org> |
ollama: remove thinking part from deepseek answers |
| cc7172ce | 25-Feb-2025 |
Andreas Gohr <andi@splitbrain.org> |
ollama: fix embed endpoint and error in model file
We expect /api to be part of the configured base URL so it should not be part of the call itself. |
| 94b5d70e | 14-Feb-2025 |
Max Theisen <96209029+MaxThFe@users.noreply.github.com> |
Update EmbeddingModel.php to remove the :lastest suffix |
| 552bc116 | 13-Feb-2025 |
Max Theisen <96209029+MaxThFe@users.noreply.github.com> |
Update models.json with new llama3.3 70b |
| d2192bba | 13-Feb-2025 |
Max Theisen <96209029+MaxThFe@users.noreply.github.com> |
Update EmbeddingModel.php for new Ollama API |
| 8b9ccd0e | 12-Feb-2025 |
Eduardo Mozart de Oliveira <2974895+eduardomozart@users.noreply.github.com> |
Update models.json |
| 82a6394a | 12-Feb-2025 |
Eduardo Mozart de Oliveira <2974895+eduardomozart@users.noreply.github.com> |
Remove deprecated Groq models |
| da493c96 | 06-Feb-2025 |
Andreas Gohr <andi@splitbrain.org> |
some model updates
still not 100% uptodate |
| 4dd0657e | 06-Feb-2025 |
Andreas Gohr <andi@splitbrain.org> |
allow to set arbitrary models
We now initialize a model configuration even if we have no info in model.json using some default values for the token limits.
Models can implement the loadUnkonwModelI
allow to set arbitrary models
We now initialize a model configuration even if we have no info in model.json using some default values for the token limits.
Models can implement the loadUnkonwModelInfo() method to fetch the info from an API if such a thing exist. Implemented for gemini and ollama currently.
show more ...
|
| ee778380 | 05-Dec-2024 |
Andreas Gohr <andi@splitbrain.org> |
added bge-large embedding model for ollama |
| 4b794a5b | 14-Nov-2024 |
Andreas Gohr <andi@splitbrain.org> |
updated Ollama models
I only added a couple of popular ones. It's hard to find the context and dimension limits. |