| ae1e222d | 31-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
Version upped |
| adfc5429 | 29-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
generate clusters only if more than 3 clusters would be created |
| e33a1d7a | 28-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
optionally search one language only |
| 754b8394 | 24-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
keep last question/answer as additional context
This should help with questions relating to the last answer when they haven't been rephrased well enough |
| 8bb3d0cd | 16-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
Version upped |
| 219268b1 | 16-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
option to prefer a configured language instead of guessing |
| 8c8b7ba6 | 16-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
Added dumping of TSV files to SQLite store
This allows visualizing the embed vectors |
| 2ad4d158 | 16-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
finish similar pages feature |
| 376b909c | 15-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
Version upped |
| 3379af09 | 15-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
use a k-means based cluster approach to speed up similarity searches |
| 35555bac | 15-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
simplify cosine distance calculation
Since all OpenAI vectors are normalized, only the dot product needs to be calculated for the distance. This saves a couple of floating point ops per chunk, but doesn't make a huge difference overall. |
| e33f1423 | 14-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
Version upped |
| aee9b383 | 14-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
output info on similar chunk fetching
helps with figuring out how fast the store is |
| 01f06932 | 10-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
First go at syntax to display similar pages |
| e7d7ef4c | 14-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
reuse link colors when styling chat and buttons |
| 6dd3ce09 | 14-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
brainier button icon |
| 13dbfc23 | 10-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
implement Pinecone based storage |
| 68b6fa79 | 10-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
First go at syntax to display similar pages |
| 57665000 | 07-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
Version upped |
| 0eeff420 | 07-Aug-2023 |
Andreas Gohr <andi@splitbrain.org> |
support GPT4 |
| b0bd61d2 | 15-Jun-2023 |
Andreas Gohr <andi@splitbrain.org> |
Version upped |
| 3d0065c7 | 15-Jun-2023 |
Andreas Gohr <andi@splitbrain.org> |
fix visibility changes in chat dialog |
| 9f6b34c4 | 15-Jun-2023 |
Andreas Gohr <andi@splitbrain.org> |
lazy load embeddings, storage and model
This should fix an immediate problem with the sqlite db initializing slowly, but the underlying problem has to be identified. |
| 622aa5a8 | 15-Jun-2023 |
Andreas Gohr <andi@splitbrain.org> |
added button to reset the conversation
This is needed to start conversing on a new topic |
| 81b450c8 | 14-Jun-2023 |
Andreas Gohr <andi@splitbrain.org> |
use a cut-off point when considering similar documents |
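
Note on commit 35555bac ("simplify cosine distance calculation"): the reasoning in that message can be checked with a small sketch. This is not the plugin's code; the helper names `dot`, `normalize` and `cosine_similarity` are hypothetical and the vectors are made-up two-dimensional examples. It only illustrates that for unit-length vectors the full cosine formula and the plain dot product give the same value.

```python
import math

def dot(a, b):
    # Plain dot product of two equal-length vectors.
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    # Scale a vector to unit length (hypothetical helper for this sketch).
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]

def cosine_similarity(a, b):
    # Full formula: dot product divided by the product of both norms.
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))

# Made-up example vectors, normalized to unit length.
a = normalize([3.0, 4.0])
b = normalize([4.0, 3.0])

full = cosine_similarity(a, b)  # computes two norms that are both 1
fast = dot(a, b)                # skips the norms entirely
```

Because both norms equal 1, the division in `cosine_similarity` changes nothing, which matches the message's remark that the saving is only a few floating point operations per chunk.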