Add gpt4all local models, including an embedding provider #454
Regarding the model size discussion: as an FYI, there is a machine-readable JSON file from which the download size, required RAM, description, and prompt template for each model can be extracted, as discussed in #413.
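As a rough illustration of the point above, here is a minimal sketch of pulling those four attributes out of such a JSON document. The field names (`filesize`, `ramrequired`, `description`, `promptTemplate`) and the sample entry are assumptions made for this example, not a confirmed schema; check the actual file before relying on them.

```python
import json

# Hypothetical sample entry; the real file's keys and units may differ.
SAMPLE_MODELS_JSON = json.dumps([
    {
        "name": "Example Model",
        "filename": "example-model.gguf",
        "filesize": "4000000000",   # assumed: bytes, stored as a string
        "ramrequired": "8",         # assumed: gigabytes, stored as a string
        "description": "An example entry used only for this sketch.",
        "promptTemplate": "### Human:\n%1\n### Assistant:\n",
    }
])


def summarize_models(raw_json: str) -> list[dict]:
    """Extract size, RAM, description, and prompt template per model."""
    summaries = []
    for entry in json.loads(raw_json):
        filesize = entry.get("filesize")
        ram = entry.get("ramrequired")
        summaries.append({
            "name": entry.get("name", "unknown"),
            # Convert bytes to gigabytes when the field is present.
            "download_gb": round(int(filesize) / 1e9, 2) if filesize else None,
            "ram_gb": int(ram) if ram else None,
            "description": entry.get("description", ""),
            "prompt_template": entry.get("promptTemplate"),
        })
    return summaries
```

Calling `summarize_models(SAMPLE_MODELS_JSON)` returns one dictionary per model, which could then feed documentation or a model picker.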
@krassowski
@meeseeksdev please backport to 1.x
…bedding provider
…ovider (#462) Co-authored-by: Piyush Jain <[email protected]>
…#454)
* Added models for gpt4all.
* Removed replit model.
* Updated docs for gpt4all.
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* Updated docs.
* A fix to stop download of embeddings model.
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Description
Adds new local models from gpt4all, along with an embedding provider for generating embeddings locally with /learn. Also fixes a small bug in the base chat handler: every exception raised currently causes a further error because the exception-handling method is not awaited.
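For readers unfamiliar with this class of bug, the sketch below shows what happens when an async exception handler is called without `await`. The names (`handle_exc`, `process_buggy`, `process_fixed`) are hypothetical and chosen for illustration; this is not the project's actual handler code, and the exact symptom in the real handler may differ.

```python
import asyncio


async def handle_exc(exc: Exception, log: list) -> None:
    """Hypothetical async error handler that records the failure."""
    log.append(f"handled: {exc}")


async def process_buggy(log: list) -> None:
    try:
        raise ValueError("model failed")
    except Exception as exc:
        # Bug: calling an async function without `await` only creates a
        # coroutine object; the handler body never runs.
        coro = handle_exc(exc, log)
        # Closed here only to suppress the "never awaited" warning in this demo.
        coro.close()


async def process_fixed(log: list) -> None:
    try:
        raise ValueError("model failed")
    except Exception as exc:
        # Fix: awaiting the coroutine actually executes the handler.
        await handle_exc(exc, log)


def run_demo() -> tuple:
    buggy_log, fixed_log = [], []
    asyncio.run(process_buggy(buggy_log))
    asyncio.run(process_fixed(fixed_log))
    return buggy_log, fixed_log
```

In this sketch, `run_demo()` leaves the first log empty and the second with one handled entry, which is the difference a single `await` makes.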
Note
Could not add the replit-code-v1_5-3b-q4_0.gguf model, which crashes the server and throws this error when chat is used. If you know how to fix this, please contribute or leave a suggestion in this PR.