
Update GPT4ALL integration #4567

Merged · 4 commits merged into langchain-ai:master on May 18, 2023
Conversation

@Chae4ek (Contributor) commented May 12, 2023

Update GPT4ALL integration

GPT4ALL has completely changed its bindings. The new implementation is a bit odd and doesn't fit well into base.py, and it will probably change again, so this is a temporary solution.

Fixes #3839, #4628

@AndriyMulyar (Contributor)

We are standardizing on these bindings (they are the best multiplatform option); no more large changes 😀

@Chae4ek (Contributor, Author) commented May 13, 2023

Thanks a lot, @AndriyMulyar!
I'm a little confused by these parameters; they look like implementation details.
Also, I see you've added MPT support. Do you plan to keep those params the same for every model? If so, I guess it's better to remove this check, just so it works for future models without updating it every time.

@AndriyMulyar (Contributor)

Let's wait to merge anything until the Python bindings dev chimes in.

@rguo123 commented May 13, 2023

Hey @Chae4ek! The parameters are needed to set the prompt context correctly. They typically won't require any changes on the user side and, as far as I know, work well across the MPT, LLaMA, and GPT-J models. However, I do think they are nice to have, since they can change model behavior (for example, I have played with the temp param at times to see how it affects the generated text), and they are "technically" required inputs to the C function calls.
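For a sense of how these parameters surface to users, here is a hypothetical sketch of the LangChain wrapper; the field names (`temp`, `top_p`, `top_k`) and the model filename are assumptions based on this discussion, not the exact merged API:

```python
# A hypothetical sketch, not the exact merged API: field names such as
# `temp`, `top_p`, and `top_k` are assumptions based on this discussion.
from langchain.llms import GPT4All

llm = GPT4All(
    model="/models/ggml-gpt4all-j-v1.3-groovy.bin",  # path to a local model file
    temp=0.7,   # sampling temperature; higher values give more varied text
    top_p=0.9,  # nucleus-sampling cutoff
    top_k=40,   # top-k sampling cutoff
)
print(llm("Explain prompt context in one sentence."))
```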

I'm not sure what that backend check was for, but when loading a GPT4All model through the Python bindings, we will load the correct model. The only case where this doesn't apply is a custom model not provided by us but built on a valid ggml/llama.cpp backend. In that case, providing model_type is needed for us to know which architecture to use. Hope this provides some clarification. Let me know if there's anything else I can help with!
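A hedged sketch of the custom-model case described above; the `backend` field name (mirroring the bindings' model_type) and the file paths are assumptions:

```python
# Hypothetical sketch: for an official model the architecture is detected
# automatically, but a custom ggml/llama.cpp-based model must name it.
from langchain.llms import GPT4All

official = GPT4All(model="/models/ggml-gpt4all-j-v1.3-groovy.bin")

custom = GPT4All(
    model="/models/my-finetuned-llama.bin",  # not in the official model list
    backend="llama",  # assumed field mirroring the bindings' model_type
)
```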

@Chae4ek (Contributor, Author) commented May 13, 2023

The backend check was needed to pass different parameters, because pygpt4all used a different set. I'll remove it for now.

logits_size, tokens_size, and n_past are unclear to me. I'm looking at the implementation right now, and:

- logits_size is actually unused and can be safely removed; logits.size() is used instead. In any case, it's only used in sample_top_k_top_p and the evaluation functions and always has size = n_vocab [1][2]. llama.cpp has a parameter to change this to n_vocab*N; you'll probably want to add that too.
- tokens_size is unused as well.
- n_past holds the model's current state so it can remember a conversation, as I understand it. Currently it resets on every generate call, so I think it should at least be removed from here; it's impossible to preserve this state from Python (see the sketch after this list).

The rest of the parameters are pretty clear, thanks for the clarification!
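To make the n_past point concrete, here is a toy, self-contained sketch; `eval_tokens` is a stand-in for the real C evaluation call, not part of the gpt4all bindings:

```python
# `cache` plays the role of the model's KV cache; `n_past` counts tokens
# the model has already evaluated.
def eval_tokens(cache: list, tokens: list, n_past: int) -> None:
    del cache[n_past:]  # positions beyond n_past get overwritten
    cache.extend(tokens)

cache, n_past = [], 0
for turn in ([1, 2, 3], [4, 5]):  # two conversation turns, as token ids
    eval_tokens(cache, turn, n_past)
    n_past += len(turn)           # carrying n_past forward keeps the state

assert cache == [1, 2, 3, 4, 5]   # the first turn is still in context

# If n_past is reset to 0 on every generate() call, the second turn
# overwrites the first, and the model forgets the conversation.
```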

@AndriyMulyar (Contributor) commented May 14, 2023 via email

@Chae4ek (Contributor, Author) commented May 14, 2023

I hope it will be determined from the actual file; I don't understand how it's possible to do that based only on the file name.

@AndriyMulyar (Contributor) commented May 14, 2023 via email

@Chae4ek (Contributor, Author) commented May 14, 2023

I'd prefer to use the model path without splitting it into a directory path and a file name, but it's up to you.

@dev2049 (Contributor) commented May 16, 2023

@rguo123 @AndriyMulyar how does it look now?

@rguo123 commented May 17, 2023

Chiming in here on auto-suggest / installation path. We have a C library method for deciding the correct model to load based on magic numbers in the model binaries. We haven't ported it over to the Python bindings yet, but it may be a good temporary solution. The magic number encodes the context size, which differs across all these models.

This may not work in the future if different model architectures end up with the same context size, but it could be useful for now. I am wary of adapting it, because it makes the logic for deciding the model more complicated than simple ifs on model names, and it will not be a universal solution in the long term.
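As a rough illustration of that approach, a Python port might read the leading bytes of the file and map them to an architecture; the constants below are placeholders, not the real magic values from the C library:

```python
import struct
from typing import Optional

# Placeholder magic values; the real ones live in the gpt4all C library.
MAGIC_TO_MODEL_TYPE = {
    0x67676D6C: "gptj",   # placeholder
    0x67676A74: "llama",  # placeholder
}

def detect_model_type(model_file: str) -> Optional[str]:
    """Read the 4-byte magic at the start of a ggml binary and map it."""
    with open(model_file, "rb") as f:
        (magic,) = struct.unpack("<I", f.read(4))
    return MAGIC_TO_MODEL_TYPE.get(magic)
```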

@rguo123 commented May 17, 2023

I think having the filename be part of model_path is reasonable, and I have no strong feelings there if we want to see that change in the gpt4all package. I do think keeping the model filename and directory separate lines up more with Hugging Face's from_pretrained pattern, which is why I have a slight preference for keeping the original implementation (both shapes are contrasted in the sketch below). Let me know if there are stronger leanings on either side.
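To contrast the two shapes, here is a toy resolver; the function and parameter names are illustrative, not the gpt4all API:

```python
import os
from typing import Optional

def resolve_model_file(model_path: str, model_name: Optional[str] = None) -> str:
    """Return the full path to the model file under either convention."""
    if model_name is None:
        return model_path                        # (a) path already includes the filename
    return os.path.join(model_path, model_name)  # (b) from_pretrained-style split

# (a) single path that already includes the filename
resolve_model_file("/models/ggml-gpt4all-j-v1.3-groovy.bin")

# (b) directory plus filename, kept separate
resolve_model_file("/models", "ggml-gpt4all-j-v1.3-groovy.bin")
```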

@rguo123 commented May 17, 2023

PR looks good to me @dev2049!

@AndriyMulyar (Contributor)

I think this is ready for merge pending the resolution of the above threads

@Chae4ek (Contributor, Author) commented May 17, 2023

@HyunggyuJang, @rguo123, could you please double check the latest changes and resolve the above conversations?

@rguo123 commented May 17, 2023

It doesn't look like I have permission to resolve conversations here? PR LGTM!

@Chae4ek (Contributor, Author) commented May 17, 2023

Oh, OK. I resolved them.

@HyunggyuJang left a comment

LGTM!

@dev2049 merged commit c9e2a01 into langchain-ai:master on May 18, 2023
@srinicodeit

When will this be available on PyPI as part of the new release?

Successfully merging this pull request may close the issue: Unable to use gpt4all model
6 participants