
feat: pull model list for openai-compatible endpoints #630

Merged: 10 commits merged into main from pull-model-list on Dec 22, 2023

Conversation

@cpacker (Collaborator) commented Dec 16, 2023

Closes #592

Please describe the purpose of this pull request.

  • During memgpt configure, if using an OpenAI endpoint, pull the list of available models
  • Also, allow users to choose an "enter yourself" custom option (useful for custom OpenAI-compatible endpoints); a sketch of the idea follows this list
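
A minimal sketch of the idea, assuming the standard OpenAI-compatible /v1/models route and the requests library (fetch_model_options is a hypothetical helper name, not necessarily what cli_config.py uses):

```python
import requests

def fetch_model_options(base_url: str, api_key: str | None = None) -> list[str]:
    """Query an OpenAI-compatible endpoint for the models it serves."""
    headers = {"Authorization": f"Bearer {api_key}"} if api_key else {}
    resp = requests.get(f"{base_url.rstrip('/')}/v1/models", headers=headers, timeout=10)
    resp.raise_for_status()
    # OpenAI-compatible servers return {"data": [{"id": "<model-name>", ...}, ...]}
    model_ids = [m["id"] for m in resp.json().get("data", [])]
    # Always keep a manual escape hatch for unusual or self-hosted endpoints
    return model_ids + ["[enter yourself]"]
```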

How to test

Run memgpt configure

OpenAI: [screenshot]

Azure: [screenshot]

vLLM: [screenshot]

Have you tested this PR?

  • Tested on OpenAI
  • Tested on Azure OpenAI
  • Tested on vLLM

Other comments

@sarahwooders

  • I'm concerned that for some users the total number of models is information overload (see screenshot).
  • Also, this is even after filtering to models prefixed with gpt-
  • Additionally, some models in the list are misleading to show (e.g. gpt-4-vision-preview shows up, but we don't support vision inputs at the moment); a possible filter is sketched below
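
For illustration, one way such a filter could look (a hedged sketch, not the actual logic in this PR):

```python
def filter_model_list(model_ids: list[str]) -> list[str]:
    """Trim an OpenAI model listing down to options that make sense to offer."""
    filtered = []
    for model_id in model_ids:
        if not model_id.startswith("gpt-"):
            continue  # drop embeddings, audio, image models, etc.
        if "vision" in model_id:
            continue  # vision inputs aren't supported, so hide e.g. gpt-4-vision-preview
        filtered.append(model_id)
    return sorted(filtered)
```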

@sarahwooders (Collaborator) left a comment

Could you also call the same function for the case where vllm is the endpoint type? e.g. with our endpoint you can do http://api.memgpt.ai/v1/models (since we require the OpenAI compatible vLLM endpoint).

Also might be nice to add Azure too, but not that important.
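
For reference, hitting that route directly looks roughly like this (illustrative sketch only; the response schema is assumed to follow the OpenAI format):

```python
import requests

# Any OpenAI-compatible server exposes the same listing route, so the lookup
# can be shared across the openai, azure, and vllm endpoint types.
resp = requests.get("http://api.memgpt.ai/v1/models", timeout=10)
resp.raise_for_status()
print([m["id"] for m in resp.json()["data"]])
```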

(Inline review comment on memgpt/cli/cli_config.py, marked outdated and resolved)
@cpacker (Collaborator, Author) commented Dec 16, 2023

> Could you also call the same function for the case where vllm is the endpoint type? e.g. with our endpoint you can do http://api.memgpt.ai/v1/models (since we require the OpenAI compatible vLLM endpoint).
>
> Also might be nice to add Azure too, but not that important.

Added vLLM support + tested against our vLLM API (see picture in OP)

@cpacker changed the title from "Pull model list" to "feat: pull model list" on Dec 16, 2023
@cpacker changed the title from "feat: pull model list" to "feat: pull model list for openai-compatible endpoints" on Dec 16, 2023
@cpacker added the priority (Merge ASAP) label on Dec 21, 2023
@cpacker (Collaborator, Author) commented Dec 22, 2023

@sarahwooders added a new method to resolve the "too many models" problem:

Start by showing a curated list:
[screenshot]

Allow the user to expand the list (expanding shows a warning):
[screenshot]

Finally, they can still enter a model name manually if the internet / GET request is broken and they know what they're doing; a sketch of the cascade follows:
[screenshot]
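
Roughly, the cascade could be wired up like this (a sketch using the questionary prompt library; the curated list and choice labels are illustrative, not the PR's exact strings):

```python
import questionary

# Illustrative curated defaults; the real list in the PR may differ.
CURATED_MODELS = ["gpt-4", "gpt-4-1106-preview", "gpt-3.5-turbo"]

def prompt_for_model(all_models: list[str]) -> str:
    """Cascading prompt: curated list -> full list (with a warning) -> manual entry."""
    choice = questionary.select(
        "Select default model:",
        choices=CURATED_MODELS + ["[see full list]", "[enter yourself]"],
    ).ask()
    if choice == "[see full list]":
        print("Warning: not all models in the full list will work with MemGPT.")
        choice = questionary.select(
            "Select default model:",
            choices=all_models + ["[enter yourself]"],
        ).ask()
    if choice == "[enter yourself]":
        # Manual fallback for broken connectivity or unlisted models
        choice = questionary.text("Enter model name:").ask()
    return choice
```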

@sarahwooders (Collaborator) left a comment

lgtm!

@cpacker merged commit b97064e into main on Dec 22, 2023
3 checks passed
@cpacker deleted the pull-model-list branch on December 22, 2023 at 07:27
sarahwooders pushed a commit that referenced this pull request Dec 26, 2023
* allow entering custom model name when using openai/azure

* pull models from endpoint

* added/tested vllm and azure

* no print

* make red

* make the endpoint question give you an opportunity to enter your openai api key again in case you made a mistake / want to swap it out

* add cascading workflow for openai+azure model listings

* patched bug w/ azure listing
norton120 pushed a commit to norton120/MemGPT that referenced this pull request Feb 15, 2024
mattzh72 pushed a commit that referenced this pull request Oct 9, 2024
Labels: priority (Merge ASAP)

Successfully merging this pull request may close these issues:
List models options with /models for OpenAI compatible endpoints

2 participants