Add an API endpoint to load the last-used model #5516
Adds an internal endpoint to the OpenAI-compatible API that allows loading the last-used model.
This new endpoint would be particularly helpful for scenarios where VRAM management is necessary: a third-party application can ask text-generation-webui to vacate VRAM (e.g. with `/v1/internal/model/unload`), then quickly reload the model that was just active once some other task (for example, image generation) is done. That technique is employed in the `sd_api_pictures` extension with AUTOMATIC1111's Web UI. This PR would allow other applications to perform the same technique with text-generation-webui. POSTing to `/v1/internal/model/loadlast` triggers the new `models.load_last_model()` function.

As a bonus, this also fixes a bug in `models.reload_model()`: it would fail because `shared.model_name` was set to `None` by `models.unload_model()`, meaning that `reload_model()` would then attempt to load `None`. Setting it to the newly added `shared.last_model_name` variable should fix that issue.
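A minimal client-side sketch of the unload/reload workflow described above (the base URL and the standard-library HTTP calls are my assumptions; only the two endpoint paths come from this PR):

```python
from urllib.request import Request, urlopen

BASE = "http://127.0.0.1:5000"  # assumed default address of the text-generation-webui API

def internal_url(action: str) -> str:
    # Build the URL for one of the internal model-management endpoints.
    return f"{BASE}/v1/internal/model/{action}"

def post(action: str) -> None:
    # Fire an empty POST at an internal endpoint (no request body is required).
    urlopen(Request(internal_url(action), data=b"", method="POST"))

def run_other_task_then_restore():
    # Hypothetical workflow: free VRAM, let another application (e.g. an
    # image generator) use the GPU, then bring the previous model back.
    post("unload")    # vacate VRAM
    # ... image generation or another GPU task happens here ...
    post("loadlast")  # reload the model that was active before the unload
```

The same two calls could equally be made with `curl -X POST` against each URL.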
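To illustrate the bug fix, here is a toy sketch of the state handling (the `SimpleNamespace` stand-in for the real `shared` module and all function bodies are my assumptions, not the PR's actual code):

```python
from types import SimpleNamespace

# Toy stand-in for the project's `shared` module state.
shared = SimpleNamespace(model=object(), model_name="my-model", last_model_name=None)

def load_model(name):
    # Hypothetical loader: install a model under the given name.
    shared.model = object()
    shared.model_name = name

def unload_model():
    # The fix: remember which model was loaded before clearing the name,
    # so a later reload has something other than None to work with.
    shared.last_model_name = shared.model_name
    shared.model = None
    shared.model_name = None

def load_last_model():
    # Called when /v1/internal/model/loadlast receives a POST: reload the
    # model that was active before the last unload.
    if shared.last_model_name is not None:
        load_model(shared.last_model_name)
```

Without the `last_model_name` bookkeeping, a reload after `unload_model()` would try to load `None`, which is exactly the failure the PR description mentions.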