[Enhancement] Auto-deploy ML Model when predict #1148
Comments
+1, automating management of the availability of specific models would simplify operations significantly.
@Zhangxunmt thanks for the proposal. This looks like a much-requested feature. A couple of questions on it:
@owaiskazi19, the BWC (backward compatibility) is still intact. Nothing changes on your side: you can still set up deploy and undeploy in the flow framework, and the API experience remains the same. Model registration with "_register?auto_deploy=true" is still valid. This change only handles cases such as cluster scale-up/scale-down, restarts, or node replacement; we need to auto-deploy models at the "Prediction" stage so customers don't have to manually redeploy after every such event.
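To make the registration path above concrete, here is a hedged sketch of a register call with the auto-deploy flag quoted in that comment, assuming a local cluster on localhost:9200. The request-body fields are illustrative placeholders and likely incomplete for a real registration:

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Sketch only: registers a model with the auto-deploy flag quoted above.
// Host, model name, and body fields are placeholders, not from this thread.
public class RegisterWithAutoDeploy {
    public static void main(String[] args) throws Exception {
        String body = """
                {
                  "name": "my-model",
                  "version": "1.0.0",
                  "model_format": "TORCH_SCRIPT"
                }""";
        HttpRequest req = HttpRequest.newBuilder()
                .uri(URI.create("http://localhost:9200/_plugins/_ml/models/_register?auto_deploy=true"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
        String resp = HttpClient.newHttpClient()
                .send(req, HttpResponse.BodyHandlers.ofString())
                .body();
        System.out.println(resp); // response contains a task ID to poll for the model ID
    }
}
```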
Currently, ML models are manually "deployed" (or "loaded") into memory, which requires customers to invoke a "deploy" API before using any ML model. After usage, ml-commons also requires a manual "undeploy" (or "unload") from end users. This adds overhead for both the system and the end users working with ml-commons in their workflows.
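For context, a minimal sketch of today's manual lifecycle against the ml-commons REST endpoints, using Java's built-in HTTP client. The host, model ID, and predict body are placeholders, and the exact endpoint shapes can vary by OpenSearch version:

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Sketch of the current manual deploy -> predict -> undeploy workflow.
public class ManualLifecycle {
    static final HttpClient CLIENT = HttpClient.newHttpClient();
    static final String BASE = "http://localhost:9200/_plugins/_ml/models/";
    static final String MODEL_ID = "my-model-id"; // placeholder

    static String post(String path, String body) throws Exception {
        HttpRequest req = HttpRequest.newBuilder()
                .uri(URI.create(BASE + path))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
        return CLIENT.send(req, HttpResponse.BodyHandlers.ofString()).body();
    }

    public static void main(String[] args) throws Exception {
        // 1. Manually load the model into memory before any inference.
        post(MODEL_ID + "/_deploy", "{}");
        // 2. Run inference (the request body depends on the model type).
        post(MODEL_ID + "/_predict", "{\"text_docs\": [\"hello\"]}");
        // 3. Manually unload the model afterwards.
        post(MODEL_ID + "/_undeploy", "{}");
    }
}
```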
We should build an auto-deploy mechanism to remove these "deploy" and "undeploy" operations from the workflow. Instead, we should auto-deploy a model the first time a customer uses it and set up a TTL to auto-undeploy it from the system afterwards. That way, the deploy and undeploy APIs can be dropped from the workflow and the user experience is much simpler.
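As a rough illustration of the proposal, the sketch below wraps predict calls so that a model is deployed lazily on first use and undeployed after a TTL of inactivity. Everything here (the ModelClient interface, its methods, the per-client scheduler) is hypothetical scaffolding, not an actual ml-commons API:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch of the proposed behavior: deploy on first predict,
// reset a TTL timer on every use, undeploy once the model sits idle.
public class AutoDeployPredictor {
    interface ModelClient {              // stand-in for the real transport layer
        void deploy(String modelId);
        void undeploy(String modelId);
        String predict(String modelId, String input);
    }

    private final ModelClient client;
    private final long ttlMinutes;
    private final ScheduledExecutorService scheduler =
            Executors.newSingleThreadScheduledExecutor();
    private final Map<String, ScheduledFuture<?>> evictions = new ConcurrentHashMap<>();

    AutoDeployPredictor(ModelClient client, long ttlMinutes) {
        this.client = client;
        this.ttlMinutes = ttlMinutes;
    }

    String predict(String modelId, String input) {
        evictions.compute(modelId, (id, pending) -> {
            if (pending == null || !pending.cancel(false)) {
                // First use, or the eviction timer already fired:
                // (re)load the model into memory before predicting.
                client.deploy(id);
            }
            // Schedule (or push out) the idle-TTL eviction for this model.
            // A production version would also need to handle the race where
            // the timer fires concurrently with a new predict call.
            return scheduler.schedule(() -> {
                client.undeploy(id);
                evictions.remove(id);
            }, ttlMinutes, TimeUnit.MINUTES);
        });
        return client.predict(modelId, input);
    }
}
```

Resetting the timer on every call keeps frequently used models warm while still reclaiming memory from idle ones; a real implementation inside ml-commons would need cluster-level coordination (e.g. TTL state tracked in model metadata) rather than a per-client scheduler.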