Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[META] Auto undeploy ML Model with TTL #2376

Closed
ylwu-amzn opened this issue Apr 30, 2024 · 1 comment
Closed

[META] Auto undeploy ML Model with TTL #2376

ylwu-amzn opened this issue Apr 30, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request v2.14.0

Comments

@ylwu-amzn
Copy link
Collaborator

ylwu-amzn commented Apr 30, 2024

Is your feature request related to a problem?
As we continue to develop and utilize numerous Machine Learning models, management and resource optimization have become essential. In particular, we need a mechanism to automatically undeploy the models after they've reached a specific Time To Live (TTL). This TTL limit proposes undeploying models that have been inactive or unused for a particular defined period.

What solution would you like?
Add model TTL to deploy setting. Automatic tracking of model usage or last accessed timestamp to understand if a model should be undeployed. Auto-undeployment should clean up all associated resources to conserve server memory, disk space, or any other resources the models may be using.

What alternatives have you considered?
Keep the same with current user experience. User need to manually undeploy model

Do you have any additional context?
No

@Zhangxunmt
Copy link
Collaborator

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request v2.14.0
Projects
Status: 2.14.0 (Launched)
Status: Released
Development

No branches or pull requests

3 participants