Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Collect data and assess node auto-provisioning on Pangeo cluster #671

Closed
1 of 8 tasks
sgibson91 opened this issue Sep 10, 2021 · 1 comment
Closed
1 of 8 tasks

Collect data and assess node auto-provisioning on Pangeo cluster #671

sgibson91 opened this issue Sep 10, 2021 · 1 comment
Assignees
Labels
Task Actions that don't involve changing our code or docs.

Comments

@sgibson91
Copy link
Member

sgibson91 commented Sep 10, 2021

Description

Following discussion in #666 we deployed node auto-provisioning on the Pangeo cluster in #670 (since reverted in #678). This issue is to track the performance, cost-benefits, etc of using node auto-provisioning in this cluster and decide whether this is a feature we want to support long-term as an alternative to configuring node pools manually.

This is a time-boxed issue and we should aim to be making a decision within a 2-3 month time frame.

Some more context: #666 (comment)

Value / benefit

✅ Node auto-provisioning would reduce 2i2c Engineer's manual interaction with the cluster
❌ It may not actually be that cost-effective

We don't know the answers to these q's until we gather data and analyse 🙂

Implementation details

No response

Tasks to complete

  • Debug node autoprovisioning: did not match Pod's node affinity #677
  • Decide what metrics we need to evaluate and what "success" or "failure" of those metrics looks like
  • Decide what data will help towards evaluating those metrics
  • Decide how that data should be collected
  • Implement collection of data
  • Analyse data and evaluate metrics
  • Decide to support node auto-provisioning long-term or not
  • Decide whether to write up a blog post describing this process, the data we collected, and our conclusions

Updates

@sgibson91 sgibson91 added Task Actions that don't involve changing our code or docs. 🏷️ research labels Sep 10, 2021
@choldgraf
Copy link
Member

Updated to add the possibility of a little blog post at the end, after chatting about it with @sgibson91 :-)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Task Actions that don't involve changing our code or docs.
Projects
No open projects
Status: Done 🎉
Development

No branches or pull requests

2 participants