daskhub: provide worker resource options for 16CPU/128GB nodes on GKE/EKS #3344

Merged on Nov 20, 2023 (8 commits)

@consideRatio (Contributor) commented Oct 30, 2023

All GKE/EKS clusters with daskhubs have n2-highmem-16/r5.4xlarge nodes set up for them (#2687). This lets us standardize, across all clusters, the pre-defined options users choose from for how large the CPU/memory requests of each pod should be.
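
As a rough illustration of what standardized options for a 16 CPU / 128 GB node could look like, the sketch below splits one node into equal worker sizes. The usable fraction, the list of splits, and the names `WorkerOption` and `worker_options` are assumptions made for illustration only; the values actually chosen in this PR may differ.

```python
from dataclasses import dataclass

NODE_CPU = 16      # from the PR title: 16 CPU nodes
NODE_MEM_GB = 128  # from the PR title: 128 GB nodes

# Assumption: fraction of a node left for worker pods after system daemons
# and kubelet reservations. The real figure is not stated in this PR text.
ASSUMED_USABLE_FRACTION = 0.9


@dataclass(frozen=True)
class WorkerOption:
    label: str
    cpu_request: float
    mem_request_gb: float


def worker_options(workers_per_node=(16, 8, 4, 2, 1)):
    """Return one option per split, sized so that n workers fit on one node."""
    options = []
    for n in workers_per_node:
        cpu = NODE_CPU * ASSUMED_USABLE_FRACTION / n
        mem = NODE_MEM_GB * ASSUMED_USABLE_FRACTION / n
        options.append(
            WorkerOption(
                label=f"1/{n} of a node",
                cpu_request=round(cpu, 2),
                mem_request_gb=round(mem, 2),
            )
        )
    return options


if __name__ == "__main__":
    for option in worker_options():
        print(option)
```

Sizing requests as an even fraction of the usable node capacity means a whole number of workers packs onto each node, which is what makes a single option list reusable across clusters with the same node shape.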

Review notes

  • daskhub's new default
    The daskhub chart now comes with these options by default, choosing n2-highmem-16 or r5.4xlarge based on the automatically detected k8s distribution.
    With this, jupyter-meets-the-earth no longer specifies its own custom logic.

  • basehub-cluster-info ConfigMap added
    It makes the BASEHUB_K8S_DIST environment variable available to the hub and dask-gateway pods. The dask-gateway pod relies on it in this PR; the hub pod also gets it because it's likely to be useful.

    The ConfigMap gets labels that align with current Helm chart best practices, which now seem to be stable.

  • Test with pangeo/base-notebook:latest at https://dask-staging.2i2c.cloud
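
The selection described in the notes above could be sketched roughly as follows. This is not the chart's actual code: the `"gke"`/`"eks"` values for BASEHUB_K8S_DIST, the function name `worker_node_selector`, and the use of the well-known `node.kubernetes.io/instance-type` node label are all assumptions; only the environment variable name and the two instance types come from the description.

```python
import os

# Instance types named in the PR description. The "gke"/"eks" keys are
# assumed values of BASEHUB_K8S_DIST, which the PR text does not spell out.
INSTANCE_TYPE_BY_DIST = {
    "gke": "n2-highmem-16",
    "eks": "r5.4xlarge",
}


def worker_node_selector(environ=os.environ):
    """Pick a node selector for worker pods from the detected k8s distribution."""
    dist = environ.get("BASEHUB_K8S_DIST", "").lower()
    if dist not in INSTANCE_TYPE_BY_DIST:
        raise ValueError(f"unsupported or missing BASEHUB_K8S_DIST: {dist!r}")
    # Assumption: nodes are selected via the standard instance-type label.
    return {"node.kubernetes.io/instance-type": INSTANCE_TYPE_BY_DIST[dist]}


if __name__ == "__main__":
    print(worker_node_selector({"BASEHUB_K8S_DIST": "gke"}))
```

Passing the environment in as a mapping keeps the function easy to exercise without a running cluster; in a pod it would simply fall back to `os.environ`.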

UX for users

GKE: [screenshot]
EKS: [screenshot] (ugly extras are resolved by an in-image software update)

@jbusecke I've deployed this to leap's staging hub for now; if you have feedback, it's most welcome!

@consideRatio consideRatio requested a review from a team as a code owner October 30, 2023 15:44
@GeorgianaElena (Member) left a comment

This looks good to me @consideRatio! Thank you!

I left a few comments about adding more notes about where certain numbers are coming from, so we can trace back in the future if need be.

Related to docs: do you believe it makes sense to add a small (rather generic) note about these new instance-dependent dask worker options in https://docs.2i2c.org/about/distributions/research/#a-brief-overview-of-scalable-computing-hubs?

Review threads on helm-charts/daskhub/values.yaml: 4 (all resolved)

@consideRatio (Contributor, Author) commented

Deployed to leap's staging for now. With a 👍 from @jbusecke, who actively makes use of dask-gateway at the moment, I'll go for a merge.

@consideRatio force-pushed the pr/dask-options branch 2 times, most recently from 8eda3d7 to 42de740, on November 17, 2023 13:59
@consideRatio (Contributor, Author) commented

Rebased - I'll go for a merge at this point!

@consideRatio consideRatio merged commit 8b3d7d2 into 2i2c-org:master Nov 20, 2023
31 checks passed
🎉🎉🎉🎉

Monitor the deployment of the hubs here 👉 https://github.com/2i2c-org/infrastructure/actions/runs/6934459391
