GPU instance support on AWS #17

Open
mjuric opened this issue Dec 18, 2019 · 2 comments
Labels
enhancement New feature or request

Comments

Member

mjuric commented Dec 18, 2019

Make sure we add GPU instance support for AWS deployments. This is a tracker issue for the various pieces of this problem, based on our experience with the astroML demo prep.

Todo:
[ ] Start the GPU nodes with a recommended AMI
[ ] Patch the k8s deployment so EKS recognizes the GPU nodes (problem may have gone away by now)
[ ] Deploy nvidia-device-plugin into the k8s cluster (helm chart)
[ ] Start containers with the NVIDIA_DRIVER_CAPABILITIES: "all" environment variable
[ ] Write a small script/utility to verify everything has been set up correctly and is working.
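
For the environment-variable item above, a throwaway test pod might look roughly like the sketch below. It is not part of the actual deployment. The pod name, the container name, and the `YOUR-CUDA-IMAGE` placeholder are assumptions for illustration, and the `nvidia.com/gpu` limit assumes the nvidia-device-plugin is already running in the cluster.

```python
import json


def gpu_test_pod(image: str, name: str = "gpu-smoke-test") -> dict:
    """Build a one-shot pod manifest that requests one GPU and runs nvidia-smi.

    `image` and `name` are placeholders; substitute whatever the deployment uses.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": "cuda",  # hypothetical container name
                    "image": image,
                    "command": ["nvidia-smi"],
                    # The variable from the todo list above.
                    "env": [{"name": "NVIDIA_DRIVER_CAPABILITIES", "value": "all"}],
                    # Resource name advertised by the NVIDIA device plugin.
                    "resources": {"limits": {"nvidia.com/gpu": 1}},
                }
            ],
        },
    }


if __name__ == "__main__":
    # kubectl accepts JSON manifests as well as YAML.
    print(json.dumps(gpu_test_pod("YOUR-CUDA-IMAGE"), indent=2))
```

If this is saved as, say, `gpu_pod.py` (a made-up file name), the output can be piped into `kubectl apply -f -` once the image placeholder has been replaced.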

Add anything that's missing.
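
As a starting point for the verification utility in the todo list, here is a minimal sketch. It assumes `kubectl` is configured for the cluster and that GPU nodes advertise the `nvidia.com/gpu` resource via the device plugin. The function names are made up for illustration, and the checks are probably not exhaustive.

```python
import json
import os
import re
import shutil
import subprocess


def gpus_per_node(nodes_json: str) -> dict:
    """Map node name -> allocatable nvidia.com/gpu count, from `kubectl get nodes -o json`."""
    nodes = json.loads(nodes_json).get("items", [])
    return {
        n["metadata"]["name"]: int(n["status"].get("allocatable", {}).get("nvidia.com/gpu", 0))
        for n in nodes
    }


def parse_gpu_list(smi_output: str) -> list:
    """Extract GPU names from `nvidia-smi -L` lines like 'GPU 0: <name> (UUID: ...)'."""
    return re.findall(r"^GPU \d+: (.+?) \(UUID:", smi_output, flags=re.MULTILINE)


def run(cmd):
    """Return the command's stdout, or None if it is missing, fails, or times out."""
    if shutil.which(cmd[0]) is None:
        return None
    try:
        proc = subprocess.run(cmd, capture_output=True, text=True, timeout=30)
    except (subprocess.TimeoutExpired, OSError):
        return None
    return proc.stdout if proc.returncode == 0 else None


def main():
    # Cluster-side check: do any nodes report allocatable GPUs?
    nodes = run(["kubectl", "get", "nodes", "-o", "json"])
    if nodes is None:
        print("kubectl check skipped (kubectl missing or cluster unreachable)")
    else:
        for name, count in gpus_per_node(nodes).items():
            print(f"node {name}: {count} allocatable GPU(s)")

    # Container-side check: is a GPU visible from where this script runs?
    smi = run(["nvidia-smi", "-L"])
    if smi is None:
        print("nvidia-smi check skipped (nvidia-smi missing or failed)")
    else:
        print("visible GPUs:", parse_gpu_list(smi))
    print("NVIDIA_DRIVER_CAPABILITIES =", os.environ.get("NVIDIA_DRIVER_CAPABILITIES"))


if __name__ == "__main__":
    main()
```

Run from a workstation, only the node check is meaningful. Run inside a container on a GPU node, the `nvidia-smi` and environment checks are the interesting part.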

@mjuric mjuric added the enhancement New feature or request label Dec 18, 2019
Member Author

mjuric commented Dec 18, 2019

@bsipocz @stevenstetzler Please add any missing items and/or solution details here so we don't forget them.

Member Author

mjuric commented Dec 18, 2019
