Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🐛[bug] Error Starting Up Cluster using det deploy #8824

Closed
joshuacuellar1 opened this issue Feb 10, 2024 · 4 comments
Closed

🐛[bug] Error Starting Up Cluster using det deploy #8824

joshuacuellar1 opened this issue Feb 10, 2024 · 4 comments
Labels

Comments

@joshuacuellar1
Copy link

Describe the bug

After installing docker and determined, I can not start up the cluster. When I run det deploy local cluster-up I get an error.

Reproduction Steps

  1. Install Docker
  2. Install Determined using pip install determined
  3. Run det deploy local cluster-up

Expected Behavior

The cluster should start up so I can log in to the web portal.

Screenshot

Screenshot from 2024-02-09 18-26-47

Environment

  • Device or hardware: Geforce GTX 1660 Ti
  • OS: Ubuntu 22.04
  • Browser: Firefox
  • Version 22.04

Additional Context

No response

@MikhailKardash
Copy link
Contributor

Hello, thanks for the report. It looks like det deploy is having trouble detecting your gpu. Can you run nvidia-smi?

@joshuacuellar1
Copy link
Author

joshuacuellar1 commented Feb 12, 2024

@MikhailKardash
image

@MikhailKardash
Copy link
Contributor

MikhailKardash commented Feb 13, 2024

Great. Do you also have nvidia-container-toolkit installed? https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html

More information can be found in our docs here: https://docs.determined.ai/latest/setup-cluster/on-prem/requirements.html#install-docker

@joshuacuellar1
Copy link
Author

Installing it immediately fixed it. I hadn't noticed it in the install instructions. Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants