-
Notifications
You must be signed in to change notification settings - Fork 364
Issues: determined-ai/determined
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
🐛[bug] Can't set
slot_type
through values.yaml
- K8s deplyment
bug
#10152
by caio-davi
was closed Oct 30, 2024
🤔[question] How to Completely Uninstall Determined AI and Reset Admin User?
question
#9961
by Flukeds129
was closed Sep 20, 2024
🐛[bug] Distributed training not working for PyTorchTrial when using evaluate_full_dataset and more than 1 GPU
bug
#9916
by charles-viss
was closed Sep 12, 2024
🐛[bug] already set min_validation_period , but still got a single validation metric
bug
#9863
by scotthuang1989
was closed Sep 2, 2024
💡[feat] how to avoid node GPU fragmentation
feature
Feature requests
#9861
by GrS-AA
was closed Aug 29, 2024
🤔[question] “dtrainNetworkInterface” seems does not take effect when deploy on k8s
question
#9839
by ShiroZhang
was closed Aug 22, 2024
2 tasks done
🤔[question] can you provide me a example that use amp(mixed precision)
question
#9817
by ShiroZhang
was closed Aug 20, 2024
2 tasks done
🤔[question] where to set the
find_unused_parameters=True
question
#9814
by caiduoduo12138
was closed Aug 14, 2024
2 tasks done
🤔[question] I want to callback the interface when the resource is released.
question
#9734
by caiduoduo12138
was closed Jul 29, 2024
2 tasks done
🤔[question] Open to updates to EKS deployment?
question
#9355
by bryantbiggs
was closed May 28, 2024
2 tasks done
🤔[question] Where can I find the source code of the CLI?
question
#9072
by ms8922
was closed Mar 29, 2024
2 tasks done
🤔[question] dialing to http://172.22.0.1:32862: dial tcp 172.22.0.1:32862: connect: connection refused
question
#8954
by mr-nealon
was closed Apr 22, 2024
2 tasks done
🐛[bug] Running Mnist Tutorial distributed causes Runtime Errors and Hanging behavior
bug
#8915
by samjenks
was closed Feb 29, 2024
🤔[question] Updating the default Determined-Pytorch container to 2.1/2.2
question
#8908
by samjenks
was closed Feb 27, 2024
2 tasks done
🤔[question] Changing the default config path for the determined-agent.service
question
#8891
by samjenks
was closed Feb 27, 2024
2 tasks done
🐛[bug] Resources failed with non-zero exit code: container failed with non-zero exit code: 80
bug
#8844
by samjenks
was closed Feb 14, 2024
🐛[bug] Error Starting Up Cluster using det deploy
bug
#8824
by joshuacuellar1
was closed Feb 16, 2024
🤔 model registry - inference with pytorch model
question
#8806
by Fedege98
was closed Apr 22, 2024
2 tasks done
🐛[bug] det CLI tool errors on Python 3.12 because it relies on distutils which was deprecated in Python 3.10
bug
#8666
by sirredbeard
was closed Jan 9, 2024
Previous Next
ProTip!
Follow long discussions with comments:>50.