-
Notifications
You must be signed in to change notification settings - Fork 357
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
docs: release notes & updates for Kubernetes pods to jobs #9443
docs: release notes & updates for Kubernetes pods to jobs #9443
Conversation
✅ Deploy Preview for determined-ui ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## stoksc/feat/kubernetesjobs #9443 +/- ##
==============================================================
- Coverage 49.04% 49.04% -0.01%
==============================================================
Files 1233 1233
Lines 159205 159205
Branches 2778 2777 -1
==============================================================
- Hits 78084 78076 -8
- Misses 80947 80955 +8
Partials 174 174
Flags with carried forward coverage won't be shown. Click here to find out more. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
added suggested edit to match the release notes style, voice, tone
42ead04
into
stoksc/feat/kubernetesjobs
This change updates the Kubernetes resource manager to submit one Kubernetes job per Determined allocation instead of many pods. This is complicated but we think it is worth it because: - Jobs play nice with resource quotas and other Kubernetes features out of the box. - Eventually we can delegate restarts, TTL, pause/resume (using suspend), and more to jobs. - They allow us to better integrate with Kueue and other tools in the ml ecosystem. - Supporting VolcanoJobs (or similar alternatives) alongside Jobs is realistic. - The refactor is net positive w.r.t. test coverage (20% to 80%) and code quality. This commit is the result of several PRs, enumerated here for easier discovery. - #9296 contains most of the code changes. - #9443 - #9447 - #9450 - #9451 Co-authored-by: Carolina Calderon <[email protected]>
Ticket
RM-258
Description
Include release note & updates to Kubernetes docs for the pods to jobs project.
Test Plan
N/A
Checklist
docs/release-notes/
.See Release Note for details.