0.33.0
Release Notes
Changelog
- 0c2d3cf chore: bump version: 0.33.0-rc5 -> 0.33.0
- e165541 docs: add release notes for 0.33.0 (#9444)
- 8c69d8b chore: bump version: 0.33.0-rc4 -> 0.33.0-rc5
- e1a40b1 fix: dont utilize the default efs mount on normal aws deploys (#9437)
- ebe2698 chore: bump version: 0.33.0-rc3 -> 0.33.0-rc4
- b85b8b3 fix: set the defaults for shared_fs mount in genai correctly (#9433)
- 52c7d95 chore: bump version: 0.33.0-rc2 -> 0.33.0-rc3
- 9968dce fix: add feature gate for checking for blank admin/determined password [DET-10197] (#9425)
- 9c4fd74 chore: bump version: 0.33.0-rc1 -> 0.33.0-rc2
- 274b152 fix: Keep template modal open when config is invalid (#9424)
- f4d6f54 chore: bump version: 0.33.0-rc0 -> 0.33.0-rc1
- 2661ae0 chore: bump ngc image versions for release (#9418)
- cbc15db fix: master checks db newness before migrating [DET-10312] (#9414)
- d1b3343 chore: bump version: 0.33.0-dev0 -> 0.33.0-rc0
- ca45198 chore: lock published urls to preserve redirects
- f2cd018 chore: lock api state for backward compatibility check
- 6184f6f chore: bump version: 0.32.1-dev0 -> 0.33.0-dev0
- 4af9bfc revert: Framework splitting (#9405)
- 6fa1420 test: project create and delete react e2e [INFENG-456] (#9244)
- 860f6a8 docs: Describe config templates WebUI (#9399)
- 6ff8eb7 chore: Add slurm codeowners (#9403)
- 68b36c6 feat: require initial passwords on new cluster-up [DET-10197] (#9314)
- 0ef3e10 test: datagrid scrolling [INFENG-687] (#9379)
- 18ee0e3 chore: Update docker retag scripts (#9401)
- 6ed2976 pin setuptools in model hub tests (#9402)
- c4ebe5e feat: Release WebUI templates with notes (#9383)
- 3bbb51a feat: Display Log retention days and Remaining log retention days in Logs Tab (#9305)
- 047580c feat: update default scheduler to priority for agentrm (#9385)
- ce70c00 docs: Add more info helm install password (#9388)
- b84ee1f docs: cluster observability documentation and dashboard improvements (#9391)
- c3b3ae6 feat: helm install checks password complexity [DET-10293] (#9360)
- 5c51164 fix: Skip resource checking for unmanaged exp (#9372)
- 107e108 feat: add Sort menu to Flat Runs view (#9396)
- cb81a44 feat: Add charts to Comparison View (ET-99) (#9215)
- cd33c13 test: put flaky fix back in [INFENG-694] (#9394)
- d3e89b1 docs: add exp config for unmanaged example #2. (#9397)
- d4e23f4 chore: pin requests version < 2.32.0 so docker works (#9395)
- 5480c57 chore: don't use a seperate schema for views_and_triggers (#9392)
- 893f7f5 chore: add resource_pools intg test (#9356)
- de21593 chore: push oss images per commit (#9386)
- 95c70d4 docs: Add nav to genai docs (#9387)
- 0c42ced feat: SDK methods to fetch pachyderm configs [MD-406] (#9348)
- 0ff09e0 docs: Describe pwd requirements WebUI (#9378)
- 31bc08a refactor: rename multiRM to more intuitive name (#9350)
- df7a2af docs: Update release note (#9375)
- 2c9b9b9 feat: add pod labels with proper validation (#9364)
- 0a59c63 docs: Remove long metrics rn (#9374)
- 7e4b431 feat: add columns menu to Runs view (#9323)
- c10ae99 test: Remove flakiness of KillRun test (#9370)
- 653a0de chore: store database code as code [DET-9180] (#9302)
- d38e2e0 test: report individual test results from python tests (#9366)
- 7bce6ff chore: report ntsc names via cli at launch (#9228)
- 93c8d81 ci: keep waiting on failing workloads for sending slack alerts (#9371)
- 53edec9 test: More Page Models for Experiment Tracking [INFENG-694] (#9367)
- a96cafd feat: Framework splitting (#9318)
- 3b1d0df chore: remove test suite whose marks match no tests (#9363)
- 566b6af test: page model refactor for dropdown and select components [INFENG-694] (#9362)
- 68b7116 docs: deprecation notice for agentrm features (#9344)
- 2092943 docs: Add FSDP to deepspeed (#9182)
- 4f180db chore: update npm libraries (#9331)
- c7b78fa feat: edit/delete template from WebUI (#9353)
- 8c5fce7 test: provide GKE tests with a Helm value for initialUserPassword [DET-10196] (#9361)
- 0941fc4 feat: helm requires bootstrap password [DET-10196] (#9359)
- f91c2a3 docs: revert a doc format change to reenable slurm tests (#9358)
- 989341c feat: add options in flat run (#9341)
- 16a3f3b revert: resource pool intg tests (#9357)
- 54fb10a chore: add intg tests to resource_pool.go (#9199)
- c3901c8 fix: det ckpt download from s3. (#9332)
- 7c26fe1 refactor: columnpicker remove hard coded value (#9342)
- feb8a7b fix: remove pod labels with potentially incompatible names (#9349)
- eab4981 docs: Reformat grid tables (#9321)
- 758ffd7 chore: add retries to check-doc-links ci job (#9335)
- 80fac3d chore: update release notes date (#9334)
- efbcdee chore: update codecov to ignore e2e react [INFENG-689] (#9346)
- 2445d39 fix: Revert "feat: helm requires bootstrap password (DET-10196)" (#9345)
- df0b7f9 feat: Implement `/template/rename` to patch template name (#9320)
- 0a0b3c3 feat: helm requires bootstrap password [DET-10196] (#9274)
- 86aa319 chore: Bunify and add test coverage for `ExperimentTotalStepTime` and `ExperimentNumSteps` (#9333)
- 3c0eac6 test: experiement list tests [INFENG-457] (#9299)
- cc82cc9 chore: add missing setuptools to win cli tests (#9336)
- c0fdaa9 chore: remove step for authenticated master session check and use standard script (#9339)
- b868230 test: wait for background logout (#9340)
- ead928e ci: add missing var overide in ee release [skip ci] (#9338)
- cab9ac5 test: log in with the api rather than through the UI for most react tests (#9307)
- 349d2a5 feat: View templates from WebUI (#9304)
- 9d46f49 chore: update codecov to ignore e2e react [INFENG-689] (#9337)
- 6e10465 ci: send job level failure slack alerts (#9315)
- 2abacb9 docs: update "install on k8s" guide to use helm repo instead of tarball. (#9293)
- 492ef57 chore: bump version: 0.32.0-dev0 -> 0.32.1-dev0
- f74988c chore: add docs dropdown link for new version
- a1b6912 docs: add release notes for 0.32.0 (#9301)
- dab4946 feat: add integration config for pachyderm input datasets (ET-12) (#8933)
- 3d4e283 test: refactor nav spec to use sidebar pagemodels [INFENG-683] (#9326)
- 1779060 test: skip a flaky test [ET-233, ET-178] (#9324)
- 3b167c7 fix: filter action experiments, old ExperimentList (#9325)
- 5b73dc4 fix: filter batch action experiments (#9316)
- 6fb62ad feat: support for configuring the shared_fs mount path in genai (#9317)
- 5f4cbbf Revert "docs: Reformat tables with image names" (#9319)
- c9f5e8a docs: Reformat tables with image names (#9312)
- fdaa015 feat: support filter in flat run table (#9250)
- a76c549 ci: don't run test-e2e-longrunning tests on main (#9313)
- ebf19a6 chore: bumpenvs for efs-utils (#9309)
- f9a35d9 ci: run e2e-react manually [INFENG-676] (#9310)
- 9cab46d chore: drop unused postgres function experiments_best_validation_history (#9306)
- ee4f04e chore: stop writing database down migrations [RM-242] (#9289)
- 24aaff4 ci: store npm log (#9311)
- e90bd0d chore: improve messaging for e2e tests (#9286)
- 0aef4c7 fix: tensorboard metric overwrites and sync throttle [MD-328] [MD-291] (#9282)
- f9b96fe ci: don't run requests-hpc-tests on main (#9308)
- 4c314a2 chore: update efs-utils install for v2.0 (#9297)
- f6181ab test: revert runner size test-e2e-cpu (#9303)
- b0a008e feat: Update workspace for templates server side (#9272)
- 9357391 ci: circleci slack alerts should go to #ci-bots (#9300)
- 49ab75d docs: Update Chart.yaml [ci skip] (#9298)
- e31135a chore(deps): bump google.golang.org/grpc from 1.58.0 to 1.58.3 (#9292)
- f6e42cd fix: Bulk Action bug (#9255)
- a8d05fa test: skip a `useTypedParams` test case due to flakiness (#9287)
- 2164912 chore: dependabot upgrade grpc/go-jose/net [RM-66] (#9280)
- f1aa92e chore: log health check failures in master logs (#9291)
- 7496445 fix: proto build shouldn't run if source files are unchanged (#9290)
- 21f76e9 fix: slots being filled returned out of order on k8s [RM-42] (#9276)
- a9c8700 test: e2e no floating promises [INFENG-668] (#9283)
- 7a296fa test: flaky user test fix [INFENG-663] (#9281)
- e4c6afe chore(deps): bump golang.org/x/net from 0.21.0 to 0.23.0 (#9202)
- b8eba3a ci: revert ee rebase changes to dependabot.yaml (#9278)
- 9ee9270 ci: gate hpc by request (#9198)
- 670ac40 chore: make command's run startup-hook.sh [RM-159] (#9275)
- b78020d feat: Create template through WebUI (#9263)
- abcc7b4 fix: Hide runs in archived experiments (#9270)
- b602ff2 docs: fix master config doc typo (#9256)
- 6bd2a8c ci: try to fix slurm podman tests by not building agent binary (#9273)
- 9c068d2 feat: webui create user prompts for password [DET-10221] (#9240)
- ae91042 feat: reuse HTTP sessions (#9116)
- a3f0fcf fix: show non det pods in other namespaces than 'default' [RM-141] (#9268)
- a611cf0 chore: stop publishing helm charts to NGC. (#9271)
- 2905180 test: increase runner size for react e2e (#9269)
- f8f8672 ci: try to fix podman tests by building proto once (#9267)
- 95b5164 feat: timeout change and package dedupe [ET-243] (#9265)
- 55b7fd9 chore: Image rename bumpenvs (#9253)
- 4d87127 test: some react tests are flaky [INFENG-663] (#9264)
- 86328cb fix: users can be removed from all groups in Web UI (#9259)
- aea83df chore: enable genai to connect to db over TLS (#9260)
- 703e6bd feat: Archive & Unarchive run (#9143)
- 8794e42 fix: historical-usage date calculation bug (#9257)
- cda4363 test: increase the timeout on a new users test [INFENG-455] (#9258)
- bd7b5ef test: user tests continued [INFENG-455] (#9214)
- dd4d0f9 ci: bugfix for nightlight workflow job name [INFENG-651] (#9254)
- 8ef477f fix: notification nil pointer dereference (#9251)
- 32b6ce6 feat: Add templates feature flag and entry points (#9229)
- f4d0466 fix: hew update for select bug in log viewer (#9249)
- c64a695 feat: Filter templates by workspace (#9242)
- 4468232 test: resize test executors that aren't fully used [infeng-598] (#9252)
- 95e4257 fix: undo default log retention in values.yaml (#9245)
- 1e87778 docs: Add qs to getstarted (#9248)
- 43e0bf4 Revert "docs: Update observability release note" (#9247)
- 265314c docs: Update observability release note (#9246)
- 4d20fae feat: Delete run enpoint (#9123)
- 3abe163 fix: support for metric names over 63 characters long [MD-285] (#9232)
- 7cc4c83 fix: allow genai deployments with agent GIDs set to share data properly (#9243)
- 5099158 fix: allow JupyterLab to listen on all interfaces [MD-375] (#9213)
- 985daeb chore: address protoc warning about unused import (#9224)
- 7aa5d5d ci: ee and oss job names [INFENG-651] (#9239)
- 87ef83d test: react e2e use type inference for newed components (#9237)
- 9590d71 ci: devcluster should fail job if step fails [INFENG-640] (#9217)
- 245ebfa feat: don't attempt to deal with EE when cherry-picking PRs for release (#9235)