Actions: pytorch/torchft

Unit Tests

211 workflow runs

use torchx for manual many replica (20+) tests (#75)
Unit Tests #211: Commit 39a40b2 pushed by d4l3k
January 18, 2025 05:26 10m 13s main
use torchx for manual many replica (20+) tests
Unit Tests #210: Pull request #75 synchronize by d4l3k
January 18, 2025 00:40 10m 17s d4l3k/torchx
process_group: wait for futher_thread join before creating new one
Unit Tests #209: Pull request #68 synchronize by dwancn
January 17, 2025 03:18 Action required dwancn:fix_pg_config
use torchx for manual many replica (20+) tests
Unit Tests #208: Pull request #75 opened by d4l3k
January 16, 2025 22:51 9m 59s d4l3k/torchx
overhaul timeouts for Lighthouse, Manager, checkpoint server (#73)
Unit Tests #207: Commit 3ee2360 pushed by d4l3k
January 16, 2025 19:05 9m 45s main
overhaul timeouts for Lighthouse, Manager, checkpoint server
Unit Tests #206: Pull request #73 synchronize by d4l3k
January 16, 2025 18:54 9m 50s d4l3k/timeout_overhaul
Fix typo and use sampler in train_ddp.py (#74)
Unit Tests #205: Commit 03160ee pushed by mreso
January 16, 2025 18:27 9m 9s main
Dont return quorum if requester isnt involved (#72)
Unit Tests #204: Commit c58ed4c pushed by d4l3k
January 16, 2025 17:48 6m 46s main
Fix typo and use sampler in train_ddp.py
Unit Tests #202: Pull request #74 opened by mreso
January 16, 2025 00:45 8m 52s mreso:fix/typos
overhaul timeouts for Lighthouse, Manager, checkpoint server
Unit Tests #201: Pull request #73 synchronize by d4l3k
January 15, 2025 23:31 10m 9s d4l3k/timeout_overhaul
overhaul timeouts for Lighthouse, Manager, checkpoint server
Unit Tests #200: Pull request #73 opened by d4l3k
January 15, 2025 19:06 11m 5s d4l3k/timeout_overhaul
lighthouse/quorum: avoid split brain and add shrink_only support (#71)
Unit Tests #197: Commit 79572e6 pushed by d4l3k
January 15, 2025 00:01 9m 24s main
process_group: wait for futher_thread join before creating new one
Unit Tests #196: Pull request #68 synchronize by dwancn
January 14, 2025 08:10 9m 4s dwancn:fix_pg_config
lighthouse/quorum: avoid split brain and add shrink_only support
Unit Tests #195: Pull request #71 opened by d4l3k
January 14, 2025 01:43 9m 18s d4l3k/shrink_only
lighthouse, manager: remove room support (#70)
Unit Tests #194: Commit 97ad397 pushed by d4l3k
January 13, 2025 22:17 8m 45s main
lighthouse, manager: remove room support
Unit Tests #193: Pull request #70 opened by d4l3k
January 13, 2025 21:51 12m 26s d4l3k/remove_room
feat: fix security warnings in torchft (#69)
Unit Tests #192: Commit e0f76e1 pushed by d4l3k
January 13, 2025 21:49 8m 54s main
feat: fix security warnings in torchft
Unit Tests #191: Pull request #69 opened by c-p-i-o
January 13, 2025 21:05 9m 3s cpio/fix_vuln
process_group: wait for futher_thread join before creating new one
Unit Tests #190: Pull request #68 opened by dwancn
January 13, 2025 08:36 9m 3s dwancn:fix_pg_config
[lighthouse] detect unhealthy participants via heartbeats (#64)
Unit Tests #189: Commit 2f97660 pushed by d4l3k
January 11, 2025 01:19 9m 1s main
[lighthouse] detect unhealthy participants via heartbeats
Unit Tests #188: Pull request #64 synchronize by d4l3k
January 11, 2025 01:04 8m 52s d4l3k/quorum_heartbeats
[manager] fix address when binding to 0 (#67)
Unit Tests #187: Commit 6b3665a pushed by d4l3k
January 10, 2025 21:16 8m 37s main