[NeMo-UX] Use single instance of loss reductions in GPTModel #3196

Job Run time
3s
3s
1m 34s
47s
2m 24s
1m 43s
1m 47s
1m 40s
47s
56s
3m 21s
1m 59s
1m 30s
3m 25s
1m 28s
2m 15s
18m 53s
18m 39s
28s
1m 0s
1m 4s
1m 7s
4m 48s
57s
1m 0s
32s
40s
1m 20s
15m 31s
40s
41s
32s
30s
31s
29s
1m 11s
1m 21s
34s
34s
31s
1m 36s
41s
1m 22s
59s
2m 34s
2m 1s
56s
36s
26s
30s
25s
1m 13s
40s
35s
32s
1m 27s
1m 21s
15s
1s
1s
1m 43s
55s
56s
41s
46s
1s
1s
32s
1s
1s
38s
39s
37s
33s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
3m 55s
1s
1s
46s
1s
1s
1s
3s
2h 10m 44s