-
Notifications
You must be signed in to change notification settings - Fork 188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test and fix failing nightly libcudacxx + CUB jobs #1847
Conversation
🟩 CI finished in 5h 09m: Pass: 100%/365 | Total: 2d 00h | Avg: 7m 59s | Max: 1h 09m | Hits: 97%/521326
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
CUB | |
Thrust | |
CUDA Experimental |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
🏃 Runner counts (total jobs: 365)
# | Runner |
---|---|
264 | linux-amd64-cpu16 |
56 | linux-amd64-gpu-v100-latest-1 |
24 | linux-arm64-cpu16 |
21 | windows-amd64-cpu16 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's hard for me to assess whether this causes any issues, becaues the CI for this PR does not run the nightlies. However, if we experience any issues from the nighlies, we can also disable the offenders or fix the issues again.
a6a50e6
to
13b3c78
Compare
To test the nightly builds, add the string To just test the nightly builds and skip the PR workflow, add the strings I've just force pushed this branch to include #1844 (since otherwise we'd have a bunch of conflicts soon), and added the failing nightly jobs to the |
We need to fix the failures before enabling the jobs.
🟨 CI finished in 7h 39m: Pass: 68%/41 | Total: 10h 48m | Avg: 15m 48s | Max: 43m 50s | Hits: 36%/43554
|
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
CUB | |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 41)
# | Runner |
---|---|
15 | linux-amd64-cpu16 |
5 | linux-amd64-gpu-t4-latest-1-testing |
5 | linux-amd64-gpu-v100-latest-1 |
5 | linux-amd64-gpu-rtx2080-latest-1-testing |
5 | linux-amd64-gpu-h100-latest-1 |
4 | linux-amd64-gpu-l4-latest-1-testing |
1 | linux-amd64-gpu-rtx4090-latest-1-testing |
1 | linux-amd64-gpu-rtxa6000-latest-1-testing |
🟨 CI finished in 1h 05m: Pass: 87%/41 | Total: 9h 13m | Avg: 13m 30s | Max: 46m 45s | Hits: 68%/43566
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
CUB | |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 41)
# | Runner |
---|---|
15 | linux-amd64-cpu16 |
5 | linux-amd64-gpu-t4-latest-1-testing |
5 | linux-amd64-gpu-v100-latest-1 |
5 | linux-amd64-gpu-rtx2080-latest-1-testing |
5 | linux-amd64-gpu-h100-latest-1 |
4 | linux-amd64-gpu-l4-latest-1-testing |
1 | linux-amd64-gpu-rtx4090-latest-1-testing |
1 | linux-amd64-gpu-rtxa6000-latest-1-testing |
@alliepiper It seems that the nvrtc catch2 tests are not properly configured with the right standard mode: https://github.com/NVIDIA/cccl/actions/runs/10212454129/job/28256780840?pr=1847#step:5:2575 |
🟨 CI finished in 8h 24m: Pass: 99%/421 | Total: 6d 17h | Avg: 22m 58s | Max: 1h 07m | Hits: 74%/31602
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
305 | linux-amd64-cpu16 |
65 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
🟩 CI finished in 18h 28m: Pass: 100%/421 | Total: 6d 17h | Avg: 23m 01s | Max: 1h 07m | Hits: 74%/31602
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
305 | linux-amd64-cpu16 |
65 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
8783527
to
bc9f5ab
Compare
bc9f5ab
to
91ec108
Compare
🟨 CI finished in 12h 26m: Pass: 96%/526 | Total: 3d 13h | Avg: 9m 48s | Max: 6h 00m | Hits: 99%/31602
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 526)
# | Runner |
---|---|
348 | linux-amd64-cpu16 |
71 | linux-amd64-gpu-v100-latest-1 |
28 | linux-amd64-gpu-l4-latest-1-testing |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
7 | linux-amd64-gpu-t4-latest-1-testing |
7 | linux-amd64-gpu-rtxa6000-latest-1-testing |
6 | linux-amd64-gpu-rtx2080-latest-1-testing |
6 | linux-amd64-gpu-rtx4090-latest-1-testing |
2 | linux-amd64-gpu-h100-latest-1 |
🟨 CI finished in 22h 25m: Pass: 99%/526 | Total: 3d 02h | Avg: 8m 30s | Max: 44m 37s | Hits: 99%/31602
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 526)
# | Runner |
---|---|
348 | linux-amd64-cpu16 |
71 | linux-amd64-gpu-v100-latest-1 |
28 | linux-amd64-gpu-l4-latest-1-testing |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
7 | linux-amd64-gpu-t4-latest-1-testing |
7 | linux-amd64-gpu-rtxa6000-latest-1-testing |
6 | linux-amd64-gpu-rtx2080-latest-1-testing |
6 | linux-amd64-gpu-rtx4090-latest-1-testing |
2 | linux-amd64-gpu-h100-latest-1 |
2beac0e
to
c058dd7
Compare
🟨 CI finished in 8h 24m: Pass: 99%/421 | Total: 2d 14h | Avg: 8m 54s | Max: 58m 29s | Hits: 89%/34164
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
305 | linux-amd64-cpu16 |
65 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
🟩 CI finished in 4d 00h: Pass: 100%/421 | Total: 2d 14h | Avg: 8m 55s | Max: 58m 29s | Hits: 89%/34164
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
305 | linux-amd64-cpu16 |
65 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
c058dd7
to
130754a
Compare
🟨 CI finished in 3h 20m: Pass: 95%/421 | Total: 2d 07h | Avg: 7m 53s | Max: 40m 17s | Hits: 95%/41430
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
304 | linux-amd64-cpu16 |
66 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
🟩 CI finished in 1d 01h: Pass: 100%/421 | Total: 2d 13h | Avg: 8m 47s | Max: 40m 17s | Hits: 95%/41430
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
+/- | CUB |
Thrust | |
CUDA Experimental | |
pycuda |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
+/- | pycuda |
🏃 Runner counts (total jobs: 421)
# | Runner |
---|---|
304 | linux-amd64-cpu16 |
66 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
Some of the libcudacxx nightly tests are not running because they would fail.
We need to fix that and ensure them running fine