Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More simplifications for CUB util_device #1948

Merged
merged 1 commit into from
Jul 18, 2024

Conversation

bernhardmgruber
Copy link
Contributor

Here are a few more simplifications of CUB's util_device.cuh.

@bernhardmgruber bernhardmgruber added the cub For all items related to CUB label Jul 6, 2024
Copy link
Contributor

github-actions bot commented Jul 6, 2024

🟩 CI finished in 3h 47m: Pass: 100%/249 | Total: 4d 20h | Avg: 28m 11s | Max: 59m 52s | Hits: 63%/248564
  • 🟩 cub: Pass: 100%/131 | Total: 2d 19h | Avg: 30m 59s | Max: 51m 53s | Hits: 56%/109298

    🟩 cpu
      🟩 amd64              Pass: 100%/123 | Total:  2d 15h | Avg: 30m 46s | Max: 51m 53s | Hits:  57%/102474
      🟩 arm64              Pass: 100%/8   | Total:  4h 36m | Avg: 34m 31s | Max: 36m 10s | Hits:  42%/6824  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 39m | Avg: 30m 36s | Max: 46m 40s | Hits:  40%/11583 
      🟩 11.8               Pass: 100%/3   | Total:  2h 18m | Avg: 46m 18s | Max: 51m 53s | Hits:  41%/2559  
      🟩 12.5               Pass: 100%/113 | Total:  2d 09h | Avg: 30m 38s | Max: 46m 20s | Hits:  58%/95156 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 41m 34s | Avg: 20m 47s | Max: 20m 51s | Hits:  44%/1410  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 39m | Avg: 30m 36s | Max: 46m 40s | Hits:  40%/11583 
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 18m | Avg: 46m 18s | Max: 51m 53s | Hits:  41%/2559  
      🟩 nvcc12.5           Pass: 100%/111 | Total:  2d 09h | Avg: 30m 49s | Max: 46m 20s | Hits:  59%/93746 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 41m 34s | Avg: 20m 47s | Max: 20m 51s | Hits:  44%/1410  
      🟩 nvcc               Pass: 100%/129 | Total:  2d 18h | Avg: 31m 09s | Max: 51m 53s | Hits:  56%/107888
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 06m | Avg: 31m 06s | Max: 36m 36s | Hits:  41%/4896  
      🟩 Clang10            Pass: 100%/3   | Total:  1h 43m | Avg: 34m 23s | Max: 35m 08s | Hits:  42%/2565  
      🟩 Clang11            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 06s | Max: 34m 31s | Hits:  42%/3420  
      🟩 Clang12            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 09s | Max: 33m 57s | Hits:  42%/3420  
      🟩 Clang13            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 39s | Max: 34m 52s | Hits:  42%/3420  
      🟩 Clang14            Pass: 100%/4   | Total:  2h 26m | Avg: 36m 30s | Max: 38m 19s | Hits:  42%/3420  
      🟩 Clang15            Pass: 100%/4   | Total:  2h 21m | Avg: 35m 17s | Max: 38m 33s | Hits:  42%/3412  
      🟩 Clang16            Pass: 100%/4   | Total:  2h 21m | Avg: 35m 25s | Max: 40m 00s | Hits:  42%/3412  
      🟩 Clang17            Pass: 100%/26  | Total: 11h 14m | Avg: 25m 55s | Max: 38m 16s | Hits:  78%/21882 
      🟩 GCC6               Pass: 100%/2   | Total: 57m 23s | Avg: 28m 41s | Max: 29m 52s | Hits:  40%/1554  
      🟩 GCC7               Pass: 100%/6   | Total:  3h 15m | Avg: 32m 39s | Max: 38m 27s | Hits:  41%/4899  
      🟩 GCC8               Pass: 100%/6   | Total:  3h 20m | Avg: 33m 24s | Max: 39m 29s | Hits:  41%/4899  
      🟩 GCC9               Pass: 100%/6   | Total:  3h 16m | Avg: 32m 46s | Max: 36m 42s | Hits:  41%/4899  
      🟩 GCC10              Pass: 100%/4   | Total:  2h 22m | Avg: 35m 43s | Max: 37m 06s | Hits:  42%/3420  
      🟩 GCC11              Pass: 100%/7   | Total:  4h 37m | Avg: 39m 40s | Max: 51m 53s | Hits:  41%/5971  
      🟩 GCC12              Pass: 100%/4   | Total:  2h 22m | Avg: 35m 38s | Max: 36m 45s | Hits:  41%/3412  
      🟩 GCC13              Pass: 100%/28  | Total: 11h 01m | Avg: 23m 38s | Max: 46m 12s | Hits:  75%/23884 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 00m | Avg: 40m 18s | Max: 44m 13s | Hits:  40%/2337  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 46m 40s | Avg: 46m 40s | Max: 46m 40s | Hits:  42%/696   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 30m | Avg: 45m 25s | Max: 46m 20s | Hits:  42%/1392  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 10m | Avg: 43m 30s | Max: 45m 30s | Hits:  42%/2088  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/59  | Total:  1d 05h | Avg: 30m 26s | Max: 40m 00s | Hits:  58%/49847 
      🟩 GCC                Pass: 100%/63  | Total:  1d 07h | Avg: 29m 45s | Max: 51m 53s | Hits:  56%/52938 
      🟩 Intel              Pass: 100%/3   | Total:  2h 00m | Avg: 40m 18s | Max: 44m 13s | Hits:  40%/2337  
      🟩 MSVC               Pass: 100%/6   | Total:  4h 28m | Avg: 44m 40s | Max: 46m 40s | Hits:  42%/4176  
    🟩 gpu
      🟩 v100               Pass: 100%/131 | Total:  2d 19h | Avg: 30m 59s | Max: 51m 53s | Hits:  56%/109298
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  2d 08h | Avg: 34m 02s | Max: 51m 53s | Hits:  42%/82002 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 43m | Avg: 20m 26s | Max: 28m 17s | Hits:  99%/6824  
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 15m | Avg: 16m 58s | Max: 26m 28s | Hits:  99%/6824  
      🟩 HostLaunch         Pass: 100%/8   | Total:  3h 09m | Avg: 23m 44s | Max: 46m 12s | Hits:  94%/6824  
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 20m | Avg: 25m 06s | Max: 29m 45s | Hits:  99%/6824  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 18m | Avg: 46m 18s | Max: 51m 53s | Hits:  41%/2559  
      🟩 90a                Pass: 100%/4   | Total:  1h 17m | Avg: 19m 28s | Max: 20m 17s | Hits:  41%/3412  
    🟩 std
      🟩 11                 Pass: 100%/34  | Total: 16h 40m | Avg: 29m 25s | Max: 44m 42s | Hits:  57%/28571 
      🟩 14                 Pass: 100%/37  | Total: 19h 59m | Avg: 32m 25s | Max: 51m 53s | Hits:  54%/30659 
      🟩 17                 Pass: 100%/36  | Total: 19h 22m | Avg: 32m 17s | Max: 46m 12s | Hits:  54%/29891 
      🟩 20                 Pass: 100%/24  | Total: 11h 37m | Avg: 29m 04s | Max: 43m 00s | Hits:  61%/20177 
    
  • 🟩 thrust: Pass: 100%/118 | Total: 2d 01h | Avg: 25m 04s | Max: 59m 52s | Hits: 68%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  1d 21h | Avg: 25m 01s | Max: 59m 52s | Hits:  69%/129822
      🟩 arm64              Pass: 100%/8   | Total:  3h 25m | Avg: 25m 39s | Max: 27m 44s | Hits:  63%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  6h 28m | Avg: 25m 52s | Max: 46m 00s | Hits:  61%/17705 
      🟩 11.8               Pass: 100%/3   | Total:  1h 41m | Avg: 33m 53s | Max: 36m 16s | Hits:  63%/3543  
      🟩 12.5               Pass: 100%/100 | Total:  1d 17h | Avg: 24m 41s | Max: 59m 52s | Hits:  70%/118018
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 49m 30s | Avg: 24m 45s | Max: 25m 49s | Hits:  62%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 28m | Avg: 25m 52s | Max: 46m 00s | Hits:  61%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 41m | Avg: 33m 53s | Max: 36m 16s | Hits:  63%/3543  
      🟩 nvcc12.5           Pass: 100%/98  | Total:  1d 16h | Avg: 24m 41s | Max: 59m 52s | Hits:  70%/115658
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 49m 30s | Avg: 24m 45s | Max: 25m 49s | Hits:  62%/2360  
      🟩 nvcc               Pass: 100%/116 | Total:  2d 00h | Avg: 25m 04s | Max: 59m 52s | Hits:  68%/136906
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 28m | Avg: 24m 47s | Max: 28m 02s | Hits:  63%/7080  
      🟩 Clang10            Pass: 100%/3   | Total:  1h 18m | Avg: 26m 17s | Max: 28m 55s | Hits:  63%/3540  
      🟩 Clang11            Pass: 100%/4   | Total:  1h 40m | Avg: 25m 13s | Max: 26m 45s | Hits:  63%/4720  
      🟩 Clang12            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 20s | Max: 30m 46s | Hits:  63%/4720  
      🟩 Clang13            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 21s | Max: 28m 36s | Hits:  63%/4720  
      🟩 Clang14            Pass: 100%/4   | Total:  1h 44m | Avg: 26m 07s | Max: 27m 44s | Hits:  63%/4720  
      🟩 Clang15            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 25s | Max: 28m 19s | Hits:  63%/4720  
      🟩 Clang16            Pass: 100%/4   | Total:  1h 44m | Avg: 26m 06s | Max: 28m 31s | Hits:  63%/4720  
      🟩 Clang17            Pass: 100%/18  | Total:  5h 27m | Avg: 18m 12s | Max: 27m 03s | Hits:  79%/21240 
      🟩 GCC6               Pass: 100%/2   | Total: 50m 47s | Avg: 25m 23s | Max: 30m 51s | Hits:  52%/2360  
      🟩 GCC7               Pass: 100%/6   | Total:  2h 35m | Avg: 25m 53s | Max: 30m 21s | Hits:  63%/7086  
      🟩 GCC8               Pass: 100%/6   | Total:  2h 34m | Avg: 25m 45s | Max: 28m 26s | Hits:  63%/7086  
      🟩 GCC9               Pass: 100%/6   | Total:  2h 34m | Avg: 25m 43s | Max: 28m 54s | Hits:  63%/7086  
      🟩 GCC10              Pass: 100%/4   | Total:  1h 52m | Avg: 28m 14s | Max: 29m 55s | Hits:  63%/4724  
      🟩 GCC11              Pass: 100%/7   | Total:  3h 29m | Avg: 29m 52s | Max: 36m 16s | Hits:  63%/8267  
      🟩 GCC12              Pass: 100%/4   | Total:  1h 53m | Avg: 28m 15s | Max: 32m 23s | Hits:  63%/4724  
      🟩 GCC13              Pass: 100%/20  | Total:  6h 05m | Avg: 18m 15s | Max: 33m 26s | Hits:  77%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 37m | Avg: 32m 31s | Max: 35m 37s | Hits:  63%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 46m 00s | Avg: 46m 00s | Max: 46m 00s | Hits:  61%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 34m | Avg: 47m 00s | Max: 48m 21s | Hits:  61%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  3h 44m | Avg: 37m 21s | Max: 59m 52s | Hits:  80%/7056  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 19h 41m | Avg: 23m 10s | Max: 30m 46s | Hits:  69%/60180 
      🟩 GCC                Pass: 100%/55  | Total: 21h 55m | Avg: 23m 54s | Max: 36m 16s | Hits:  68%/64953 
      🟩 Intel              Pass: 100%/3   | Total:  1h 37m | Avg: 32m 31s | Max: 35m 37s | Hits:  63%/3549  
      🟩 MSVC               Pass: 100%/9   | Total:  6h 04m | Avg: 40m 27s | Max: 59m 52s | Hits:  73%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  2d 01h | Avg: 25m 04s | Max: 59m 52s | Hits:  68%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  1d 21h | Avg: 27m 40s | Max: 59m 52s | Hits:  62%/116850
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 46m | Avg:  9m 39s | Max: 21m 10s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 52m | Avg: 14m 00s | Max: 18m 34s | Hits:  99%/9444  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 41m | Avg: 33m 53s | Max: 36m 16s | Hits:  63%/3543  
      🟩 90a                Pass: 100%/4   | Total:  1h 01m | Avg: 15m 29s | Max: 18m 22s | Hits:  63%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 10h 29m | Avg: 20m 58s | Max: 29m 27s | Hits:  69%/35418 
      🟩 14                 Pass: 100%/34  | Total: 15h 19m | Avg: 27m 02s | Max: 54m 54s | Hits:  67%/40122 
      🟩 17                 Pass: 100%/33  | Total: 14h 41m | Avg: 26m 43s | Max: 51m 05s | Hits:  68%/38946 
      🟩 20                 Pass: 100%/21  | Total:  8h 48m | Avg: 25m 08s | Max: 59m 52s | Hits:  71%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber marked this pull request as ready for review July 8, 2024 07:42
@bernhardmgruber bernhardmgruber requested review from a team as code owners July 8, 2024 07:42
Copy link
Contributor

🟩 CI finished in 7h 45m: Pass: 100%/250 | Total: 5d 01h | Avg: 29m 09s | Max: 57m 10s | Hits: 44%/248210
  • 🟩 cub: Pass: 100%/131 | Total: 2d 20h | Avg: 31m 13s | Max: 49m 51s | Hits: 44%/109298

    🟩 cpu
      🟩 amd64              Pass: 100%/123 | Total:  2d 15h | Avg: 30m 56s | Max: 49m 51s | Hits:  44%/102474
      🟩 arm64              Pass: 100%/8   | Total:  4h 44m | Avg: 35m 30s | Max: 37m 41s | Hits:  42%/6824  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 36m | Avg: 30m 27s | Max: 49m 22s | Hits:  40%/11583 
      🟩 11.8               Pass: 100%/3   | Total:  2h 14m | Avg: 44m 49s | Max: 47m 24s | Hits:  41%/2559  
      🟩 12.5               Pass: 100%/113 | Total:  2d 10h | Avg: 30m 58s | Max: 49m 51s | Hits:  45%/95156 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 42m 27s | Avg: 21m 13s | Max: 21m 17s | Hits:  46%/1410  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 36m | Avg: 30m 27s | Max: 49m 22s | Hits:  40%/11583 
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 14m | Avg: 44m 49s | Max: 47m 24s | Hits:  41%/2559  
      🟩 nvcc12.5           Pass: 100%/111 | Total:  2d 09h | Avg: 31m 08s | Max: 49m 51s | Hits:  44%/93746 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 42m 27s | Avg: 21m 13s | Max: 21m 17s | Hits:  46%/1410  
      🟩 nvcc               Pass: 100%/129 | Total:  2d 19h | Avg: 31m 22s | Max: 49m 51s | Hits:  44%/107888
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 08m | Avg: 31m 27s | Max: 34m 37s | Hits:  23%/4896  
      🟩 Clang10            Pass: 100%/3   | Total:  1h 48m | Avg: 36m 04s | Max: 36m 43s | Hits:   7%/2565  
      🟩 Clang11            Pass: 100%/4   | Total:  2h 20m | Avg: 35m 09s | Max: 37m 31s | Hits:   7%/3420  
      🟩 Clang12            Pass: 100%/4   | Total:  2h 18m | Avg: 34m 33s | Max: 35m 53s | Hits:   7%/3420  
      🟩 Clang13            Pass: 100%/4   | Total:  2h 19m | Avg: 34m 50s | Max: 36m 24s | Hits:   7%/3420  
      🟩 Clang14            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 17s | Max: 35m 33s | Hits:   8%/3420  
      🟩 Clang15            Pass: 100%/4   | Total:  2h 19m | Avg: 34m 57s | Max: 36m 18s | Hits:  42%/3412  
      🟩 Clang16            Pass: 100%/4   | Total:  2h 15m | Avg: 33m 58s | Max: 34m 24s | Hits:  42%/3412  
      🟩 Clang17            Pass: 100%/26  | Total: 10h 43m | Avg: 24m 44s | Max: 37m 41s | Hits:  78%/21882 
      🟩 GCC6               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 02s | Max: 31m 15s | Hits:  40%/1554  
      🟩 GCC7               Pass: 100%/6   | Total:  3h 07m | Avg: 31m 10s | Max: 34m 21s | Hits:  22%/4899  
      🟩 GCC8               Pass: 100%/6   | Total:  3h 12m | Avg: 32m 09s | Max: 35m 52s | Hits:  22%/4899  
      🟩 GCC9               Pass: 100%/6   | Total:  3h 19m | Avg: 33m 13s | Max: 37m 44s | Hits:  22%/4899  
      🟩 GCC10              Pass: 100%/4   | Total:  2h 25m | Avg: 36m 28s | Max: 38m 22s | Hits:   6%/3420  
      🟩 GCC11              Pass: 100%/7   | Total:  4h 36m | Avg: 39m 25s | Max: 47m 24s | Hits:  41%/5971  
      🟩 GCC12              Pass: 100%/4   | Total:  2h 29m | Avg: 37m 20s | Max: 40m 40s | Hits:  41%/3412  
      🟩 GCC13              Pass: 100%/28  | Total: 11h 52m | Avg: 25m 26s | Max: 44m 19s | Hits:  67%/23884 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 56m | Avg: 38m 47s | Max: 41m 43s | Hits:   3%/2337  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 49m 22s | Avg: 49m 22s | Max: 49m 22s | Hits:  42%/696   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 30m | Avg: 45m 04s | Max: 46m 31s | Hits:  42%/1392  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 20m | Avg: 46m 57s | Max: 49m 51s | Hits:  42%/2088  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/59  | Total:  1d 05h | Avg: 30m 01s | Max: 37m 41s | Hits:  45%/49847 
      🟩 GCC                Pass: 100%/63  | Total:  1d 08h | Avg: 30m 31s | Max: 47m 24s | Hits:  45%/52938 
      🟩 Intel              Pass: 100%/3   | Total:  1h 56m | Avg: 38m 47s | Max: 41m 43s | Hits:   3%/2337  
      🟩 MSVC               Pass: 100%/6   | Total:  4h 40m | Avg: 46m 43s | Max: 49m 51s | Hits:  42%/4176  
    🟩 gpu
      🟩 v100               Pass: 100%/131 | Total:  2d 20h | Avg: 31m 13s | Max: 49m 51s | Hits:  44%/109298
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  2d 09h | Avg: 34m 33s | Max: 49m 51s | Hits:  26%/82002 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 55m | Avg: 21m 57s | Max: 44m 19s | Hits:  94%/6824  
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 07m | Avg: 15m 55s | Max: 18m 48s | Hits:  99%/6824  
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 28m | Avg: 18m 31s | Max: 23m 53s | Hits:  99%/6824  
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 37m | Avg: 27m 10s | Max: 32m 39s | Hits:  99%/6824  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 14m | Avg: 44m 49s | Max: 47m 24s | Hits:  41%/2559  
      🟩 90a                Pass: 100%/4   | Total:  1h 22m | Avg: 20m 44s | Max: 23m 06s | Hits:   6%/3412  
    🟩 std
      🟩 11                 Pass: 100%/34  | Total: 17h 40m | Avg: 31m 11s | Max: 44m 26s | Hits:  40%/28571 
      🟩 14                 Pass: 100%/37  | Total: 19h 40m | Avg: 31m 53s | Max: 49m 22s | Hits:  43%/30659 
      🟩 17                 Pass: 100%/36  | Total: 18h 46m | Avg: 31m 17s | Max: 47m 24s | Hits:  43%/29891 
      🟩 20                 Pass: 100%/24  | Total: 12h 03m | Avg: 30m 09s | Max: 49m 51s | Hits:  53%/20177 
    
  • 🟩 thrust: Pass: 100%/118 | Total: 2d 05h | Avg: 27m 01s | Max: 57m 10s | Hits: 44%/138912

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  2d 01h | Avg: 27m 04s | Max: 57m 10s | Hits:  44%/129492
      🟩 arm64              Pass: 100%/8   | Total:  3h 31m | Avg: 26m 26s | Max: 29m 14s | Hits:  57%/9420  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 09m | Avg: 28m 36s | Max: 46m 23s | Hits:  20%/17660 
      🟩 11.8               Pass: 100%/3   | Total:  1h 57m | Avg: 39m 05s | Max: 42m 45s | Hits:  17%/3534  
      🟩 12.5               Pass: 100%/100 | Total:  1d 20h | Avg: 26m 26s | Max: 57m 10s | Hits:  49%/117718
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 47m 32s | Avg: 23m 46s | Max: 24m 41s | Hits:  62%/2354  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 09m | Avg: 28m 36s | Max: 46m 23s | Hits:  20%/17660 
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 57m | Avg: 39m 05s | Max: 42m 45s | Hits:  17%/3534  
      🟩 nvcc12.5           Pass: 100%/98  | Total:  1d 19h | Avg: 26m 29s | Max: 57m 10s | Hits:  49%/115364
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 47m 32s | Avg: 23m 46s | Max: 24m 41s | Hits:  62%/2354  
      🟩 nvcc               Pass: 100%/116 | Total:  2d 04h | Avg: 27m 05s | Max: 57m 10s | Hits:  44%/136558
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 49m | Avg: 28m 15s | Max: 31m 37s | Hits:  17%/7062  
      🟩 Clang10            Pass: 100%/3   | Total:  1h 35m | Avg: 31m 51s | Max: 34m 06s | Hits:  17%/3531  
      🟩 Clang11            Pass: 100%/4   | Total:  1h 58m | Avg: 29m 43s | Max: 32m 07s | Hits:  18%/4708  
      🟩 Clang12            Pass: 100%/4   | Total:  2h 10m | Avg: 32m 33s | Max: 34m 44s | Hits:  18%/4708  
      🟩 Clang13            Pass: 100%/4   | Total:  1h 57m | Avg: 29m 29s | Max: 30m 36s | Hits:  18%/4708  
      🟩 Clang14            Pass: 100%/4   | Total:  1h 46m | Avg: 26m 37s | Max: 28m 18s | Hits:  62%/4708  
      🟩 Clang15            Pass: 100%/4   | Total:  1h 43m | Avg: 25m 47s | Max: 27m 27s | Hits:  62%/4708  
      🟩 Clang16            Pass: 100%/4   | Total:  1h 47m | Avg: 26m 45s | Max: 29m 46s | Hits:  62%/4708  
      🟩 Clang17            Pass: 100%/18  | Total:  5h 36m | Avg: 18m 42s | Max: 27m 57s | Hits:  79%/21186 
      🟩 GCC6               Pass: 100%/2   | Total: 52m 32s | Avg: 26m 16s | Max: 28m 13s | Hits:  17%/2354  
      🟩 GCC7               Pass: 100%/6   | Total:  2h 51m | Avg: 28m 33s | Max: 34m 13s | Hits:  17%/7068  
      🟩 GCC8               Pass: 100%/6   | Total:  2h 52m | Avg: 28m 41s | Max: 31m 52s | Hits:  17%/7068  
      🟩 GCC9               Pass: 100%/6   | Total:  3h 01m | Avg: 30m 14s | Max: 34m 43s | Hits:  17%/7068  
      🟩 GCC10              Pass: 100%/4   | Total:  2h 01m | Avg: 30m 16s | Max: 33m 20s | Hits:  17%/4712  
      🟩 GCC11              Pass: 100%/7   | Total:  3h 53m | Avg: 33m 24s | Max: 42m 45s | Hits:  36%/8246  
      🟩 GCC12              Pass: 100%/4   | Total:  2h 07m | Avg: 31m 56s | Max: 33m 55s | Hits:  18%/4712  
      🟩 GCC13              Pass: 100%/20  | Total:  6h 07m | Avg: 18m 22s | Max: 29m 39s | Hits:  66%/23560 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 03m | Avg: 41m 06s | Max: 45m 57s | Hits:   2%/3540  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 46m 23s | Avg: 46m 23s | Max: 46m 23s | Hits:  61%/1173  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 34m | Avg: 47m 28s | Max: 48m 25s | Hits:  61%/2346  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  3h 31m | Avg: 35m 17s | Max: 57m 10s | Hits:  80%/7038  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 21h 25m | Avg: 25m 12s | Max: 34m 44s | Hits:  49%/60027 
      🟩 GCC                Pass: 100%/55  | Total: 23h 47m | Avg: 25m 57s | Max: 42m 45s | Hits:  37%/64788 
      🟩 Intel              Pass: 100%/3   | Total:  2h 03m | Avg: 41m 06s | Max: 45m 57s | Hits:   2%/3540  
      🟩 MSVC               Pass: 100%/9   | Total:  5h 53m | Avg: 39m 14s | Max: 57m 10s | Hits:  73%/10557 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  2d 05h | Avg: 27m 01s | Max: 57m 10s | Hits:  44%/138912
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  2d 01h | Avg: 30m 03s | Max: 57m 10s | Hits:  34%/116553
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 43m | Avg:  9m 23s | Max: 19m 02s | Hits:  99%/12939 
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 50m | Avg: 13m 48s | Max: 24m 01s | Hits:  99%/9420  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 57m | Avg: 39m 05s | Max: 42m 45s | Hits:  17%/3534  
      🟩 90a                Pass: 100%/4   | Total:  1h 15m | Avg: 18m 57s | Max: 20m 19s | Hits:  17%/4712  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 11h 35m | Avg: 23m 10s | Max: 35m 06s | Hits:  37%/35328 
      🟩 14                 Pass: 100%/34  | Total: 16h 31m | Avg: 29m 09s | Max: 49m 54s | Hits:  44%/40020 
      🟩 17                 Pass: 100%/33  | Total: 15h 51m | Avg: 28m 50s | Max: 50m 13s | Hits:  44%/38847 
      🟩 20                 Pass: 100%/21  | Total:  9h 11m | Avg: 26m 16s | Max: 57m 10s | Hits:  56%/24717 
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 10m 29s | Avg: 10m 29s | Max: 10m 29s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
pycuda

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda

🏃‍ Runner counts (total jobs: 250)

# Runner
178 linux-amd64-cpu16
41 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber merged commit 87d0849 into NVIDIA:main Jul 18, 2024
263 checks passed
@bernhardmgruber bernhardmgruber deleted the ref_util branch July 18, 2024 16:13
pciolkosz pushed a commit to pciolkosz/cccl that referenced this pull request Aug 4, 2024
pciolkosz pushed a commit to pciolkosz/cccl that referenced this pull request Aug 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cub For all items related to CUB
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

2 participants