Add device-side support for int.bit_count (which just lowers to cuda.… #130
Job | Run time |
---|---|
5s | |
35s | |
46s | |
4s | |
2s | |
2m 14s | |
3m 16s | |
2s | |
1m 19s | |
9m 4s | |
9m 1s | |
8m 33s | |
8m 42s | |
8m 52s | |
7m 36s | |
6m 39s | |
6m 38s | |
4s | |
10m 59s | |
11m 14s | |
10m 38s | |
10m 20s | |
8m 32s | |
7m 36s | |
7m 7s | |
8m 9s | |
1s | |
2h 28m 8s |