Faster GELU forward & backward using MUFU.TANH for SM7.5+ #1635
ci.yml
on: pull_request
build-cuda-windows
2m 11s
build-ubuntu20-04
2m 36s
build-cuda-fp32
1m 22s
build-cuda-bf16
1m 13s
build-cuda-fp16
1m 11s
build-cuda-kernels
1m 28s
Matrix: build-and-test-cpu