[ENH] Add `__half` overloads to math API wrappers #1542
Labels
0 - Backlog
In queue waiting for assignment
cpp
improvement
Improvement / enhancement to an existing function
The wrappers in https://github.com/rapidsai/raft/blob/branch-23.06/cpp/include/raft/core/math.hpp delegate work to the appropriate CUDA intrinsic. However, the CUDA intrinsics for square root, trigonometry functions, etc. have different names for `__half` (e.g. `hsqrt` instead of `sqrt`).

To facilitate templated code, we could add overloads of `sqrt` calling `hsqrt` on the device, and similarly for trigonometry functions.

On host, we can either not define those, or use the fp32 functions.
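
For illustration, here is a minimal sketch of what such an overload could look like. It is not the actual code in `math.hpp`: the `sketch` namespace, `half_t`, `to_float`, `from_float`, `root` and the `SKETCH_HD` macro are hypothetical names used only for this example. Under `nvcc` it uses the real `__half`, `hsqrt`, `__half2float` and `__float2half` from `cuda_fp16.h` (`hsqrt` needs compute capability 5.3 or higher). Without CUDA, a stand-in struct replaces `__half` so that the overload pattern can still be compiled and tried on a plain host compiler. On host, the sketch takes the second option above and goes through the fp32 function.

```cpp
#include <cmath>

#if defined(__CUDACC__)
#include <cuda_fp16.h>
#define SKETCH_HD __host__ __device__
#else
#define SKETCH_HD
#endif

namespace sketch {  // hypothetical namespace, not part of RAFT

#if defined(__CUDACC__)
// With nvcc: use the real half type and the conversions from cuda_fp16.h.
using half_t = __half;
SKETCH_HD inline float to_float(half_t h) { return __half2float(h); }
SKETCH_HD inline half_t from_float(float f) { return __float2half(f); }
#else
// Assumption for this example only: a host stand-in that stores an fp32
// value, so it does NOT reproduce fp16 rounding or range.
struct half_t {
  float value;
};
inline float to_float(half_t h) { return h.value; }
inline half_t from_float(float f) { return half_t{f}; }
#endif

// Generic wrapper, in the spirit of the existing ones: delegate to the
// function that matches the compilation target.
template <typename T>
SKETCH_HD inline T sqrt(T x)
{
#ifdef __CUDA_ARCH__
  return ::sqrt(x);
#else
  return std::sqrt(x);
#endif
}

// Proposed overload: same name, but it calls the differently named half
// intrinsic on device and falls back to the fp32 function on host.
SKETCH_HD inline half_t sqrt(half_t x)
{
#ifdef __CUDA_ARCH__
  return hsqrt(x);
#else
  return from_float(std::sqrt(to_float(x)));
#endif
}

// Templated caller: the same source works for float, double and half_t,
// because overload resolution prefers the non-template half_t overload.
template <typename T>
SKETCH_HD inline T root(T x)
{
  return sketch::sqrt(x);
}

}  // namespace sketch
```

With this in place, `sketch::root(x)` compiles unchanged whether `x` is a `float`, a `double` or a `half_t`. The trigonometry functions could follow the same pattern with `hsin`, `hcos`, etc.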