You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
tfeher
changed the title
Re-introduce template instantiations to reduce compile time (were removed here: 358c09c and here 4afb03e)
Re-introduce CAGRA template instantiations to reduce compile time
Apr 20, 2023
Cagra was introduced header only in #1375. This PR adds a precompiled single- and multi-cta search kernels to libraft.so.
The single- and multi-cta search kernels were moved to separate header files to make it easier to specify extern template instantiations for these.
The macros for dispatching the kernels were replaced by functions. We define explicit instantiations for the top level dispatch functions. (This is in contrast to #1428 where the kernels themselves were instantiated, which resulted in a large number of parameter combinations that had to be explicitly spelled out.)
This PR fixes#1443.
Authors:
- Tamas Bela Feher (https://github.com/tfeher)
Approvers:
- Corey J. Nolet (https://github.com/cjnolet)
URL: #1650
No description provided.
The text was updated successfully, but these errors were encountered: