Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Improve performance of select-top-k WARP_SORT implementation (rapidsa…
…i#606) A few simplifications and tricks to improve the performance of the kernel: - Promote some constants to static constexpr - Allow `capacity < WarpSize` - Reduce the frequency of `sort` operations for `filtered` version - Remove `warp_sort::load` to simplify the api and implementation Authors: - Artem M. Chirkin (https://github.com/achirkin) Approvers: - Tamas Bela Feher (https://github.com/tfeher) URL: rapidsai#606
- Loading branch information