-
Notifications
You must be signed in to change notification settings - Fork 197
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integrate KNN implementation: ivf-flat #652
Integrate KNN implementation: ivf-flat #652
Conversation
…eparate out cuivfl handle init in separate func
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One more batch of comments.
Co-authored-by: Tamas Bela Feher <[email protected]>
Upstream changes: rapidsai/raft#652
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Discussed further changes offline to get this in for 22.08.
rerun tests |
1 similar comment
rerun tests |
…n out-of-bounds read of the last entry
…ded in .cpp files as before (in cuml)
Thanks Artem for the updates! I had a look at the changes since my last review, and overall it looks good to me. I have some smaller comments (mostly about docs) we can address those in #747. |
ec23d69
to
8b26750
Compare
rerun tests |
@gpucibot merge |
…rim and improve performance for thin matrices (small `n_cols`) (#979) This follows up on a discussion at #652 (comment). The main goal of this PR is to make this helper accessible as a raft primitive. I also used the opportunity to look at the performance of this primitive, and have improved it for: - Thin matrices: less than 32 threads per row with shuffle-based reductions. - Thick matrices: cub-based reduction doing one row per block. Here is an overview of the before/after performance on A100: ![2022-11-11_normalize_perf_float_int32](https://user-images.githubusercontent.com/17441062/201403965-bf68d368-b64b-4a1f-92f0-a5de03b9d1a8.png) Authors: - Louis Sugy (https://github.com/Nyrio) Approvers: - Tamas Bela Feher (https://github.com/tfeher) - Corey J. Nolet (https://github.com/cjnolet) - Artem M. Chirkin (https://github.com/achirkin) URL: #979
Integrate a new KNN implementation.