CAGRA-Q search #2206

enp1s0 · 2024-02-29T09:13:14Z

Rel: #1889

Limitations

Only 8-bit PQ is supported
Only a sub-space size of 2 is supported
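
A minimal sketch of how a caller might guard against these two limitations before running a search. The struct `vpq_params_sketch` and the function `check_cagra_q_search_limits` are hypothetical names used only for illustration, not RAFT APIs. The sketch also assumes that the "sub-space size" above corresponds to the `pq_len` parameter mentioned in the commits of this PR.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical parameter bundle for illustration; not the actual RAFT type.
struct vpq_params_sketch {
  std::uint32_t pq_bits;  // number of bits per PQ code
  std::uint32_t pq_len;   // sub-space size (assumed: dimensions covered by one PQ code)
};

// Returns an empty string if the parameters fall within the limitations
// listed in the PR description, otherwise a short reason.
inline std::string check_cagra_q_search_limits(const vpq_params_sketch& p)
{
  if (p.pq_bits != 8) { return "only 8-bit PQ is supported"; }
  if (p.pq_len != 2) { return "only a sub-space size of 2 is supported"; }
  return {};
}
```

For example, `check_cagra_q_search_limits({8, 2})` returns an empty string, while `{4, 2}` or `{8, 4}` produce a non-empty reason.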

copy-pr-bot · 2024-02-29T09:13:17Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

achirkin · 2024-03-04T19:16:40Z

/ok to test

achirkin

Thanks, @enp1s0, for working on this!
I love your modular approach of using a dataset descriptor to abstract away the distance computation code. I think we'll have to think a bit later about whether it's possible to unify/backport this to the IVF methods.

I'm now trying to integrate the index building (compression) part, and I want to figure out how much of the functionality to implement at first.
As far as I see now, both the ported code and the original prototype support only pq_bits = 8, is that correct? There's probably some limitation on possible pq_dim values as well.
Could you please write down these and possibly other limitations of the prototype/initial port (search part only) in the description of the PR?
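
To illustrate the descriptor idea discussed above, here is a host-side sketch under strong simplifying assumptions. It is not the code from this PR: all type and function names are hypothetical, the search is a brute-force scan rather than a CAGRA graph traversal, and the compressed variant uses plain PQ with a single shared codebook (8-bit codes, sub-space size 2) instead of the actual VPQ layout. The point is only that the search routine is written once against `compute_distance` and works with either storage format.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical descriptor for an uncompressed, row-major float dataset.
struct plain_descriptor_sketch {
  const float* data;
  std::size_t dim;
  // Squared L2 distance between the query and dataset row i.
  float compute_distance(const float* query, std::size_t i) const
  {
    float acc = 0.0f;
    for (std::size_t d = 0; d < dim; ++d) {
      const float diff = query[d] - data[i * dim + d];
      acc += diff * diff;
    }
    return acc;
  }
};

// Hypothetical descriptor for a PQ-compressed dataset:
// 8-bit codes, sub-space size 2, one codebook shared by all sub-spaces.
struct pq_descriptor_sketch {
  const std::uint8_t* codes;  // [n_rows, dim / 2]
  const float* codebook;      // [256, 2]
  std::size_t dim;            // assumed to be even
  // Squared L2 distance between the query and the decoded row i.
  float compute_distance(const float* query, std::size_t i) const
  {
    const std::size_t pq_dim = dim / 2;
    float acc = 0.0f;
    for (std::size_t s = 0; s < pq_dim; ++s) {
      const float* c = codebook + 2 * codes[i * pq_dim + s];
      const float d0 = query[2 * s] - c[0];
      const float d1 = query[2 * s + 1] - c[1];
      acc += d0 * d0 + d1 * d1;
    }
    return acc;
  }
};

// Written once against the descriptor interface; requires n_rows >= 1.
template <typename Descriptor>
std::size_t nearest_row(const Descriptor& desc, const float* query, std::size_t n_rows)
{
  std::size_t best = 0;
  float best_dist = desc.compute_distance(query, 0);
  for (std::size_t i = 1; i < n_rows; ++i) {
    const float dist = desc.compute_distance(query, i);
    if (dist < best_dist) {
      best_dist = dist;
      best = i;
    }
  }
  return best;
}
```

Swapping `plain_descriptor_sketch` for `pq_descriptor_sketch` changes only the argument passed to `nearest_row`, which is the property that makes the approach attractive for reuse elsewhere.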

achirkin · 2024-03-05T06:46:10Z

/ok to test

achirkin · 2024-03-13T09:07:20Z

/ok to test

tfeher

Thanks @enp1s0 and @achirkin for your work. The PR looks good to me. There are a few additional issues that we can handle in a follow-up PR.

Additionally, the tests seem to take too long to compile; I will check whether there is any unintended template instantiation:

| file | compile time | binary size |
| --- | --- | --- |
| ann_cagra/test_half_uint32_t.cu.o | 31:37 min | 91.827 MB |
| ann_cagra/test_int8_t_uint32_t.cu.o | 31:11 min | 91.846 MB |
| ann_cagra/test_float_uint32_t.cu.o | 30:41 min | 91.709 MB |
| ann_cagra/test_uint8_t_uint32_t.cu.o | 30:22 min | 91.874 MB |
| ann_cagra_vpq/test_float_int64_t.cu.o | 17:40 min | 46.242 MB |
| ann_cagra/test_half_int64_t.cu.o | 17:27 min | 46.422 MB |
| ann_cagra/test_float_int64_t.cu.o | 15:31 min | 37.865 MB |
| bench/cagra_float_uint32_t.cu.o | 14:09 min | 47.566 MB |
| test_filter_float_int64_t.cu.o | 10:49 min | 19.376 MB |
| ann_cagra_vpq/test_float_uint32_t.cu.o | 7:52 min | 11.921 MB |
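
One common way to keep templates from being instantiated in every translation unit is an explicit instantiation declaration (`extern template`) paired with a single explicit instantiation definition. The sketch below shows the mechanism on a toy function; `argmin_sketch` is a hypothetical name and this is not the code from the PR, although the later commit "Disable implicit template instantiations for vpq tests" suggests a similar intent.

```cpp
#include <cassert>
#include <cstdint>

// --- would live in a header ---
// Toy template standing in for an expensive-to-compile search kernel.
template <typename DataT, typename IdxT>
IdxT argmin_sketch(const DataT* values, IdxT n)
{
  IdxT best = 0;
  for (IdxT i = 1; i < n; ++i) {
    if (values[i] < values[best]) { best = i; }
  }
  return best;
}

// Explicit instantiation declaration: translation units that see this line
// do not implicitly instantiate the <float, uint32_t> specialization.
extern template std::uint32_t argmin_sketch<float, std::uint32_t>(const float*, std::uint32_t);

// --- would live in exactly one source file ---
// Explicit instantiation definition: the specialization is compiled once here.
template std::uint32_t argmin_sketch<float, std::uint32_t>(const float*, std::uint32_t);
```

With this split, test files that only call `argmin_sketch<float, std::uint32_t>` link against the single compiled instance instead of recompiling it.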

The issues have been addressed.

tfeher · 2024-03-21T05:52:35Z

/merge

Add the relevant options to the CAGRA parameter parser and refinement to the CAGRA ANN benchmark. No changes to the library code.
NB: the new option won't work correctly until #2206 is merged.

Authors:
- Artem M. Chirkin (https://github.com/achirkin)

Approvers:
- Tamas Bela Feher (https://github.com/tfeher)

URL: #2233

enp1s0 requested a review from a team as a code owner February 29, 2024 09:13

github-actions bot added the cpp label Feb 29, 2024

enp1s0 added the 5 - DO NOT MERGE, feature request, and non-breaking labels and removed the cpp label Feb 29, 2024

enp1s0 self-assigned this Mar 1, 2024

github-actions bot added the cpp label Mar 1, 2024

enp1s0 requested review from a team as code owners March 1, 2024 09:16

github-actions bot added CMake python ci labels Mar 1, 2024

tfeher added the Vector Search label Mar 4, 2024

github-actions bot removed CMake python ci labels Mar 4, 2024

achirkin reviewed Mar 4, 2024

achirkin reviewed Mar 5, 2024

cpp/include/raft/neighbors/detail/cagra/search_plan.cuh (outdated, resolved)

achirkin mentioned this pull request Mar 5, 2024

Add CAGRA-Q build (compression) #2213

Merged

github-actions bot added the CMake label Mar 11, 2024

achirkin reviewed Mar 11, 2024

cpp/include/raft/neighbors/cagra.cuh (outdated, resolved)

achirkin added a commit to achirkin/raft that referenced this pull request Mar 13, 2024

Remove the dynamic dispatch from public search function for it to be moved into detail namespace in rapidsai#2206

9a55874

achirkin reviewed Mar 13, 2024

cpp/include/raft/neighbors/detail/cagra/cagra_search.cuh (outdated, resolved)

achirkin reviewed Mar 13, 2024

cpp/include/raft/neighbors/detail/cagra/cagra_search.cuh (outdated, resolved)

achirkin reviewed Mar 13, 2024

cpp/include/raft/neighbors/detail/cagra/cagra_search.cuh (outdated, resolved)

tfeher approved these changes Mar 20, 2024

cpp/include/raft/neighbors/detail/cagra/cagra_search.cuh (outdated, resolved)

cpp/test/neighbors/ann_cagra_vpq.cuh (outdated, resolved)

achirkin mentioned this pull request Mar 20, 2024

Add CAGRA-Q to ANN benchmarks #2233

Merged

enp1s0 added 5 commits March 20, 2024 20:02

Fix typo

103b9c0

Fix VPQ search params validation

1fb7c36

Add dim size validation

89aa91e

Fix VPQ similarity computation for large dim

daf4f08

Update CAGRA VPQ test

38ab2bd

achirkin previously requested changes Mar 20, 2024

cpp/include/raft/neighbors/detail/cagra/cagra_search.cuh (outdated, resolved)

Merge branch 'branch-24.04' into cagra-q

15afe26

achirkin reviewed Mar 20, 2024

cpp/include/raft/neighbors/detail/cagra/cagra_search.cuh (outdated, resolved)

enp1s0 and others added 10 commits March 21, 2024 01:06

Update cpp/include/raft/neighbors/detail/cagra/cagra_search.cuh

5174811

Co-authored-by: Artem M. Chirkin <[email protected]>

Remove redundant team-size and dataset-block-dim parameters from the data descriptor

16ddb13

Mark the strided_dataset::view as deleted (pure virtual) to avoid linker errors

317c67f

Fix the instances in the tests as well

59033c7

Fix a bug in VPQ similarity compute

6567186

Disable implicit template instantiations for vpq tests

ecb896c

cagra-vpq enable instantiation of int64 kernels

1308c61

Correct copyright year

6d663ae

Update query copy from dmem to smem

0e29876

Merge branch 'cagra-q' of github.com:enp1s0/raft into cagra-q

31b6982

achirkin reviewed Mar 20, 2024

cpp/include/raft/neighbors/detail/cagra/compute_distance_vpq.cuh (outdated, resolved)

achirkin reviewed Mar 20, 2024

cpp/include/raft/neighbors/detail/cagra/compute_distance_vpq.cuh (outdated, resolved)

achirkin and others added 2 commits March 20, 2024 22:33

Fix query mapping type and usage of a macro that is not available on older cuda

6ebb99e

Set pq_len=2 as default, do not allow different pq_len for search

b2cdb6d

rapids-bot bot merged commit de7341e into rapidsai:branch-24.04 Mar 21, 2024
71 checks passed

tfeher mentioned this pull request Mar 21, 2024

[FEA] CAGRA-Q #1889

Closed

tfeher mentioned this pull request Apr 17, 2024

InnerProduct Distance Metric for CAGRA search #2260

Merged
