Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] CAGRA-Q #1889

Closed
cjnolet opened this issue Oct 11, 2023 · 1 comment
Closed

[FEA] CAGRA-Q #1889

cjnolet opened this issue Oct 11, 2023 · 1 comment
Assignees
Labels
feature request New feature or request Vector Search

Comments

@cjnolet
Copy link
Member

cjnolet commented Oct 11, 2023

There has been research and exploration around some extensions to CAGRA that use product quantization to effectively compress the input dataset, thus shrinking the CAGRA footprint and providing support for larger data sizes.

This is a placeholder issue to be taken up once CAGRA-Q research has gotten to a point where we feel it's ready to be ported to RAFT. Currently, this is showing great performance on GH using huge page pinned memory for the underlying optimized graph, allowing for the search times to be comparable to the graph stored fully in device memory.

@cjnolet cjnolet added the feature request New feature or request label Oct 11, 2023
@enp1s0 enp1s0 self-assigned this Feb 20, 2024
@enp1s0 enp1s0 mentioned this issue Feb 29, 2024
rapids-bot bot pushed a commit that referenced this issue Mar 21, 2024
Rel: #1889

## Limitations
- Only 8-bit PQ is supported
- Sub-space size is only 2 supported

Authors:
  - tsuki (https://github.com/enp1s0)
  - Artem M. Chirkin (https://github.com/achirkin)
  - Tamas Bela Feher (https://github.com/tfeher)

Approvers:
  - Tamas Bela Feher (https://github.com/tfeher)

URL: #2206
@tfeher
Copy link
Contributor

tfeher commented Mar 21, 2024

CAGRA-Q added in #2206 and #2213.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request Vector Search
Projects
Development

No branches or pull requests

3 participants