CosineExpanded Metric for IVF-PQ (normalize inputs) #346
Conversation
Thanks for the PR!
Couple nitpicks and a more substantial question about the cluster center handling here.
@@ -156,6 +172,35 @@ void select_clusters(raft::resources const& handle,
      n_lists,
      stream);

    if (metric == distance::DistanceType::CosineExpanded) {
      // TODO: store dataset norms in a different manner for the cosine metric to avoid the copy here
I don't get it. What's the difference to the inner product here?
We should choose which clusters to probe using cosine distance.
We don't currently have cosine distance for ivf-pq (see rapidsai/cuvs#346), and we also don't have correlation distance support at all. Re-add the metricprocessor code to handle this.
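To illustrate why the distinction matters, here is a toy, hypothetical example (not from the PR): when cluster centers are not unit-norm, ranking by raw inner product and ranking by cosine similarity can pick different clusters to probe.

    // Toy example: inner product vs. cosine ranking of two cluster centers.
    #include <cmath>
    #include <cstdio>

    int main() {
      float q[2]  = {1.f, 0.f};
      float c1[2] = {3.f, 4.f};  // norm 5, poorly aligned with q
      float c2[2] = {1.f, 0.f};  // norm 1, perfectly aligned with q
      auto dot = [](const float* a, const float* b) { return a[0] * b[0] + a[1] * b[1]; };
      auto nrm = [&](const float* a) { return std::sqrt(dot(a, a)); };
      // Inner product prefers c1 (3 > 1); cosine prefers c2 (0.6 < 1.0).
      std::printf("ip : c1=%.1f c2=%.1f\n", dot(q, c1), dot(q, c2));
      std::printf("cos: c1=%.2f c2=%.2f\n",
                  dot(q, c1) / (nrm(q) * nrm(c1)),
                  dot(q, c2) / (nrm(q) * nrm(c2)));
    }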
      batch_labels_view,
      utils::mapping<float>{});

    if (index->metric() == cuvs::distance::DistanceType::CosineExpanded) {
cuvs::cluster::kmeans_balanced::predict already supports the Cosine metric, so there is no need to add normalization and switch to inner product.
Yes, I tried that. I also tried normalizing the cluster centers, but that does not give good recall. I get the best recall when I normalize the inputs and use inner product.
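For reference, the identity behind the normalize-then-inner-product approach: with q' = q / ||q|| and x' = x / ||x||, we have <q', x'> = <q, x> / (||q|| * ||x||) = cos(q, x), so an inner-product search over the normalized vectors reproduces exactly the cosine ranking over the original data. Any recall differences then come from how the (generally non-unit-norm) k-means centers and the PQ codebooks see the normalized vs. raw data, not from the ranking itself.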
@@ -1754,7 +1796,13 @@ auto build(raft::resources const& handle,
      cluster_centers, index.n_lists(), index.dim());
    cuvs::cluster::kmeans::balanced_params kmeans_params;
    kmeans_params.n_iters = params.kmeans_n_iters;
    kmeans_params.metric = static_cast<cuvs::distance::DistanceType>((int)index.metric());
    if (index.metric() == distance::DistanceType::CosineExpanded) {
cuvs::cluster::kmeans_balanced::fit already supports the Cosine metric, so there is no need to add normalization and switch to inner product.
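A minimal sketch of that suggestion, reusing the names from the diff above (the exact fit call is assumed to match the one already used in build()):

    cuvs::cluster::kmeans::balanced_params kmeans_params;
    kmeans_params.n_iters = params.kmeans_n_iters;
    // Pass the cosine metric straight through instead of normalizing the
    // trainset and switching to InnerProduct:
    kmeans_params.metric = static_cast<cuvs::distance::DistanceType>((int)index.metric());
    // ...then call cuvs::cluster::kmeans_balanced::fit with the raw (unnormalized)
    // trainset and cluster_centers, exactly as for the other metrics.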
@@ -137,6 +141,18 @@ void select_clusters(raft::resources const& handle,
        alpha = -1.0;
        beta = 0.0;
      } break;
      case cuvs::distance::DistanceType::CosineExpanded: {
As @achirkin noted, the norms of both the centers and the queries should be accounted for when computing the cosine distance. Right now only the norms of the queries are used, and this can result in the wrong clusters getting selected.
In IVF-Flat: https://github.com/rapidsai/cuvs/blob/branch-24.10/cpp/src/neighbors/ivf_flat/ivf_flat_search.cuh#L166
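Roughly what that adjustment looks like (a hypothetical helper, not the PR's or IVF-Flat's actual code): fold the center norm into the coarse score produced by the GEMM, so that cluster selection ranks by cosine rather than by raw inner product.

    // score for query i and center j, given ip = <query_i, center_j>
    inline float cosine_coarse_score(float ip, float query_norm, float center_norm)
    {
      float denom = query_norm * center_norm;
      // guard against empty / zero-norm centers
      return denom > 0.f ? ip / denom : 0.f;
    }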
I tried to do that, but that gives poorer recall. Simply using inner product to select the clusters to probe gives better recall in the tests.
Among all of the things that I tried, normalizing the dataset and queries and using inner product directly works the best.
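A rough sketch of that approach (not the PR's exact code); it assumes raft::linalg::row_normalize from <raft/linalg/normalize.cuh> with a (handle, in, out, norm_type) signature and float row-major inputs:

    #include <raft/core/resources.hpp>
    #include <raft/core/device_mdspan.hpp>
    #include <raft/linalg/normalize.cuh>

    void l2_normalize_rows(raft::resources const& handle,
                           raft::device_matrix_view<const float, int64_t> in,
                           raft::device_matrix_view<float, int64_t> out)
    {
      // L2-normalize every row; a subsequent inner-product search over the
      // normalized rows then ranks candidates exactly as cosine similarity would.
      raft::linalg::row_normalize(handle, in, out, raft::linalg::L2Norm);
    }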
Thanks @tarang-jain for the updates. I think this is a long-awaited feature, but it's not so urgent to squeeze it in 24.10. I'd suggest we take a bit more time to make sure it has good, well understood performance from day one in the main branch.
If we decide to push this to 24.12, it would be nice to run a few benchmarks to see how cosine metric fares against other metrics and against the cuVS main branch.
    auto float_vec_batch = raft::make_device_mdarray<float, internal_extents_t>(
      handle,
      device_memory,
      raft::make_extents<internal_extents_t>(vec_batch.size(), index->dim()));
    raft::linalg::map(handle,
                      float_vec_batch.view(),
                      utils::mapping<float>{},
                      raft::make_device_matrix_view<const T, internal_extents_t>(
                        vec_batch.data(), vec_batch.size(), index->dim()));
This extra new code adds overhead to the already existing metrics; the extra allocation uses the device memory, which is otherwise carefully accounted for in the calculation above. This means (1) we may see slowdowns in important use cases (e.g. CAGRA build using IVF-PQ), and (2) we may get OOM errors under some conditions.
If an extra allocation here is really unavoidable for the cosine metric, I'd suggest limiting it to this metric only, using batches_mr for the allocation, and then adjusting the estimate of the required workspace size above.
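A sketch of that suggestion, following the names in the surrounding extend() code (batches_mr is assumed to be the memory resource already used for the input batches):

    if (index->metric() == cuvs::distance::DistanceType::CosineExpanded) {
      // Only the cosine path pays for the extra float buffer; drawing it from
      // batches_mr keeps it covered by the workspace-size estimate above.
      auto float_vec_batch = raft::make_device_mdarray<float, internal_extents_t>(
        handle,
        batches_mr,
        raft::make_extents<internal_extents_t>(vec_batch.size(), index->dim()));
      // ...convert / normalize into float_vec_batch and use it only on this branch...
    }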
@achirkin for float datatype, we can normalize in place. We just need the extra memory for the float batch when the data type is uint8 or int8.
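A sketch of that branching (T, vec_batch and float_vec_batch come from the diff above; the rest is assumed):

    if constexpr (std::is_same_v<T, float>) {
      // float input: the rows can be L2-normalized in place (or into an existing
      // working buffer), so the cosine path needs no additional allocation here.
    } else {
      // uint8_t / int8_t input: the batch must first be widened to float, so the
      // float_vec_batch buffer is genuinely needed, but only on this branch.
    }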
Need this for code freeze
/merge