[FEA] reduce memory pressure in membership vector computation #5268
Conversation
      break;
    case raft::distance::DistanceType::CosineExpanded:
      raft::distance::
        distance<raft::distance::DistanceType::CosineExpanded, value_t, value_t, value_t, int>(
-         handle, X, exemplars_dense.data(), dist.data(), m, n_exemplars, n, true);
+         handle, query + batch_offset * n, exemplars_dense.data(), dist.data(), samples_per_batch, n_exemplars, n, true);
Very nice! This is a good solution to provide some significant savings in memory.
These changes LGTM at a glance but I'm going to request changes for now and give a more in-depth review after #5247 is merged.
Looking good! Just a couple minor things, really.
        had ``prediction_data=True`` set.

    batch_size : int, optional, default=0
        Distance based membership is computed in batches to fit on the device. If not specified, or set to 0, distance based membership is computed at once for all points in the training data.
I think this could be worded a little bit better to highlight the purpose of this setting, maybe mention "points in the training data" closer to the beginning? Something like "Lowers memory requirement by computing distance-based membership in smaller batches of points in the training data. Batch size of 0 uses all of the training points, batch size of 1000 computes distances for 1000 points at a time."
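For context, here is a hedged usage sketch of how this parameter might look from the Python side. The module path, function name, and constructor arguments below are assumptions based on cuML's HDBSCAN prediction API, not text taken from this PR:

```python
import cupy as cp
from cuml.cluster.hdbscan import HDBSCAN, all_points_membership_vectors

X = cp.random.random((100_000, 32)).astype(cp.float32)

# prediction_data=True is required so that exemplars/prediction data exist
clusterer = HDBSCAN(min_cluster_size=50, prediction_data=True).fit(X)

# batch_size=0 (the default) computes distance-based membership for all
# training points at once; batch_size=1000 computes distances for 1000
# points at a time, lowering peak device memory at a small runtime cost.
memberships = all_points_membership_vectors(clusterer, batch_size=1000)
```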
        n_prediction_points,
        clusterer.min_samples,
        _metrics_mapping[clusterer.metric],
        <float*> membership_vec_ptr,
Just to avoid any potential future issues, we should probably validate that this is actually positive (in Python) before we pass it down.
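A minimal sketch of the kind of Python-side guard being suggested; the helper name and the exact policy (whether 0 keeps its documented "use all points" meaning, whether oversized values are clamped) are assumptions, not decisions made in this PR:

```python
def _validate_batch_size(batch_size, n_rows):
    # Assumption: 0 keeps its documented meaning of "no batching".
    if batch_size < 0:
        raise ValueError(f"batch_size must be non-negative, got {batch_size}")
    if batch_size == 0 or batch_size > n_rows:
        return n_rows  # fall back to a single batch over all points
    return batch_size
```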
/merge
Batching is done only while computing the pairwise distance matrix between the dataset and the set of exemplar points.
Closes #4879
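For illustration only, a small NumPy sketch of the batching idea described above: the full n_points x n_exemplars distance matrix is never materialized, only one batch-sized block at a time. This is a stand-in for the idea, not the RAFT/cuML implementation, and the reduction applied to each block (here a simple row-wise minimum) is a placeholder for the actual membership computation.

```python
import numpy as np

def batched_exemplar_distances(points, exemplars, batch_size=1000):
    """Compute per-point statistics against the exemplar set in batches,
    so only a (batch_size, n_exemplars) distance block is live at once."""
    n = points.shape[0]
    if batch_size <= 0:
        batch_size = n  # 0 means "no batching", mirroring the PR's default
    out = np.empty(n, dtype=points.dtype)
    for start in range(0, n, batch_size):
        batch = points[start:start + batch_size]
        # (len(batch), n_exemplars) block: the only large temporary needed
        block = np.linalg.norm(batch[:, None, :] - exemplars[None, :, :], axis=2)
        # placeholder reduction; the real code derives membership from the block
        out[start:start + batch_size] = block.min(axis=1)
    return out
```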