Sparse TSNE #3293

divyegala · 2020-12-11T06:42:10Z

This PR allows TSNE to accept sparse inputs.

It also removes long-standing warnings ptxas warning : Value of threads per SM for entry _ZN2ML4TSNE17IntegrationKernelEfffPfS1_PKfS3_S3_S3_S1_S1_S1_S1_S3_i is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _ZN2ML4TSNE15RepulsionKernelEffPKiS2_PKfS4_S4_PfS5_S5_fiiiS4_S2_ is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _ZN2ML4TSNE18TreeBuildingKernelEPiPKfS3_iiS1_S1_S3_ is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _ZN2ML4TSNE17BoundingBoxKernelEPiS1_PfS2_S2_S2_S2_S2_S2_iiiPjS2_ is out of range. .minnctapersm will be ignored from cuml builds which were caused by invalid parameters to __launch_bounds__ in TSNE kernels.

Furthermore, I also created a class TSNE_runner to handle running separate components of the algorithm as well as to ensure the proper use of RAII buffers and their de-allocation once their use is done, without explicitly deleting those buffers.

closes #2751

codecov-io · 2020-12-11T09:59:23Z

Codecov Report

Merging #3293 (383eca4) into branch-0.18 (ae7e444) will increase coverage by 0.06%.
The diff coverage is 98.41%.

@@               Coverage Diff               @@
##           branch-0.18    #3293      +/-   ##
===============================================
+ Coverage        71.48%   71.55%   +0.06%     
===============================================
  Files              207      207              
  Lines            16750    16787      +37     
===============================================
+ Hits             11974    12012      +38     
+ Misses            4776     4775       -1

Impacted Files	Coverage Δ
python/cuml/manifold/t_sne.pyx	`79.42% <98.30%> (+3.34%)`	⬆️
python/cuml/common/sparsefuncs.py	`91.95% <100.00%> (+0.28%)`	⬆️
...l/_thirdparty/sklearn/preprocessing/_imputation.py	`62.50% <0.00%> (+0.40%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ae7e444...383eca4. Read the comment docs.

…ea-018-sparse_tsne

divyegala · 2020-12-11T19:25:16Z

rerun tests

cjnolet

Really glad to see this change coming in!

Most of the feedback is minor, however adding this feature required that I also parametrize the remaining functions in UMAP so that updating from int64_t and float is straightforward. We should use a parametrized type instead of value_t where possible.

cpp/src/tsne/bh_kernels.cuh

cpp/src/tsne/tsne_runner.cuh

cpp/src/tsne/distances.cuh

cpp/src/tsne/tsne_runner.cuh

cpp/src_prims/sparse/coo.cuh

cpp/src/tsne/bh_kernels.cuh

cjnolet

This looks really good and it's almost there. My main concern is that I don't think all the non-pointer arguments need should be coupled to the 64-bit template types.

cpp/src/tsne/bh_kernels.cuh

cpp/src/tsne/distances.cuh

cpp/src/tsne/exact_kernels.cuh

cpp/src/tsne/tsne_runner.cuh

…ea-018-sparse_tsne

divyegala · 2020-12-22T01:42:59Z

@cjnolet leaving this comment as a reference for post-vacation. I updated this PR with your latest review feedback

divyegala · 2021-01-04T17:00:40Z

rerun tests

cjnolet · 2021-01-07T03:19:46Z

python/cuml/common/sparsefuncs.py

@@ -208,14 +208,16 @@ def extract_knn_graph(knn_graph, convert_dtype=True):
        knn_indices = knn_graph.col

    if knn_indices is not None:
+        convert_to_dtype = None
+        if convert_dtype:
+            convert_to_dtype = np.int32 if sparse else np.int64


It's going to be important to change this when FAISS is updated (and the indices are 32-bit). Referencing relevant issue: #2821

cjnolet

LGTM!

divyegala added 7 commits December 10, 2020 21:48

cpp build and tests working

10f4327

cython bind

47fd303

cython working

194c489

correcting libcuml++ API

d21fa49

style check

2d44291

sparse test

8227366

python style check

758cc13

divyegala requested review from a team as code owners December 11, 2020 06:42

divyegala marked this pull request as draft December 11, 2020 06:42

divyegala added 2 - In Progress Currenty a work in progress CUDA / C++ CUDA issue Cython / Python Cython or Python issue feature request New feature or request non-breaking Non-breaking change labels Dec 11, 2020

divyegala added 2 commits December 11, 2020 00:47

more python style check

a666f3f

more style check...

48f88f5

divyegala added 2 commits December 11, 2020 13:22

adding class runner

af4aa64

Merge branch 'branch-0.18' of https://github.com/rapidsai/cuml into f…

3c35a9f

…ea-018-sparse_tsne

divyegala marked this pull request as ready for review December 11, 2020 19:26

divyegala added 3 - Ready for Review Ready for review by team and removed 2 - In Progress Currenty a work in progress labels Dec 11, 2020

correcting doxygen

24bb192

cjnolet requested changes Dec 11, 2020

View reviewed changes

divyegala added 4 commits December 12, 2020 14:31

addressing some review changes for math_t -> value_t

5ddd4db

merging knn graph in TSNE PR

f0a33a9

style check

7f62912

doxygen

d58fb88

divyegala added 4 - Waiting on Reviewer Waiting for reviewer to review or respond and removed 3 - Ready for Review Ready for review by team labels Dec 13, 2020

divyegala added 5 commits December 14, 2020 15:58

explict templates for knn

7755ce6

runner and distances in templates

c576c60

exact TSNE with template

f878cfd

templates in coo symmetrize

f5fbeb4

templates on barnes hut

c7de8d2

cjnolet requested changes Dec 17, 2020

View reviewed changes

divyegala added 4 commits December 21, 2020 19:35

hyperparams to float32

09dad7a

removing constants from kernel launch bounds

76228b6

Merge branch 'branch-0.18' of https://github.com/rapidsai/cuml into f…

ae6fe0f

…ea-018-sparse_tsne

style check

383eca4

cjnolet reviewed Jan 7, 2021

View reviewed changes

cjnolet approved these changes Jan 7, 2021

View reviewed changes

cjnolet added 6 - Okay to Auto-Merge and removed 4 - Waiting on Reviewer Waiting for reviewer to review or respond labels Jan 7, 2021

rapids-bot bot merged commit 4d2de05 into rapidsai:branch-0.18 Jan 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sparse TSNE #3293

Sparse TSNE #3293

divyegala commented Dec 11, 2020 •

edited

Loading

codecov-io commented Dec 11, 2020 •

edited

Loading

divyegala commented Dec 11, 2020

cjnolet left a comment

cjnolet left a comment

divyegala commented Dec 22, 2020

divyegala commented Jan 4, 2021

cjnolet Jan 7, 2021

cjnolet left a comment

Sparse TSNE #3293

Sparse TSNE #3293

Conversation

divyegala commented Dec 11, 2020 • edited Loading

codecov-io commented Dec 11, 2020 • edited Loading

Codecov Report

divyegala commented Dec 11, 2020

cjnolet left a comment

Choose a reason for hiding this comment

cjnolet left a comment

Choose a reason for hiding this comment

divyegala commented Dec 22, 2020

divyegala commented Jan 4, 2021

cjnolet Jan 7, 2021

Choose a reason for hiding this comment

cjnolet left a comment

Choose a reason for hiding this comment

divyegala commented Dec 11, 2020 •

edited

Loading

codecov-io commented Dec 11, 2020 •

edited

Loading