Preparing sparse primitives for movement to RAFT #3157

cjnolet · 2020-11-18T23:40:58Z

This PR contains the initial steps to move many of the sparse prims API over to raft, including:

Adjusting MLCommon::Sparse namespaces to raft::sparse
Breaking csr/coo prims into multiple files (e.g. linalg, components, matrix, etc...)
Using RAFT namespaces for raft componentry used within the sparse prims, such as device_buffer and deviceAllocator.
Use RAFT handle in public API
Closes [FEA] Move sparse prims to RAFT #3106

… 10.2 version

…_knn Conflicts: cpp/src_prims/sparse/cusparse_wrappers.h

…l swap them out shortly)

…ce algorithm to make pieces more reusable

Conflicts: CHANGELOG.md

…version seems super expensive, but maybe it's necessary.

cjnolet · 2020-12-17T14:26:17Z

@divyegala This is ready for re-review when you get a moment. No rush!

I started scraping through the prims to make them all use raft::handle_t but I think this could be done more on a case-by-case basis as necessary. I personally wouldn't mind seeing more consistency across the prims in RAFT, but many of the current dense prims don't take a handle_t yet either.

divyegala

LGTM! I like the breaking of sparse tests

divyegala · 2020-12-22T01:58:11Z

@cjnolet dense prims in RAFT are following need-to-know for the handle. So, if the prim needs a handle it takes one and otherwise not

cjnolet · 2020-12-22T03:05:00Z

@divyegala Thanks! Case-by-case sounds fine to me, so long as the arguments aren't creating redundancy that will lead to unexpected side-effects (e.g. by taking both a handle and a separate stream). I'll submit a PR to clean those up at some point.

divyegala · 2020-12-22T03:19:37Z

@cjnolet @teju85 suggested that we still keep a separate stream parameter, so that dev can decide which stream to run the prim on

cjnolet · 2020-12-22T03:34:44Z

@teju85, @divyegala In that case, we should continue the discussion about this publicly (and after we return from the holidays, please). I'd like to avoid the cases where the streams are used inconsistently.

teju85 · 2020-12-22T06:16:13Z

At primitives level, we should certainly have everyone of it accept a separate stream parameter in its args-list, for maximum composability, because it's possible that the caller wants to call this prim with a separate stream other than found in handle_t::get_stream(). Sure, I'm happy to discuss about this after holidays.

divyegala

Just one comment which I am okay with having an issue created for later update once this PR goes in! Otherwise, LGTM, from my previous review

cpp/src_prims/sparse/linalg/symmetrize.cuh

cjnolet · 2021-01-15T21:42:10Z

rerun tests

@cjnolet

This Pull Request adds initial support for multi-node multi-GPU DBSCAN, and fixes the bugs identified in #3094. It works by copying the dataset on all the workers and giving ownership of a subset of points to each one. The workers compute a partial clustering with the knowledge of the relationships between their points and the rest of the dataset, and the partial clusterings are merged to form the final labeling. This merging algorithm is also used to accumulate the results in case a batch-wise approach is used on a worker to limit the memory consumption. The multi-GPU implementation gives great speedups for large datasets, while for small datasets the performance is dominated by the Dask launch overhead, as shown in the figure below: ![mnmg_dbscan_perf](https://user-images.githubusercontent.com/17441062/104958437-55a6da80-59d0-11eb-8a18-fcca0d69c41b.png) Notes: - I have renamed variables in the DBSCAN implementation to match our style conventions (snake case). Sorry for the noise that it adds to this PR. - I refactored some CSR tests to accept multiple test cases instead of hardcoded ones, in order to add corner cases to weak CC. PR #3157 by @cjnolet changed the location of these tests, so I moved those that I had already refactored accordingly. At the moment only the tests that were in `cpp/test/prims/csr.cu` previously have been refactored. I was thinking that the others can be refactored later, I'd like @cjnolet's opinion on this refactoring. - Regarding testing, the MNMG tests are mostly a copy of the single-GPU ones, though I removed a few tests with very small datasets to avoid problems with MNMG (it doesn't really support the edge case where a worker owns 0 sample, as I think it's a fair assumption that MNMG DBSCAN isn't used with such a tiny dataset). - Also regarding tests, I changed the comparison function to account for the fact that border points are ambiguous. It assumes that the labeling of core points is minimal in both our implementation and the reference, so if this assumption changes we will need to update the tests accordingly. If you want to access a pseudo-code description and proof of the new algorithm, feel free to contact me. Tagging people to whom this PR is relevant: @teju85 @tfeher @MatthiasKohl @canonizer Authors: - Louis Sugy (@Nyrio) Approvers: - Tamas Bela Feher (@tfeher) - Corey J. Nolet (@cjnolet) URL: #3382

cjnolet added 30 commits July 24, 2020 15:05

Adding brute force knn shell to sparse

4dbabf4

Stubbing out algorithm flow

46b2f09

Adding initial headers to wrapper

03dbdb8

Performing idx batching

86ad78a

Starting to full in cusparse calls

b50611c

Checking in

33f7e6a

Beginning to add selection kernel

0342eb4

Finished header

a3ead77

Updates. Need to finish populating merge buffer

db4ff22

Using block select for selecting k and using 3-partition merge buffer

7752e8e

Logic is just about done.

363b433

Merge branch 'branch-0.15' into fea-016-sparse_knn

9d70a69

Checking in changes. Need to swap out cuda 11 cusparse calls for cuda…

29e82af

… 10.2 version

Merge branch 'branch-0.15' into fea-016-sparse_knn

2ee8de2

Everything is building. Need end-to-end test

6f9f6d5

Running clang format

8c9dfb3

Updating changelog

87478ad

Using raft's cusparse_wrappers.h instead of cuml

1bb9c0e

Merge branch 'imp-016-use_raft_cusparse_wrappers' into fea-016-sparse…

a65f378

…_knn Conflicts: cpp/src_prims/sparse/cusparse_wrappers.h

Removing cuda11-required GEMM calls (commenting them out for now, wil…

fa2e64d

…l swap them out shortly)

Fixing clang style

8789378

Separating distance computation from selection from general brute for…

1db4d4d

…ce algorithm to make pieces more reusable

Updating clang style

6054863

Adding batcher to help ease batch state management

d83f523

Fixing clang style

763d3c1

MOre clang fixes

18b9891

Merge branch 'branch-0.16' into fea-016-sparse_knn

82aa14b

Conflicts: CHANGELOG.md

IP distance is computed using search * index.T.

eabe273

Making type template for value_t all the way through knn_merge_parts

9a9ada6

Adding simple googletest for sparse pairwise dists. The transpose con…

6d45c91

…version seems super expensive, but maybe it's necessary.

Updates from review feedback

3b84eee

cjnolet added 4 - Waiting on Reviewer Waiting for reviewer to review or respond and removed 2 - In Progress Currenty a work in progress labels Dec 17, 2020

divyegala approved these changes Dec 22, 2020

View reviewed changes

cjnolet added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 4 - Waiting on Reviewer Waiting for reviewer to review or respond labels Dec 22, 2020

cjnolet added 2 commits January 14, 2021 08:59

Merge branch 'branch-0.18' into fea-017-raft_sparse_prims

928bcde

Fixing merge issues

6d692c6

github-actions bot added CMake libcuml labels Jan 14, 2021

JohnZed added 6 - Okay to Auto-Merge and removed 5 - Ready to Merge Testing and reviews complete, ready to merge labels Jan 14, 2021

divyegala approved these changes Jan 14, 2021

View reviewed changes

cpp/src_prims/sparse/linalg/symmetrize.cuh Outdated Show resolved Hide resolved

Updates based on review feedback

ccaae61

cjnolet mentioned this pull request Jan 15, 2021

Sparse KNN optimization for large column sizes #3345

Closed

5 tasks

JohnZed approved these changes Jan 16, 2021

View reviewed changes

rapids-bot bot merged commit d72c54a into rapidsai:branch-0.18 Jan 16, 2021

Nyrio mentioned this pull request Jan 18, 2021

MNMG DBSCAN #3382

Merged

JohnZed mentioned this pull request Jan 25, 2021

[FEA] Break sparse prims into separate directories #1099

Closed

cjnolet mentioned this pull request Feb 5, 2021

[WIP] Sparse semiring follow-on & optimizations #3468

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preparing sparse primitives for movement to RAFT #3157

Preparing sparse primitives for movement to RAFT #3157

cjnolet commented Nov 18, 2020 •

edited

Loading

cjnolet commented Dec 17, 2020

divyegala left a comment

divyegala commented Dec 22, 2020

cjnolet commented Dec 22, 2020

divyegala commented Dec 22, 2020

cjnolet commented Dec 22, 2020

teju85 commented Dec 22, 2020

divyegala left a comment

cjnolet commented Jan 15, 2021

Preparing sparse primitives for movement to RAFT #3157

Preparing sparse primitives for movement to RAFT #3157

Conversation

cjnolet commented Nov 18, 2020 • edited Loading

cjnolet commented Dec 17, 2020

divyegala left a comment

Choose a reason for hiding this comment

divyegala commented Dec 22, 2020

cjnolet commented Dec 22, 2020

divyegala commented Dec 22, 2020

cjnolet commented Dec 22, 2020

teju85 commented Dec 22, 2020

divyegala left a comment

Choose a reason for hiding this comment

cjnolet commented Jan 15, 2021

cjnolet commented Nov 18, 2020 •

edited

Loading