Add MaskedL2NN #838

ahendriksen · 2022-09-22T13:16:47Z

This PR adds the sparseL2NN functionality.

This enables faster computing pairwise distances by making use of sparsity in the problem: the computation of distances between point pairs can be skipped.

The sparsity between arrays of points X and Y is expressed as follows:

X is split into rows (points)
Y is split into contiguous groups of points (i.e. all points in a group are adjacent)
A boolean adjacency matrix indicates for each row of X and each group in Y whether to compute the distance.

To speed up computation, the adjacency matrix is compressed into a bitfield.
To ensure competitive speeds, the caller must make sure that consecutive rows in X are adjacent to the same groups in Y (as much as possible) to enable efficient skipping in the kernel.

Some work is still TODO:

Flesh out documentation
Discuss / remove allocation of intermediate array
Optimize for skinny matrices by using a different KernelPolicy.

cjnolet · 2022-11-09T20:00:04Z

@ahendriksen, it's getting very close to burndown for 22.12. Mind if we bump this forward to 23.02? Do you think you might have something working by then?

ahendriksen · 2022-11-10T06:23:22Z

Yes that is okay.

…

On Wed, Nov 9, 2022, at 9:00 PM, Corey J. Nolet wrote: @ahendriksen <https://github.com/ahendriksen> mind if we bump this forward to 23.02? Do you think you might have something working by then? — Reply to this email directly, view it on GitHub <#838 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AA72YFTLXXO3A5U7QTFWVTDWHP7FBANCNFSM6AAAAAAQTBOGFU>. You are receiving this because you were mentioned.Message ID: ***@***.***>

cjnolet · 2022-12-05T20:05:51Z

Hey @ahendriksen, we are still really excited about this feature. I was looking through your changes a bit mostly to see where we are with it. I have a few review items but I've held off from submitting the review until you feel this is in a good place to do so.

To ensure competitive speeds, the caller must make sure that consecutive rows in X are adjacent to the same groups in Y (as much as possible) to enable efficient skipping in the kernel.

This won't be easy to do in an algorithm like HDBSCAN without modifying the user's input data, though I understand the importance of memory co-location like coalescing the reads. The pattern that this PR solves pops up quite a bit, though. Do you know the expected perf hit from not co-locating the data points w/ the groups? Is it pretty much all from coalescing?

ahendriksen · 2022-12-12T12:17:07Z

Hi @cjnolet,

Feel free to add your comments!

To ensure competitive speeds, the caller must make sure that consecutive rows in X are adjacent to the same groups in Y (as much as possible) to enable efficient skipping in the kernel.

This won't be easy to do in an algorithm like HDBSCAN without modifying the user's input data, though I understand the importance of memory co-location like coalescing the reads. The pattern that this PR solves pops up quite a bit, though. Do you know the expected perf hit from not co-locating the data points w/ the groups? Is it pretty much all from coalescing?

Coalescing is absolutely necessary.
I ran a representative benchmark where roughly 99% of distance calculations could be skipped. In the theoretical best case, sparseL2NN should have taken 1% of the time of fusedL2NN. In practice, the relative performance was

~98% without coalescing (sorting input points);
~10% with coalescing.

There will be a cut off point before which it does not make sense to sort the inputs. However, since the number of distance calculations scales quadratically with the number of input points, sorting will be worth it especially for medium to large data sizes.

As you suggest, the sorting may even be done in place and reverted after the sparseL2NN call (rather than allocating a separate buffer with sorted inputs).

ajschmidt8 · 2023-01-11T14:26:03Z

Please use GitHub's Draft PR feature instead of WIP tags in the future. Draft PRs have the benefit of preventing notifications to codeowners until PRs are marked Ready for Review, which helps cut down on excessive notifications for PRs that are still being worked on. CI will still run on Draft PRs.

Some useful information about Draft PRs:

tfeher

Thanks Allard for addressing the issues! The PR looks good to me!

cjnolet · 2023-01-25T22:39:18Z

@ahendriksen , the PR looks great. I have one additional ask, though. Can you create Github issues for these remaining items just so it stays on our radar?

I did not get around to:

Exposing compress_to_bits in raft/utils.
Adding an overload the accepts a bitfield.

cjnolet · 2023-01-26T00:28:16Z

@ahendriksen one thing that I've noticed is that this PR and #837 both seem to be consistently timing out in CI while there are some other PRs that don't seem to be timing out. This makes me a little nervous about immediately merging these right before burndown. In case this behavior is isolated to this PR, I want to be careful not to cause issues for other PRs during this release.

We still have some time before code freeze to merge these, but it would help if you are able to verify locally that the build time (and ideally memory requirement during building) doesn't seem to be drastically affected by these changes. Maybe compare the build time for your changes against branch-23.02 just to see? It could be something super simple- like a pre-compiled template instantiation might no longer be getting used which could be causing things to get re-compiled more times from scratch.

cjnolet

Holding off on merging right away so we can investigate the CI timeouts.

cjnolet · 2023-01-26T11:10:44Z

@ahendriksen can you please update this PR to accept raft::device_resources everywhere instead of raft::handle_t?

…riksen/raft into wip-move-contractions-tiling-logic

ahendriksen · 2023-01-27T11:17:38Z

@ahendriksen can you please update this PR to accept raft::device_resources everywhere instead of raft::handle_t?

Done.

ahendriksen · 2023-01-27T11:26:02Z

@ahendriksen , the PR looks great. I have one additional ask, though. Can you create Github issues for these remaining items just so it stays on our radar?

I have created issues:

cjnolet · 2023-01-27T20:05:32Z

/merge

This reverts commit 2fb5c06.

ahendriksen requested review from a team as code owners September 22, 2022 13:16

github-actions bot added CMake cpp labels Sep 22, 2022

ahendriksen added enhancement New feature or request 2 - In Progress Currenty a work in progress non-breaking Non-breaking change and removed CMake labels Sep 22, 2022

ahendriksen force-pushed the enh-sparse-l2-nn branch from cfee44d to 90abf67 Compare October 5, 2022 14:28

github-actions bot added the CMake label Oct 5, 2022

cjnolet assigned ahendriksen Nov 1, 2022

ahendriksen force-pushed the enh-sparse-l2-nn branch from 90abf67 to efa8056 Compare January 11, 2023 13:33

ahendriksen requested review from a team as code owners January 11, 2023 13:33

github-actions bot added gpuCI python labels Jan 11, 2023

ahendriksen changed the base branch from branch-22.10 to branch-23.02 January 11, 2023 13:34

ahendriksen added the improvement Improvement / enhancement to an existing function label Jan 11, 2023

ahendriksen force-pushed the enh-sparse-l2-nn branch from efa8056 to 6cdcb98 Compare January 11, 2023 14:08

github-actions bot removed the python label Jan 11, 2023

ajschmidt8 marked this pull request as draft January 11, 2023 14:26

ahendriksen requested a review from tfeher January 25, 2023 20:59

cjnolet added 2 commits January 25, 2023 16:00

Merge branch 'branch-23.02' into enh-sparse-l2-nn

3e86024

Merge branch 'branch-23.02' into wip-move-contractions-tiling-logic

ba6491a

tfeher approved these changes Jan 25, 2023

View reviewed changes

cjnolet requested changes Jan 26, 2023

View reviewed changes

cjnolet and others added 12 commits January 26, 2023 10:10

Forcing sccache reinit.

e52b0f9

Merge branch 'branch-23.02' into wip-move-contractions-tiling-logic

34eb76a

Breaking specializations for refine into individual files

85c6294

Checking in

0fad842

Including just the refine specialization

f7788af

Merge branch 'branch-23.02' into wip-move-contractions-tiling-logic

e626101

Proper import of speicalizations

9e7b729

Merge branch 'wip-move-contractions-tiling-logic' of github.com:ahend…

9e4b5f3

…riksen/raft into wip-move-contractions-tiling-logic

Remove SCCACHE_RECACHE from build.sh

060e62c

Merge branch 'wip-move-contractions-tiling-logic' into enh-sparse-l2-nn

2870b67

Small compilation error remains

2b0c02b

Take device_resources instead of handle

1e83640

This was referenced Jan 27, 2023

[FEA] Expose compress to bits in raft/utils #1195

Open

[FEA] Have maskedL2NN accept bitfields directly #1196

Open

Merge remote-tracking branch 'rapids/branch-23.02' into enh-sparse-l2-nn

862e8f6

cjnolet approved these changes Jan 27, 2023

View reviewed changes

rapids-bot bot merged commit 2fb5c06 into rapidsai:branch-23.02 Jan 27, 2023

cjnolet added a commit to cjnolet/raft that referenced this pull request Feb 2, 2023

Revert "Add MaskedL2NN (rapidsai#838)"

bff0f16

This reverts commit 2fb5c06.

ahendriksen deleted the enh-sparse-l2-nn branch March 17, 2023 09:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MaskedL2NN #838

Add MaskedL2NN #838

ahendriksen commented Sep 22, 2022

cjnolet commented Nov 9, 2022 •

edited

Loading

ahendriksen commented Nov 10, 2022 via email

cjnolet commented Dec 5, 2022

ahendriksen commented Dec 12, 2022

ajschmidt8 commented Jan 11, 2023

tfeher left a comment

cjnolet commented Jan 25, 2023 •

edited

Loading

cjnolet commented Jan 26, 2023

cjnolet left a comment

cjnolet commented Jan 26, 2023

ahendriksen commented Jan 27, 2023

ahendriksen commented Jan 27, 2023

cjnolet commented Jan 27, 2023

Add MaskedL2NN #838

Add MaskedL2NN #838

Conversation

ahendriksen commented Sep 22, 2022

cjnolet commented Nov 9, 2022 • edited Loading

ahendriksen commented Nov 10, 2022 via email

cjnolet commented Dec 5, 2022

ahendriksen commented Dec 12, 2022

ajschmidt8 commented Jan 11, 2023

tfeher left a comment

Choose a reason for hiding this comment

cjnolet commented Jan 25, 2023 • edited Loading

cjnolet commented Jan 26, 2023

cjnolet left a comment

Choose a reason for hiding this comment

cjnolet commented Jan 26, 2023

ahendriksen commented Jan 27, 2023

ahendriksen commented Jan 27, 2023

cjnolet commented Jan 27, 2023

cjnolet commented Nov 9, 2022 •

edited

Loading

cjnolet commented Jan 25, 2023 •

edited

Loading