-
Notifications
You must be signed in to change notification settings - Fork 549
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The support for "precomputed" in DBSCAN [FEA] #3302
Comments
This should be possible. But would like to hear thoughts from @Nyrio who's currently working on dbscan. @WangWenhao0716 I'm assuming you'd be interested in passing a dense distance matrix as input? |
@teju85 Thanks a lot! The assumption is right! |
Looking forward to good news. |
I agree with @teju85. It is possible to support that in our implementation. |
@Nyrio Thanks, is there a timetable to realize this feature? |
any update? |
@WangWenhao0716 @Jie2World sorry for the delayed response. We can get this implemented in 0.19 release timeframe. Does that sound ok to you? |
OK Thanks a lot!
发自我的iPhone
…------------------ Original ------------------
From: Thejaswi. N. S <[email protected]>
Date: Tue,Jan 12,2021 8:02 PM
To: rapidsai/cuml <[email protected]>
Cc: Wenhao Wang <[email protected]>, Mention <[email protected]>
Subject: Re: [rapidsai/cuml] The support for "precomputed" in DBSCAN [FEA] (#3302)
@WangWenhao0716 @Jie2World sorry for the delayed response. We can get this implemented in 0.19 release timeframe. Does that sound ok to you?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
Thanks. You made my day. |
This issue has been labeled |
This is being targeted for 0.19 release and @Nyrio is working towards it. So, certainly not stale. |
The 0.19 nightly version is online, and I want to know whether or when the "precomputed" choice will appear in this nightly version? Thanks a lot! |
@WangWenhao0716 the work has currently not started, but for sure we'll be filing PR during this month. We'll update this issue once this feature hits nightly builds. |
@teju85 Thanks a lot for your contribution. |
Closes #3302 ## Notes about performance If we don't count the cost of pre-computing the distance matrix (which is done by the user), the single-GPU code runs slightly faster when the distance matrix is pre-computed. (note: this is 2d, greater speedups expected for larger dimensions!) ![dbscan_precomputed_sg](https://user-images.githubusercontent.com/17441062/110022273-01f81280-7d2c-11eb-859f-929b691d76d5.png) ![dbscan_precomputed_timeline](https://user-images.githubusercontent.com/17441062/110022288-058b9980-7d2c-11eb-968c-eca35898f659.png) As I have stated in a comment in the code, it works with two kernels: one that uses a coalesced reduction to compute the vertex degrees from the distance matrix, and one that uses a 2D copy fused with an unary operation to get the boolean neighborhood matrix. _**Note:** the performance of this step could be better if `adj` was a row-major B*N matrix instead of column-major. We could fuse everything into one efficient kernel. It is something to keep in mind when we re-write `csr_adj_graph_batched`._ ## Notes about MNMG Cf #3615 Authors: - Louis Sugy (@Nyrio) Approvers: - Tamas Bela Feher (@tfeher) - Thejaswi. N. S (@teju85) - Dante Gama Dessavre (@dantegd) URL: #3585
@WangWenhao0716 and @Jie2World this feature is now in 0.19 branch (Thanks @Nyrio). Can you folks try this out and give us feedback? |
Thanks, I may try it. However, because I’m no longer work on this part and I’m also no longer in the before research group, it may be a little difficult for me.
发自我的iPhone
…------------------ Original ------------------
From: Thejaswi. N. S ***@***.***>
Date: Tue,Mar 30,2021 10:55 AM
To: rapidsai/cuml ***@***.***>
Cc: Wenhao Wang ***@***.***>, Mention ***@***.***>
Subject: Re: [rapidsai/cuml] The support for "precomputed" in DBSCAN [FEA] (#3302)
@WangWenhao0716 and @Jie2World this feature is now in 0.19 branch (Thanks @Nyrio). Can you folks try this out and give us feedback?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
It can be observed from sklearn-dbscan that the metric can be “precomputed” and X can be a distance matrix. However, currently, cuML does not support this "precomputed" metric.
I want to know whether this feature will be added to a future version of cuML?
Thanks.
The text was updated successfully, but these errors were encountered: