Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] Soft clustering with HDBSCAN #4467

Closed
jhfoxliu opened this issue Dec 31, 2021 · 5 comments
Closed

[FEA] Soft clustering with HDBSCAN #4467

jhfoxliu opened this issue Dec 31, 2021 · 5 comments
Assignees
Labels
? - Needs Triage Need team to review and classify feature request New feature or request inactive-30d inactive-90d

Comments

@jhfoxliu
Copy link

jhfoxliu commented Dec 31, 2021

Is your feature request related to a problem? Please describe.
I wish using HDBSCAN soft clustering method (which is implemented in the CPU version, see https://hdbscan.readthedocs.io/en/latest/soft_clustering.html) with cuML.

Describe the solution you'd like
It is the best to update a "all_points_membership_vectors" function the same as its CPU version.

Describe alternatives you've considered
It's can be OK if the attributes (prediction_data, raw_tree, etc.) necessary for soft clustering are provided. With that attributes, I can use CPU version to compute the values. However, they are missing in cuML.

Additional context

@jhfoxliu jhfoxliu added ? - Needs Triage Need team to review and classify feature request New feature or request labels Dec 31, 2021
@github-actions
Copy link

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

@markselias
Copy link

I would be interested in this as well.

@github-actions
Copy link

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

@github-actions
Copy link

This issue has been labeled inactive-90d due to no recent activity in the past 90 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

@beckernick
Copy link
Member

With the merge of #4800, soft clustering the original dataset with all_points_membership_vectors is now available. Please give it a try and file issues if you run into any issues or have any feedback.

Closing this issue as resolved. Soft-clustering a new set of points with membership_vector is being tracked in #4724

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
? - Needs Triage Need team to review and classify feature request New feature or request inactive-30d inactive-90d
Projects
None yet
Development

No branches or pull requests

4 participants