-
Notifications
You must be signed in to change notification settings - Fork 548
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Gaussian Naive Bayes #4079
Add Gaussian Naive Bayes #4079
Conversation
Conflicts: python/cuml/naive_bayes/naive_bayes.py
…parts that do work.
…when fitting all classes at once instead of one at a time.
… little differently than sklearn
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very nice work. Only 1 small nitpick and I think this is ready to go.
Parameters | ||
---------- | ||
|
||
X : {array-like, cupy sparse matrix} of shape (n_samples, n_features) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We might want to make a mention here that the "optimal" input for the sparse matrix is COOrdinate format, otherwise it will be copied in order to convert to that format.
I updated the target branch to 21.10 so we can see CI pass |
rerun tests |
CI failed on 3 tests of Nearest Neighbors due to timeouts:
|
rerun tests |
Codecov Report
@@ Coverage Diff @@
## branch-21.10 #4079 +/- ##
===============================================
Coverage ? 85.96%
===============================================
Files ? 232
Lines ? 18500
Branches ? 0
===============================================
Hits ? 15904
Misses ? 2596
Partials ? 0
Flags with carried forward coverage won't be shown. Click here to find out more. Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
@gpucibot merge |
This is a continuation of PR #1763, #4053, and #4079, to add Categorical Naive Bayes. This is supposed to be merged after #4079. Linking issue #1666. Authors: - Micka (https://github.com/lowener) Approvers: - Corey J. Nolet (https://github.com/cjnolet) URL: #4150
This is a continuation of PR rapidsai#1763 and rapidsai#4053, to add Gaussian Naive Bayes. This is supposed to be merged after rapidsai#4053 Here is a comparison of cuML and SKLearn performance on Gaussian NB. This is done using a synthetic dataset generated by make_regression. The GPU used is a RTX 8000, and the CPU is i9-10920X @ 3.50GHz ![gaussian](https://user-images.githubusercontent.com/9810050/126572439-8982faa8-5ad1-4bca-91ab-76704050bf33.png) Linking issue rapidsai#1666 Authors: - Micka (https://github.com/lowener) - Corey J. Nolet (https://github.com/cjnolet) Approvers: - Corey J. Nolet (https://github.com/cjnolet) URL: rapidsai#4079
This is a continuation of PR rapidsai#1763, rapidsai#4053, and rapidsai#4079, to add Categorical Naive Bayes. This is supposed to be merged after rapidsai#4079. Linking issue rapidsai#1666. Authors: - Micka (https://github.com/lowener) Approvers: - Corey J. Nolet (https://github.com/cjnolet) URL: rapidsai#4150
This is a continuation of PR #1763 and #4053, to add Gaussian Naive Bayes.
This is supposed to be merged after #4053
Here is a comparison of cuML and SKLearn performance on Gaussian NB.
This is done using a synthetic dataset generated by make_regression.
The GPU used is a RTX 8000, and the CPU is i9-10920X @ 3.50GHz
Linking issue #1666