[REVIEW] Update Mixin classes and include in estimators #2411

beckernick · 2020-06-12T19:42:53Z

This PR:

- Moves the single GPU Mixin classes to common.base
- Updates the set of regressors using RegressorMixin to point to the updated namespace
- ~~Updates the Mixin classes to use the new CumlArray (may not be necessary)~~
- Adds Mixin classes to additional estimators (classifiers and regressors)
- Adds clf.score via the Mixin class to every estimator and removes newly redundant score methods (does not remove score methods that use an optimized libcuml implementation, such as random forest)
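
To illustrate the last point, here is a minimal, library-free sketch of how a mixin can supply `score` to an estimator while still letting another estimator keep its own implementation. The class names `ThresholdClassifier` and `OptimizedClassifier` are hypothetical, and this `ClassifierMixin` is a plain-Python stand-in, not the code in this PR.

```python
class ClassifierMixin:
    """Hypothetical stand-in for a classifier mixin (not cuML's code)."""

    def score(self, X, y, **kwargs):
        # Accuracy: fraction of predictions that match the true labels.
        preds = self.predict(X, **kwargs)
        return sum(p == t for p, t in zip(preds, y)) / len(y)


class ThresholdClassifier(ClassifierMixin):
    """Toy estimator that inherits score() instead of defining its own."""

    def __init__(self, threshold=0.5):
        self.threshold = threshold

    def predict(self, X):
        return [int(x > self.threshold) for x in X]


class OptimizedClassifier(ThresholdClassifier):
    """Toy estimator that keeps its own score(), the way an estimator
    backed by a faster native implementation might."""

    def score(self, X, y, **kwargs):
        # Placeholder for a call into an optimized backend.
        self.used_custom_score = True
        return super().score(X, y, **kwargs)


clf = ThresholdClassifier()
print(clf.score([0.1, 0.9, 0.4, 0.7], [0, 1, 1, 1]))  # 3 of 4 correct
```

Because Python resolves `score` through the method resolution order, an estimator that defines its own `score` takes precedence over the mixin without any extra wiring.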

Related to #2401. This closes #2393.

As @dantegd noted offline (summarized in #2393 (comment)), it would be nice to update the single GPU Mixin classes to ensure they're working well and use them for inheritance.

GPUtester · 2020-06-12T19:43:22Z

Please update the changelog in order to start CI tests.

View the gpuCI docs here.

beckernick · 2020-06-15T17:35:12Z

This is now ready for review.

beckernick · 2020-06-15T17:35:18Z

rerun tests

cjnolet

Glad to see this being used across the board. Very minor things.

cjnolet · 2020-06-16T16:25:21Z

python/cuml/common/base.pyx

+            handle = None
+
+        preds = self.predict(X)
+        return r2_score(y, preds, handle=handle)
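
For readers unfamiliar with the pattern in this hunk, the following is a rough sketch of a regressor mixin whose `score` calls `predict` and returns the coefficient of determination. It is written in plain Python under stated assumptions: this `r2_score` is a hand-written stand-in rather than cuML's metric, the `handle` argument is omitted, and `ScaledRegressor` with its `offset` keyword is hypothetical. Keyword arguments are forwarded to `predict`, in line with the commit "actually pass kwargs down to the predict method in score".

```python
def r2_score(y_true, y_pred):
    """Plain-Python R^2: 1 - SS_res / SS_tot.

    Assumes y_true is not constant (otherwise SS_tot is zero).
    """
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


class RegressorMixin:
    """Hypothetical stand-in for a regressor mixin (not cuML's code)."""

    def score(self, X, y, **kwargs):
        # Forward any extra keyword arguments to predict().
        preds = self.predict(X, **kwargs)
        return r2_score(y, preds)


class ScaledRegressor(RegressorMixin):
    """Toy estimator: predicts coef * x + offset."""

    def __init__(self, coef):
        self.coef = coef

    def predict(self, X, offset=0.0):
        return [self.coef * x + offset for x in X]


reg = ScaledRegressor(2.0)
print(reg.score([1, 2, 3], [2, 4, 6]))              # perfect fit
print(reg.score([1, 2, 3], [2, 4, 6], offset=1.0))  # biased predictions
```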


Should we increase compatibility by also overriding the _more_tags() method? https://github.com/scikit-learn/scikit-learn/blob/fd237278e/sklearn/base.py#L501

I'm happy to implement this, though I wonder if it might be worth holding off unless there's a compelling reason. Do you know what this is generally used for in sklearn, @cjnolet? It's not clear to me, but I'm not well versed in the sklearn internals.

This was really a genuine question on my part. If you don’t see a need for this yet, then I’m fine holding off.

I just happened to notice this was the only difference between our mixins and those in Scikit-learn so I figured I’d ask.

The scikit-learn documentation also gives little insight into exactly how this is used. My first thought was that it might be used for some type of tag-based model selection where only estimators meeting particular behavioral / characteristic criteria are trained and evaluated?

I guess they are used primarily in tests and by helper functions to check/validate their inputs and outputs: https://scikit-learn.org/stable/developers/develop.html#estimator-tags

Nice find. Sharing the overview summary in this thread for readability:

Scikit-learn introduced estimator tags in version 0.21. These are annotations of estimators that allow programmatic inspection of their capabilities, such as sparse matrix support, supported output types and supported methods. The estimator tags are a dictionary returned by the method _get_tags().
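
As a rough illustration of the mechanism described in that summary, the sketch below imitates how per-class `_more_tags()` overrides can be merged into the dictionary returned by `_get_tags()`. It is a simplified, self-contained imitation and not scikit-learn's implementation; the classes here are local stand-ins, `MyRegressor` is hypothetical, and only two tag names (`requires_y`, `allow_nan`) are shown.

```python
class BaseEstimator:
    """Simplified stand-in that defines default tags and merges overrides."""

    def _more_tags(self):
        return {"requires_y": False, "allow_nan": False}

    def _get_tags(self):
        tags = {}
        # Walk the MRO from the most generic class to the most specific,
        # so that subclasses and mixins override earlier defaults.
        for klass in reversed(type(self).__mro__):
            more = klass.__dict__.get("_more_tags")
            if more is not None:
                tags.update(more(self))
        return tags


class RegressorMixin:
    """Stand-in mixin that declares regressors need a target."""

    def _more_tags(self):
        return {"requires_y": True}


class MyRegressor(RegressorMixin, BaseEstimator):
    """Hypothetical estimator with no tag overrides of its own."""


print(MyRegressor()._get_tags())
```

A test helper could then branch on `_get_tags()["requires_y"]` to decide whether to pass `y` when exercising an estimator, which matches the "used primarily in tests and by helper functions" reading above.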

Given your research into how sklearn uses them, I would vote we hold off for now. But I'm still happy to be persuaded otherwise.

I agree, I don't think these are a high priority at the moment.

python/cuml/common/base.pyx

Co-authored-by: Corey J. Nolet <[email protected]>

beckernick · 2020-06-17T14:09:46Z

rerun tests

cjnolet

LGTM

beckernick added 2 commits June 12, 2020 12:34

move mixins to common.base

7d216be

update existing regressors to use the new common.base mixin

c2c9122

beckernick changed the title ~~[WIP] Refactor mixin classes [skip-ci]~~ [WIP] Update Mixin classes [skip-ci] Jun 12, 2020

update mixin classes for cleaner r2 and accuracy score. adds classifi…

7048819

…er mixin to logistic regression and rips out redundant logistic regression score implementation

beckernick self-assigned this Jun 12, 2020

beckernick added the 2 - In Progress Currently a work in progress label Jun 12, 2020

beckernick added 7 commits June 12, 2020 13:28

remove metrics/base.pyx as its now redundant

c198678

add mixins to mbsgd classifier/regressor

36a5036

use mixins for kneighbors and rip out redundant score functions

cd181c8

use mixins for svc/svr and rip out redundant score functions

88ea38f

actually pass kwargs down to the predict method in score

e15924d

use classifiermixin for rf classifier

2f8e9eb

use classifiermixin for rf regressor

bea6bd1

beckernick marked this pull request as ready for review June 15, 2020 16:52

beckernick requested a review from a team as a code owner June 15, 2020 16:52

beckernick added 2 commits June 15, 2020 09:54

changelog

6949e00

merge branch-0.15

9dc8031

beckernick changed the title ~~[WIP] Update Mixin classes [skip-ci]~~ [REVIEW] Update Mixin classes [skip-ci] Jun 15, 2020

beckernick added 3 - Ready for Review Ready for review by team and removed 2 - In Progress Currently a work in progress labels Jun 15, 2020

beckernick changed the title ~~[REVIEW] Update Mixin classes [skip-ci]~~ [REVIEW] Update Mixin classes and include in estimators Jun 15, 2020

beckernick added 3 commits June 15, 2020 10:58

remove unused accuracy_score input

1970de7

remove unused r2_score import

0e31a82

clean up imports in knn classifier

b861a05

cjnolet requested changes Jun 16, 2020

beckernick and others added 2 commits June 16, 2020 15:05

Update python/cuml/common/base.pyx

b2b719c

Co-authored-by: Corey J. Nolet <[email protected]>

Update python/cuml/common/base.pyx

0a06053

Co-authored-by: Corey J. Nolet <[email protected]>

beckernick and others added 2 commits June 16, 2020 15:05

Update python/cuml/common/base.pyx

64c689e

Co-authored-by: Corey J. Nolet <[email protected]>

Update python/cuml/common/base.pyx

5a90aa3

Co-authored-by: Corey J. Nolet <[email protected]>

cjnolet added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Jun 17, 2020

cjnolet approved these changes Jun 17, 2020

cjnolet merged commit 1e3ab6e into rapidsai:branch-0.15 Jun 17, 2020

beckernick deleted the feature/refactor-mixin-classes branch June 18, 2020 01:42

tfeher mentioned this pull request Jul 23, 2020

[FEA] Add score method to SVC #1865

Closed

beckernick mentioned this pull request Jul 29, 2020

[FEA] Multinomial Naive Bayes should inherit from ClassifierMixin and use it for score #2614

Closed

beckernick mentioned this pull request Feb 4, 2021

[FEA] Add _estimator_type attribute to all clustering estimators #3462

Closed

beckernick mentioned this pull request Feb 23, 2021

[BUG] Switch to rmm.to_device in RegressorMixin #1092

Closed
