-
Notifications
You must be signed in to change notification settings - Fork 546
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[LogisticRegressionMG][FEA] Support training when dataset contains only one class #5655
Conversation
4324d4d
to
56f00c4
Compare
2adfc37
to
7d5c635
Compare
/ok to test |
if not isinstance(cu_preds, np.ndarray): | ||
cu_preds = cu_preds.to_numpy() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What do we expect there? Sparse arrays?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
cu_preds stores predicted labels in a dense array or cudf.
2cc76f5
to
ce40be2
Compare
ce40be2
to
1bc15ec
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A few minor suggestions, but in principle no objections. LGTM!
/merge |
This pull request introduces functionality for C++ training on datasets with a single label. It helps Spark Rapids ML match Spark's behavior. Additionally, it updates the Dask class to generate an error message, consistent with Scikit-learn's behavior.
This PR depends on #5632