New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[LogisticRegressionMG][FEA] Support training when dataset contains only one class #5655

Merged

rapids-bot merged 5 commits into rapidsai:branch-23.12 from lijinf2:fea_lrmg_onelabel

Nov 29, 2023

Contributor

lijinf2 commented Nov 14, 2023 •

edited

Loading

This pull request introduces functionality for C++ training on datasets with a single label. It helps Spark Rapids ML match Spark's behavior. Additionally, it updates the Dask class to generate an error message, consistent with Scikit-learn's behavior.

This PR depends on #5632

lijinf2 requested review from a team as code owners

November 14, 2023 18:52

copy-pr-bot bot commented Nov 14, 2023

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

github-actions bot added Cython / Python CUDA/C++ labels

lijinf2 force-pushed the fea_lrmg_onelabel branch from 4324d4d to 56f00c4 Compare

November 14, 2023 18:53

lijinf2 added non-breaking 3 - Ready for Review labels

lijinf2 force-pushed the fea_lrmg_onelabel branch 2 times, most recently from 2adfc37 to 7d5c635 Compare

November 14, 2023 18:56

lijinf2 added the improvement label

Contributor

csadorf commented Nov 14, 2023

/ok to test

lijinf2 mentioned this pull request

[LogisticRegressionMG] Support standardization #5656

Closed

csadorf requested changes

View reviewed changes

python/cuml/linear_model/logistic_regression_mg.pyx Outdated Show resolved Hide resolved

python/cuml/linear_model/logistic_regression_mg.pyx Outdated Show resolved Hide resolved

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated Show resolved Hide resolved

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated Show resolved Hide resolved

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated

Comment on lines 400 to 401

		if not isinstance(cu_preds, np.ndarray):
		cu_preds = cu_preds.to_numpy()

Contributor

csadorf Nov 28, 2023

What do we expect there? Sparse arrays?

Contributor Author

lijinf2 Nov 28, 2023

cu_preds stores predicted labels in a dense array or cudf.

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated Show resolved Hide resolved

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated Show resolved Hide resolved

python/cuml/linear_model/logistic_regression_mg.pyx Outdated Show resolved Hide resolved

python/cuml/linear_model/base_mg.pyx Outdated Show resolved Hide resolved

lijinf2 force-pushed the fea_lrmg_onelabel branch 2 times, most recently from 2cc76f5 to ce40be2 Compare

November 28, 2023 19:49

lijinf2 added 2 commits

November 28, 2023 13:05


          support one label training

972e5f9


          address review comments

1bc15ec

lijinf2 force-pushed the fea_lrmg_onelabel branch from ce40be2 to 1bc15ec Compare

November 28, 2023 21:12

csadorf approved these changes

View reviewed changes

Contributor

csadorf left a comment

A few minor suggestions, but in principle no objections. LGTM!

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated Show resolved Hide resolved

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated Show resolved Hide resolved

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated Show resolved Hide resolved

lijinf2 and others added 2 commits

November 28, 2023 14:31


          Update python/cuml/tests/dask/test_dask_logistic_regression.py

c28bc0a

Co-authored-by: Simon Adorf <[email protected]>


          revise PR according to second round comments

2b73d40

csadorf approved these changes

View reviewed changes

python/cuml/tests/dask/test_dask_logistic_regression.py Outdated Show resolved Hide resolved


          move functool import to head of the file

df8fdf6

cjnolet approved these changes

View reviewed changes

Member

cjnolet commented Nov 29, 2023

/merge

rapids-bot bot merged commit 97b6fa3 into rapidsai:branch-23.12

49 checks passed

lijinf2 deleted the fea_lrmg_onelabel branch

June 26, 2024 21:58

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

3 - Ready for Review CUDA/C++ Cython / Python improvement non-breaking