Optimize MG variance calculation for dataset standardization for logistic regression #6138

Merged
merged 4 commits into from
Dec 5, 2024


lijinf2
Contributor

@lijinf2 lijinf2 commented Nov 20, 2024

The MG (multi-GPU) variance calculation currently invokes the raft SG (single-GPU) vars API. However, the abs() step of the raft SG vars API introduces errors when the data is skewed across GPUs (e.g., one GPU gets the small values 1 and 2, and the other GPU gets the large values 98 and 99).

This PR avoids the effect of abs() when invoking SG vars to calculate MG vars. The key idea is to pass a vector of zeros when calling SG vars.
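To make the failure mode concrete, here is a minimal pure-Python sketch of the example from the description. It is not the raft or cuML code: `sg_vars` is a hypothetical stand-in that assumes the single-GPU routine computes a population variance as `abs(mean(x^2) - mu^2)`, and it assumes the zero vector is passed in place of the mean. The function names and the way partial results are combined are illustrative only.

```python
def sg_vars(values, mu):
    """Hypothetical single-GPU variance with an abs() step (assumed form)."""
    mean_sq = sum(v * v for v in values) / len(values)
    return abs(mean_sq - mu * mu)


def global_mean(partitions):
    """Mean over all partitions, as an all-reduce of sums and counts would give."""
    n = sum(len(p) for p in partitions)
    return sum(sum(p) for p in partitions) / n


def mg_vars_with_global_mean(partitions):
    """Pass the global mean to each local call; abs() can flip a negative partial."""
    n = sum(len(p) for p in partitions)
    mu = global_mean(partitions)
    return sum(len(p) * sg_vars(p, mu) for p in partitions) / n


def mg_vars_with_zero_mean(partitions):
    """Pass zero so each local call returns mean(x^2); subtract mu^2 once, globally."""
    n = sum(len(p) for p in partitions)
    mu = global_mean(partitions)
    mean_sq = sum(len(p) * sg_vars(p, 0.0) for p in partitions) / n
    return mean_sq - mu * mu


# One "GPU" holds the small values, the other holds the large values.
partitions = [[1.0, 2.0], [98.0, 99.0]]
skewed = mg_vars_with_global_mean(partitions)
fixed = mg_vars_with_zero_mean(partitions)
print(skewed, fixed)
```

Under these assumptions, the first partition's partial result is negative before abs() (2.5 - 2500), so taking its absolute value inflates the combined result to 4850.0. With a zero mean, abs() is applied to a non-negative mean of squares and is a no-op, which gives the population variance 2352.5.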

@lijinf2 lijinf2 requested a review from a team as a code owner November 20, 2024 04:01
@lijinf2 lijinf2 requested review from wphicks and divyegala November 20, 2024 04:01

copy-pr-bot bot commented Nov 20, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@lijinf2 lijinf2 added the labels CUDA / C++ (CUDA issue), Cython / Python (Cython or Python issue), improvement (Improvement / enhancement to an existing function), and non-breaking (Non-breaking change), and removed the CUDA/C++ label Nov 20, 2024
@github-actions github-actions bot added the CUDA/C++ label and removed the Cython / Python (Cython or Python issue) label Nov 20, 2024
@lijinf2 lijinf2 changed the title from "Optimize MG variance calculation for logistic regression standardization" to "Optimize MG variance calculation for dataset standardization for logistic regression" Nov 20, 2024
@lijinf2 lijinf2 force-pushed the raft_stat_1118 branch 2 times, most recently from 33e27de to b3ee71e Compare November 26, 2024 18:30
@lijinf2 lijinf2 added the 3 - Ready for Review (Ready for review by team) label Nov 26, 2024
@dantegd
Member

dantegd commented Dec 5, 2024

/merge

@rapids-bot rapids-bot bot merged commit de96f3a into rapidsai:branch-24.12 Dec 5, 2024
65 checks passed
Labels
3 - Ready for Review (Ready for review by team), CUDA / C++ (CUDA issue), CUDA/C++, improvement (Improvement / enhancement to an existing function), non-breaking (Non-breaking change)
3 participants