Add snip_momentum structured pruning which supports higher sparse ratio #3300
@microsoft-github-policy-service agree company="Intel"
Because different algorithms may not share the same best hyperparameters, we tried other settings. The main differences are that we use only the second-to-last layer for distillation and change the learning rate.
Tested the accuracy and it looks great.
@ftian1, there is a formatting issue on the PR. The pre-commit needs to be run and the file changes committed to the branch. In particular, the following needs to be run on the repo:
Head branch was pushed to by a user without write access
@xiaoxiawu-microsoft Sorry for the late response due to the PRC holiday, and thanks for your review. I have fixed the yapf scan issue. However, in my local environment the check for destroyed symlinks always fails after merging master. I am not sure why this happens, as everything looks good, so I pushed the code first. I hope it will not waste pre-CI resources.
…ratio with minor accuracy loss Signed-off-by: Tian, Feng <[email protected]>
@xiaoxiawu-microsoft These pre-CI errors are not related to my changes. Could you please take a look?
This PR contributes the snip_momentum pruning algorithm from Intel Neural Compressor to DeepSpeed compression, as we proposed in the RFC. The snip_momentum algo implements the algorithm described here.
We tested it on DeepSpeedExamples/compression/bert with a newly added script, bash_script/pruning_sparse_snip_momentum.sh, and got the results below. The changes to the examples are here.
cc @hshen14 @wenhuach21
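For readers unfamiliar with the idea, here is a minimal sketch of a momentum-smoothed SNIP criterion combined with block-wise (structured) masking. It is an illustration only, not the code in this PR or in Intel Neural Compressor: the function names `update_scores` and `block_mask`, the `alpha` and `beta` values, and the 1 x 2 block shape are all assumptions, and the linked description may differ in detail.

```python
# Illustrative sketch only; names, constants, and block shape are assumptions.

def update_scores(scores, weights, grads, alpha=0.9, beta=1.0):
    """Accumulate a SNIP-style saliency |w * g| with momentum:
    s <- alpha * s + beta * |w * g| (alpha and beta are assumed values)."""
    return [
        [alpha * s + beta * abs(w * g) for s, w, g in zip(srow, wrow, grow)]
        for srow, wrow, grow in zip(scores, weights, grads)
    ]


def block_mask(scores, block_cols, sparsity):
    """Build a structured mask: sum scores over 1 x block_cols blocks and
    zero out the lowest-scoring fraction of blocks given by `sparsity`."""
    rows, cols = len(scores), len(scores[0])
    blocks = []
    for r in range(rows):
        for c in range(0, cols, block_cols):
            blocks.append((sum(scores[r][c:c + block_cols]), r, c))
    blocks.sort(key=lambda b: b[0])  # lowest saliency first
    n_prune = int(len(blocks) * sparsity)
    mask = [[1] * cols for _ in range(rows)]
    for _, r, c in blocks[:n_prune]:
        for j in range(c, min(c + block_cols, cols)):
            mask[r][j] = 0
    return mask


# Toy 2 x 4 weight matrix with made-up gradients.
weights = [[0.5, -0.2, 0.1, 0.0], [1.0, 0.3, -0.4, 0.2]]
grads = [[0.1, 0.1, 0.1, 0.1], [0.2, 0.2, 0.2, 0.2]]
scores = [[0.0] * 4 for _ in range(2)]

scores = update_scores(scores, weights, grads)
mask = block_mask(scores, block_cols=2, sparsity=0.5)
print(mask)
```

In a real training loop the scores would be updated every step and the mask recomputed on a schedule. The sketch only shows how the accumulated saliency selects whole blocks rather than individual weights.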