Add snip_momentum structured pruning which supports higher sparse ratio #3300

ftian1 · 2023-04-19T03:55:41Z

This PR contributes the snip_momentum pruning algorithm from Intel Neural Compressor to DeepSpeed compression, as we proposed in the RFC.

The snip_momentum method implements the algorithm described here.
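For readers without the linked description at hand, here is a minimal sketch of what a momentum-based SNIP score could look like. It assumes the importance of a weight is the SNIP-style saliency |w * g| accumulated across steps with a decay factor. The function name and the `alpha` and `beta` values are illustrative assumptions, not taken from this PR.

```python
def update_snip_momentum_scores(scores, weights, grads, alpha=0.9, beta=1.0):
    """Return updated scores: alpha * old_score + beta * |weight * grad|.

    alpha and beta are hypothetical defaults chosen for illustration.
    """
    return [
        alpha * s + beta * abs(w * g)
        for s, w, g in zip(scores, weights, grads)
    ]


# Accumulate scores over two training steps for four toy weights.
scores = [0.0, 0.0, 0.0, 0.0]
weights = [0.5, -1.0, 0.1, 2.0]
for grads in ([0.2, 0.1, -0.3, 0.0], [0.4, -0.1, 0.1, 0.0]):
    scores = update_snip_momentum_scores(scores, weights, grads)

# Weights with the lowest accumulated score are the first pruning candidates.
prune_order = sorted(range(len(scores)), key=lambda i: scores[i])
```

In this toy run the last weight has the largest magnitude but a zero gradient in both steps, so it ends up with the lowest score, which is the main behavioural difference from magnitude-based (L1) pruning.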

We tested it on DeepSpeedExamples/compression/bert with a newly added script, bash_script/pruning_sparse_snip_momentum.sh, and got the results below. The changes to the examples are here.

pattern	sparsity ratio	pruning method	epochs	acc / mm-acc
1x1	80%	DeepSpeed L1	2	0.8113/0.822
1x1	80%	snip_momentum	2	0.8176/0.822
4x1	80%	snip_momentum	10	0.8248/0.8305
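To make the "pattern" and "sparsity ratio" columns concrete, the following sketch builds a block-structured mask from a matrix of importance scores. It assumes a 4x1 pattern means blocks of 4 rows by 1 column that are kept or pruned as a unit, and that blocks are ranked by their summed score. Both the axis convention and the helper name `block_mask` are assumptions for illustration, not the PR's implementation.

```python
def block_mask(scores, block_rows=4, block_cols=1, sparsity=0.8):
    """Return a 0/1 mask that zeroes the lowest-scoring blocks.

    scores is a list of rows; its shape must be divisible by the block shape.
    """
    rows, cols = len(scores), len(scores[0])
    assert rows % block_rows == 0 and cols % block_cols == 0

    # Sum the element scores inside every block.
    block_scores = {}
    for bi in range(rows // block_rows):
        for bj in range(cols // block_cols):
            block_scores[(bi, bj)] = sum(
                scores[r][c]
                for r in range(bi * block_rows, (bi + 1) * block_rows)
                for c in range(bj * block_cols, (bj + 1) * block_cols)
            )

    # Prune the requested fraction of blocks, lowest score first.
    n_prune = round(len(block_scores) * sparsity)
    pruned = set(sorted(block_scores, key=block_scores.get)[:n_prune])

    return [
        [0 if (r // block_rows, c // block_cols) in pruned else 1
         for c in range(cols)]
        for r in range(rows)
    ]


# Toy 4x5 score matrix: every column is one 4x1 block, scored 4, 8, 12, 16, 20.
toy_scores = [[c + 1 for c in range(5)] for _ in range(4)]
mask = block_mask(toy_scores)  # 80% sparsity keeps only the last column
zero_fraction = sum(v == 0 for row in mask for v in row) / 20
```

With a 1x1 pattern the same function degenerates to unstructured element-wise pruning, which matches the first two rows of the table.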

cc @hshen14 @wenhuach21

ftian1 · 2023-04-19T04:02:58Z

@microsoft-github-policy-service agree company="Intel"

wenhuach21 · 2023-04-19T04:47:49Z

Because different algorithms may not share the same best hyperparameters, we also tried other settings. The main differences are that we use only the second-to-last layer for distillation and that we changed the learning rate.

pattern	sparsity ratio	pruning method	epochs	acc / mm-acc
4x1	80%	snip_momentum	2	0.8284/0.8388
4x1	80%	snip_momentum	6	0.8339/0.8418

xiaoxiawu-microsoft · 2023-05-08T16:57:34Z

Tested the accuracy and it looks great.

xiaoxiawu-microsoft · 2023-05-08T17:28:09Z

@ftian1, there is a formatting issue on the PR. The pre-commit needs to be run and the file changes committed to the branch. In particular, the following needs to be run on the repo:

pre-commit run --all-files

Contributing - DeepSpeed

ftian1 · 2023-05-09T07:40:45Z

@xiaoxiawu-microsoft Sorry for the late response due to the PRC holiday, and thanks for your review.

I have fixed the yapf scan issue. However, in my local environment the detection of destroyed symlinks always fails after merging master. I am not sure why, since everything looks fine, so I pushed the code anyway. I hope it will not waste pre-CI resources.

ftian1 · 2023-05-10T05:07:30Z

@xiaoxiawu-microsoft Those pre-CI errors are not related to my changes. Could you please take a look?

ftian1 requested review from jeffra, tjruwase, yaozhewei, minjiaz, xiaoxiawu-microsoft, conglongli and mrwyattii as code owners April 19, 2023 03:55

ftian1 mentioned this pull request Apr 19, 2023

Add snip_momentum structured pruning example with 80% sparsity ratio microsoft/DeepSpeedExamples#348

Merged

tjruwase removed request for jeffra, conglongli, tjruwase, mrwyattii and minjiaz April 22, 2023 00:00

xiaoxiawu-microsoft enabled auto-merge (squash) May 8, 2023 16:57

xiaoxiawu-microsoft disabled auto-merge May 8, 2023 16:59

xiaoxiawu-microsoft enabled auto-merge (squash) May 8, 2023 17:00

auto-merge was automatically disabled May 9, 2023 07:12
Head branch was pushed to by a user without write access

Add snip_momentum structured pruning which can support higher sparse …

e974b0d

…ratio with minor accuracy loss Signed-off-by: Tian, Feng <[email protected]>

ftian1 force-pushed the master branch from 2f2cfc4 to e974b0d Compare May 9, 2023 08:53

Merge branch 'master' into master

6db8bdc

xiaoxiawu-microsoft enabled auto-merge (squash) May 10, 2023 17:30

xiaoxiawu-microsoft disabled auto-merge May 10, 2023 17:32

xiaoxiawu-microsoft enabled auto-merge (squash) May 10, 2023 17:32

yaozhewei approved these changes May 10, 2023

xiaoxiawu-microsoft merged commit 6938c44 into microsoft:master May 10, 2023
