[DRAFT]: Add FracBits experimental feature #1286
Conversation
Commits in this PR (each signed off by Kim, Vinnam <[email protected]>):
- …#1234) Implement FracBitsQuantizationBuilder and Controller
  - Implement Builder and Controller
  - Add and test ModelSizeCompressionLoss
- Add FracBits runnable script and configs
- Fix FracBitsAsymmetricQuantizer bug
- Update config
- Update configs for mobilenetv2-imagenet
- Fix desynchronization bug in distributed training
- Fix test errors
- Add find_unused_parameters=True
- Fix code format
- Add resnet50 configs
- Fix PyTorch example dependency
  - Move efficientnet_pytorch from tests to examples
  - Add setuptools==59.5.0 because of a tensorboard issue
- Add inception_v3 configs
- Fix configs
- Gather integer model size
  - We had been gathering the fractional model size to compute compression_rate for the report; fix it to report the integer model size.
- Refactor FracBits params
- Add find_unused_parameters to configs
- Update batch size of resnet50 config
- Add pylint disables
- Log fractional model size as well
- Fix parameter construction from config dictionary
- Update FracBits README.md
- Update README.md
- Update accuracy by re-running experiments after the latest code change
- Fix typos
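The commit list above mentions a ModelSizeCompressionLoss. As a purely illustrative sketch (not the implementation added in this PR; the function and argument names below are made up), a model-size regularization loss of this kind typically multiplies each layer's weight count by its learnable fractional bit-width and penalizes the total against a target size:

```python
import torch


def model_size_loss(num_params_per_layer, frac_bits_per_layer, target_size_bits):
    """Illustrative size regularizer: penalize total (fractional) weight bits vs. a target.

    num_params_per_layer: weight count of each quantized layer (ints)
    frac_bits_per_layer:  learnable scalar tensors holding fractional bit-widths
    target_size_bits:     desired total model size in bits
    """
    total_bits = sum(n * b for n, b in zip(num_params_per_layer, frac_bits_per_layer))
    # Quadratic penalty: the gradient grows with the distance from the target size.
    return ((total_bits - target_size_bits) / target_size_bits) ** 2


# Two layers, both starting at 8 bits, with a 6-bit-average size target.
bits = [torch.tensor(8.0, requires_grad=True), torch.tensor(8.0, requires_grad=True)]
loss = model_size_loss([1_000_000, 500_000], bits, target_size_bits=6 * 1_500_000)
loss.backward()  # gradients push the fractional bit-widths down toward the target
```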
@vinnamkim, thanks for your contribution!
Some general questions:
- Did you compare the implemented algorithm with existing NNCF algorithms?
- What user scenario do you cover?
Hi @alexsu52,
I just compared FracBits with NNCF 8-bit QAT; you can see the results in the README.md included in this PR. They show that FracBits can compress the total bits of the model weights (model size) by 1.5x compared to NNCF 8-bit QAT for 3 models (MobileNet-V2, Inception-V3, and ResNet-50) and 2 datasets (ImageNet and CIFAR-100) with competitive accuracy degradation (<1%).
I think it can be used by users who want to compress their model size further with mixed-precision QAT. Unlike HAWQ and AutoQ, it doesn't require any time-consuming initialization phase or external exploration phase. However, it requires twice as many quantization forward-backward propagation steps as vanilla QAT.
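For context, a minimal sketch of the fractional bit-width interpolation described in the FracBits paper, which is also where the doubled quantization passes come from: each weight tensor is fake-quantized at the two neighbouring integer bit-widths and the results are blended by the learnable fractional part. The function names and the simple symmetric quantizer below are illustrative, not the FracBitsQuantizer added in this PR.

```python
import torch


def uniform_quantize(x: torch.Tensor, num_bits: int) -> torch.Tensor:
    # Plain symmetric uniform fake-quantization at an integer bit-width.
    levels = 2 ** (num_bits - 1) - 1
    scale = x.abs().max().clamp(min=1e-8) / levels
    return torch.round(x / scale).clamp(-levels, levels) * scale


def fracbits_quantize(x: torch.Tensor, frac_bits: torch.Tensor) -> torch.Tensor:
    # Linear interpolation between the two neighbouring integer bit-widths;
    # this is why every step runs the quantizer twice per weight tensor.
    lo = torch.floor(frac_bits)
    w_hi = frac_bits - lo  # interpolation weight of the higher bit-width
    return (1.0 - w_hi) * uniform_quantize(x, int(lo)) + w_hi * uniform_quantize(x, int(lo) + 1)


x = torch.randn(16)
b = torch.tensor(5.3, requires_grad=True)  # learnable fractional bit-width
y = fracbits_quantize(x, b)
y.sum().backward()  # b.grad accumulates through the interpolation weight w_hi
print(b.grad)
```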
It doesn't seem fair to compare only with 8-bit QAT. Do you have any comparison results (time / accuracy / compression rate / ease of use) with HAWQ and AutoQ?
If I understand correctly, the user expects to get a smaller model compared with the INT8 model in the OpenVINO format. Does OpenVINO support your model? You reported the theoretical compression rate in README.md. What is the actual compression rate?
Changes
Add a new mixed-precision QAT algorithm, FracBits ([paper], [code]), as an experimental feature.
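As a rough usage illustration only: an experimental algorithm like this would normally be switched on through the NNCF config passed to create_compressed_model. The algorithm name "fracbits_quantization" and the "params" keys below are placeholders for this sketch, not necessarily the names introduced by this PR.

```python
import torch
from torchvision.models import resnet50

from nncf import NNCFConfig
from nncf.torch import create_compressed_model

# Placeholder config: the algorithm name and params are illustrative, not this PR's schema.
nncf_config = NNCFConfig.from_dict({
    "input_info": {"sample_size": [1, 3, 224, 224]},
    "compression": {
        "algorithm": "fracbits_quantization",  # hypothetical experimental algorithm name
        "params": {"compression_rate": 1.5},   # e.g. target size reduction vs. 8-bit
    },
})

model = resnet50(pretrained=True)
compression_ctrl, compressed_model = create_compressed_model(model, nncf_config)

# Usual QAT loop follows; compression_ctrl.loss() would contribute the size-regularization term.
```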
Reason for changes
To expand the choice of mixed-precision QAT algorithms for users.
Related tickets
87363
Tests
Implemented in tests/torch/experimental/fracbits.