[bitsandbbytes] follow-ups #9730

sayakpaul · 2024-10-21T05:45:18Z

What does this PR do?

Takes care of [Quantization] Add quantization support for bitsandbytes #9213 (comment)
Move the test repos to hf-internal-testing
Check bnb param shape
Other minor nits

HuggingFaceDocBuilderDev · 2024-10-21T05:57:32Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

DN6 · 2024-10-21T18:01:38Z

src/diffusers/quantizers/base.py

+        """
+        checks if the quantized param has expected shape.
+        """
+        if not hasattr(self, "check_quantized_param_shape"):


Think this should be?

Suggested change

if not hasattr(self, "check_quantized_param_shape"):

if not hasattr(self, "check_if_quantized_param")::

The method is checking for itself? That can't be the case right? Because the subclass inherits it? So it should already be in there?

My bad. Have updated in 3298a04.

src/diffusers/models/model_loading_utils.py

sayakpaul · 2024-10-22T03:35:18Z

@stevhliu could you review the doc changes please?

@DN6 ready for your reviews. I have run the tests and they pass.

sayakpaul · 2024-10-22T03:35:49Z

docs/source/en/quantization/bitsandbytes.md

-Once a model is quantized, you can push the model to the Hub with the [`~ModelMixin.push_to_hub`] method. The quantization `config.json` file is pushed first, followed by the quantized model weights.
-
-```py
-from diffusers import FluxTransformer2DModel, BitsAndBytesConfig
-
-quantization_config = BitsAndBytesConfig(load_in_8bit=True)
-
-model_8bit = FluxTransformer2DModel.from_pretrained(
-    "black-forest-labs/FLUX.1-dev", 
-    subfolder="transformer",
-    quantization_config=quantization_config
-)
-```
+Once a model is quantized, you can push the model to the Hub with the [`~ModelMixin.push_to_hub`] method. The quantization `config.json` file is pushed first, followed by the quantized model weights. You can also save the serialized 4-bit models locally with [`~ModelMixin.save_pretrained`].


To unify the content between 8bit and 4bit hfoption.

sayakpaul · 2024-10-22T03:36:08Z

src/diffusers/models/model_loading_utils.py

+                and hf_quantizer.check_if_quantized_param(model, param, param_name, state_dict, param_device=device)
+            ):
+                hf_quantizer.check_quantized_param_shape(param_name, empty_state_dict[param_name].shape, param.shape)
+            elif not is_quant_method_bnb:


Could have done with else but I think it's a tad bit safer.

sayakpaul added 3 commits October 21, 2024 10:35

bnb follow ups.

14a44e5

add a warning when dtypes mismatch.

065700c

fx-copies

9c3a952

sayakpaul marked this pull request as draft October 21, 2024 05:50

sayakpaul added 3 commits October 21, 2024 13:47

clear cache.

e39544a

check_if_quantized_param

23fdc7a

add a check on shape.

cb94414

sayakpaul changed the title ~~[WIP][bitsandbbytes] follow-ups~~ [bitsandbbytes] follow-ups Oct 21, 2024

sayakpaul requested a review from DN6 October 21, 2024 09:54

updates

1fa9d7f

sayakpaul marked this pull request as ready for review October 21, 2024 10:18

docs

4c7ea4f

DN6 reviewed Oct 21, 2024

View reviewed changes

sayakpaul added 3 commits October 22, 2024 09:03

improve readability.

6dc8936

resources.

3dbe41f

Merge branch 'main' into bnb-follow-up

8a99701

sayakpaul requested a review from DN6 October 22, 2024 03:35

sayakpaul commented Oct 22, 2024

View reviewed changes

sayakpaul added 2 commits October 22, 2024 14:23

Merge branch 'main' into bnb-follow-up

1af10a8

fix

3298a04

DN6 approved these changes Oct 22, 2024

View reviewed changes

sayakpaul merged commit 60ffa84 into main Oct 22, 2024
18 checks passed

sayakpaul deleted the bnb-follow-up branch October 22, 2024 10:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bitsandbbytes] follow-ups #9730

[bitsandbbytes] follow-ups #9730

sayakpaul commented Oct 21, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Oct 21, 2024

DN6 Oct 21, 2024

sayakpaul Oct 22, 2024

DN6 Oct 22, 2024 •

edited

Loading

sayakpaul Oct 22, 2024

sayakpaul commented Oct 22, 2024 •

edited

Loading

sayakpaul Oct 22, 2024

sayakpaul Oct 22, 2024

	if not hasattr(self, "check_quantized_param_shape"):
	if not hasattr(self, "check_if_quantized_param")::

[bitsandbbytes] follow-ups #9730

[bitsandbbytes] follow-ups #9730

Conversation

sayakpaul commented Oct 21, 2024 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented Oct 21, 2024

DN6 Oct 21, 2024

Choose a reason for hiding this comment

sayakpaul Oct 22, 2024

Choose a reason for hiding this comment

DN6 Oct 22, 2024 • edited Loading

Choose a reason for hiding this comment

sayakpaul Oct 22, 2024

Choose a reason for hiding this comment

sayakpaul commented Oct 22, 2024 • edited Loading

sayakpaul Oct 22, 2024

Choose a reason for hiding this comment

sayakpaul Oct 22, 2024

Choose a reason for hiding this comment

sayakpaul commented Oct 21, 2024 •

edited

Loading

DN6 Oct 22, 2024 •

edited

Loading

sayakpaul commented Oct 22, 2024 •

edited

Loading