account for the new merged/unmerged weight to perform the quantization again #1370
What does this PR do?
The BNB CI has 2 failing tests, caused by the following:
In the merge method, the LoRA delta weights are added to the dequantized base-layer weights, and the base layer's weight is then set to a new instance of Params4bit holding the merged weights:
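A simplified sketch of that merge path (paraphrasing the LoRA Linear4bit merge logic; the helper name `merge_4bit` and its signature are assumptions for illustration, not the actual PEFT code):

```python
import bitsandbytes as bnb
import torch


def merge_4bit(base_layer: torch.nn.Module, lora_delta: torch.Tensor) -> None:
    """Sketch: merge a LoRA delta into a bnb Linear4bit base layer."""
    weight = base_layer.weight  # a bnb.nn.Params4bit instance

    # Dequantize the 4-bit base weights and add the LoRA delta.
    w_data = bnb.functional.dequantize_4bit(weight.data, weight.quant_state) + lora_delta

    # Re-wrap the merged weights in a fresh Params4bit, reusing the old kwargs.
    # Note: these kwargs still carry bnb_quantized=True and the old quant_state.
    kwargs = weight.__dict__
    base_layer.weight = bnb.nn.Params4bit(
        w_data.to("cpu"), requires_grad=False, **kwargs
    ).to(weight.device)
```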
Now, kwargs is copied from the existing weight's attributes, which include `bnb_quantized=True` and the old `quant_state`, so the new merged weights are never re-quantized and the stale quant state is kept. Then, during the Linear4bit forward call, that stale quant state is applied even though the weights are unquantized. This happens due to the changes in PR bitsandbytes-foundation/bitsandbytes#970.
This PR fixes the failures by setting `bnb_quantized=False` when merging/unmerging 4-bit layers.
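A minimal sketch of the fix, continuing the hypothetical `merge_4bit` helper above (the exact placement inside PEFT may differ):

```python
    # Fix: mark the merged weights as not yet quantized so bitsandbytes
    # re-quantizes them (and creates a fresh quant_state) when the new
    # Params4bit is moved back to the GPU.
    kwargs = weight.__dict__.copy()
    kwargs["bnb_quantized"] = False
    base_layer.weight = bnb.nn.Params4bit(
        w_data.to("cpu"), requires_grad=False, **kwargs
    ).to(weight.device)
```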