Sharing some practical notes on using the latest peft-0.4.0. #189

Qznan · 2023-08-24T10:56:13Z

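This project requires peft==0.3.0.dev0, but newer features such as QLoRA need a newer peft (#144). In addition, when an old peft is combined with the latest deepspeed, resume_from_checkpoint may also require modifying the deepspeed code to make it work (#161).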

After a small modification, I can now train and run inference successfully with peft==0.4.0. My environment is:

torch==2.0.1
transformers==4.31.0
deepspeed==0.10.0
peft==0.4.0

The modification: simply remove the following code segment from run_clm_pt_with_peft.py and run_clm_sft_with_peft.py:

model.state_dict = (
        lambda self, *_, **__: get_peft_model_state_dict(self, old_state_dict())
    ).__get__(model, type(model))

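Here the author replaces the state_dict function of the PeftModel instance so that it returns only the fine-tuned parameters, e.g. the LoRA weights. My guess is that this is meant to make deepspeed keep only the LoRA parameters in the module field when it saves a model checkpoint such as mp_rank_00_model_states.pt, which saves disk space.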
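To make the pattern concrete, here is a minimal sketch in plain Python of what that replacement does. ToyModel and keep_lora_only are hypothetical stand-ins I made up for the PeftModel and for get_peft_model_state_dict; only the `lambda ... .__get__(model, type(model))` binding mirrors the snippet above.

```python
class ToyModel:
    """Hypothetical stand-in for a PeftModel; values are placeholders, not tensors."""

    def __init__(self):
        self._params = {
            "layers.0.q_proj.weight": "frozen base weight",
            "layers.0.q_proj.lora_A.weight": "trainable",
            "layers.0.q_proj.lora_B.weight": "trainable",
        }

    def state_dict(self):
        return dict(self._params)


def keep_lora_only(model, state_dict):
    # Stand-in for get_peft_model_state_dict: keep only the LoRA entries.
    return {k: v for k, v in state_dict.items() if "lora_" in k}


model = ToyModel()

# Capture the original bound method BEFORE patching, as the training scripts do.
old_state_dict = model.state_dict

# Bind the lambda to this one instance so that model.state_dict() now
# returns the filtered dict instead of the full one.
model.state_dict = (
    lambda self, *_, **__: keep_lora_only(self, old_state_dict())
).__get__(model, type(model))

full_keys = sorted(old_state_dict())      # all parameters, via the saved method
patched_keys = sorted(model.state_dict())  # LoRA parameters only
```

Anything that calls model.state_dict() afterwards, deepspeed's checkpoint saving included, only sees the filtered view, while old_state_dict() still gives access to everything.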

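In the new peft, however, every LoRA parameter name gets an adapter_name suffix (default: 'default') once the model is loaded, and each PeftModel.save_pretrained call also invokes get_peft_model_state_dict. As a result, the state_dict replaced by the author has already been filtered and no longer carries the default suffix, while the new get_peft_model_state_dict still filters parameters by whether their names contain that suffix. In the end PeftModel.save_pretrained writes an empty adapter_model.bin.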
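The double filtering can be reproduced with plain dicts. The function below is my simplified imitation of the LoRA branch of get_peft_model_state_dict in peft 0.4.0 (bias handling and other adapter types are left out), and the parameter names are made up for illustration; it is not the library code itself.

```python
def simplified_get_peft_state_dict(state_dict, adapter_name="default"):
    """Simplified imitation of the LoRA filtering, as I understand it."""
    # 1. keep LoRA parameters only
    lora = {k: v for k, v in state_dict.items() if "lora_" in k}
    # 2. keep only keys that belong to this adapter
    lora = {k: v for k, v in lora.items() if adapter_name in k}
    # 3. strip the adapter name from the saved keys
    return {k.replace(f".{adapter_name}", ""): v for k, v in lora.items()}


# Hypothetical parameter names as they look after loading with the new peft.
full_state_dict = {
    "model.layers.0.q_proj.weight": 0.0,
    "model.layers.0.q_proj.lora_A.default.weight": 1.0,
    "model.layers.0.q_proj.lora_B.default.weight": 2.0,
}

# What the patched model.state_dict() returns: filtered once, suffix removed.
filtered_once = simplified_get_peft_state_dict(full_state_dict)

# What save_pretrained ends up with when it filters the patched output again:
# no key contains "default" any more, so step 2 drops everything.
filtered_twice = simplified_get_peft_state_dict(filtered_once)
```

Here filtered_once still holds the two LoRA entries (without the suffix), whereas filtered_twice is an empty dict, which corresponds to the empty adapter file.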

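So removing the state_dict replacement above is enough, but the checkpoints saved by deepspeed then do contain the full model and are too large. One possible workaround is to keep the state_dict replacement, keep a reference to old_state_dict, and explicitly pass state_dict=old_state_dict() when SavePeftModelCallback calls save_pretrained, so that save_pretrained works from the original full state dict and saves the adapter correctly.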
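A rough, untested sketch of that workaround idea follows. The constructor argument full_state_dict_fn, the subdir default, and the simplified folder layout are my own assumptions and not the project's actual callback; it also relies on PeftModel.save_pretrained forwarding a state_dict keyword to get_peft_model_state_dict, which I believe peft 0.4.0 does. I have not checked how this behaves with ZeRO-3 partitioned parameters.

```python
import os

try:
    from transformers import TrainerCallback
except ImportError:
    # Fallback only so the sketch can be exercised without transformers installed.
    TrainerCallback = object

# Same value as transformers.trainer_utils.PREFIX_CHECKPOINT_DIR.
PREFIX_CHECKPOINT_DIR = "checkpoint"


class SavePeftModelCallback(TrainerCallback):
    """Saves the adapter from the ORIGINAL (unpatched) state dict on every save."""

    def __init__(self, full_state_dict_fn, subdir="sft_lora_model"):
        # full_state_dict_fn: the old_state_dict captured before model.state_dict
        # was replaced (hypothetical parameter name).
        self.full_state_dict_fn = full_state_dict_fn
        self.subdir = subdir  # assumed folder name

    def on_save(self, args, state, control, **kwargs):
        peft_model_path = os.path.join(
            args.output_dir,
            f"{PREFIX_CHECKPOINT_DIR}-{state.global_step}",
            self.subdir,
        )
        # Pass the full state dict explicitly so the adapter-name filter inside
        # save_pretrained still finds the '.default' keys.
        kwargs["model"].save_pretrained(
            peft_model_path, state_dict=self.full_state_dict_fn()
        )
        return control
```

In the training script this would be wired up roughly as `trainer.add_callback(SavePeftModelCallback(old_state_dict))`, with old_state_dict taken as `model.state_dict` before the replacement is applied. Note that old_state_dict() materializes the full state dict at each save, so it trades some time and memory at save points for the smaller deepspeed checkpoints.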

Discussion is welcome 😄

ppppls115 · 2023-09-08T07:45:08Z

I got this error halfway through training: AttributeError: 'LlamaForCausalLM' object has no attribute 'sava checkpoint'. Is this a peft version problem?

2 replies

Qznan Sep 9, 2023
Author

For this one, you can refer to my answer under #166.

ppppls115 Sep 13, 2023

OK, solved. Thanks!

Khalilxfan · 2023-10-08T08:58:34Z

How do I explicitly pass state_dict=old_state_dict() when calling save_pretrained in SavePeftModelCallback? Is there any code I can refer to?

0 replies

Yu-Jie06 · 2025-01-17T14:46:10Z

Hello, could you tell me what is going on with this error: TypeError: GenerationMixin._extract_past_from_model_output() got an unexpected keyword argument 'standardize_cache_format'?

0 replies

Rick-24 · 2025-01-17T14:47:14Z

Hello, I have received your message and will handle it as soon as possible. Best regards!

0 replies
