deepseek-vl系列的微调支持 (finetune) #10

Jintao-Huang · 2024-03-12T09:20:17Z

ms-swift大模型训练框架已经支持了deepseek-vl系列模型的推理和微调～

RERV · 2024-03-12T14:01:08Z

Thank you for supporting DeepSeek-VL!

soloice · 2024-03-13T10:50:22Z

@Jintao-Huang Can you kindly confirm if swift can be used to finetune visual encoder? If so, how? If not, what's the simplest way to support it?

xs818818 · 2024-03-13T13:19:51Z

还有就是lora微调后怎么部署使用

SinanAkkoyun · 2024-03-13T15:15:05Z

@soloice Thank you very much for asking

Jintao-Huang · 2024-03-13T22:23:14Z

LoRA fine-tuning and merge-LoRA have been supported for both the visual encoder and aligner.
Full parameter fine-tuning is also supported.
😊

SinanAkkoyun · 2024-03-13T22:28:53Z

I am super grateful for your work, thank you a lot!!! ❤️

This was referenced Mar 12, 2024

请问什么时候可以开源微调代码 #9

Closed

Fine-tuning Script #6

Open

Jintao-Huang mentioned this issue Mar 13, 2024

support deepseek vl finetune vision encoder modelscope/ms-swift#547

Merged

Jintao-Huang changed the title ~~deepseek-vl系列的微调支持~~ deepseek-vl系列的微调支持 (finetune) Mar 14, 2024

Jintao-Huang mentioned this issue Mar 14, 2024

fix deepseek-vl 'eval_loss' not found bug modelscope/ms-swift#552

Merged

soloice closed this as completed Mar 15, 2024

soloice pinned this issue Mar 15, 2024

Provide feedback