New recipes support for DeepSeek's family of distilled R1 models

@xiaoxshe xiaoxshe released this 01 Feb 01:45
a15395f

What's Changed

New recipes

  • Added support for DeepSeek's family of distilled R1 models. Users can now fine-tune various sizes of DeepSeek-R1-Distill-Llama and DeepSeek-R1-Distill-Qwen using SFT and PEFT (LoRA/QLoRA).