New recipes support for DeepSeek's family of distilled R1 models

Latest

Latest

xiaoxshe released this 01 Feb 01:45

· 1 commit to main since this release

a15395f

What's Changed

New recipes

Added support for DeepSeek's family of distilled R1 models. Users can now finetune various sizes of DeepSeek-R1-Distill-Llama and DeepSeek-R1-Distill-Qwen using SFT and PEFT (lora/qlora).

Assets 2