Replies: 1 comment 3 replies
-
Please see https://axolotl-ai-cloud.github.io/axolotl/docs/config.html 's It would be passed to hf trainer https://huggingface.co/docs/transformers/main_classes/trainer#transformers.TrainingArguments.weight_decay |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello, let me one question.
If using axolotl for supervised fune-tuning, how do I implement penalizing the distance between starting and current weights? This was shown to be effective in https://arxiv.org/abs/1706.03610
Beta Was this translation helpful? Give feedback.
All reactions