-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
关于运行流程 #30
Comments
|
感谢大佬回答。 |
选择pt或者sft的参数的位置都在customized_trainer.py实现的,可以通过script里面的extend_layers来传新增加的layers,然后训练那部分参数 |
好的,感谢大佬! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
感谢分享!新的训练思路!
在这里我一个小白想提出几个我的疑问。期待大佬的解答!
我在使用block_expansion.py给Llama3-Chinese_v2扩展后会在指定目录出现一个pytorch_model.bin文件。
第一个问题:请问一下这个pytorch_model.bin文件是否含有原Llama3-Chinese_v2的能力?
在出现这个文件之后我把原有Llama3-Chinese_v2的相关文件复制进来一份。然后执行finetune_codealpaca.sh
第二个问题:我在训练这个新的模型文件时如何选择训练方式 pt或者sft如何进行选择?
The text was updated successfully, but these errors were encountered: