Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TrainableModule enhancement #178

Open
wzh1994 opened this issue Aug 29, 2024 · 0 comments
Open

TrainableModule enhancement #178

wzh1994 opened this issue Aug 29, 2024 · 0 comments
Labels
Milestone

Comments

@wzh1994
Copy link
Contributor

wzh1994 commented Aug 29, 2024

  1. 构造TrainableModule时不下载模型,微调和推理时再下载模型;异步下载;一个模型不应该被多个模块重复下载
  2. 本地模型给长链接,只要名字符合pattern, 就可以找到对应的特殊token和system prompt
  3. 本地保存的模型checkpoint可以以某种方式注册其类别,或直接注册特殊token和system prompt
@wzh1994 wzh1994 added the v0.3 label Aug 29, 2024
@wzh1994 wzh1994 added this to the LazyLLM v0.3 milestone Aug 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant