[bug] Medusa example fails with vicuna 33B #2478
Comments
@SoundProvider, could you also show the command to convert the checkpoint?
Hi @SoundProvider, I just tried building the Medusa engine with the Vicuna-33B model at TP=1 and TP=4 using the TRT-LLM 0.15 release. Since the error is related to pickle, it seems your converted checkpoint config is outdated. Could you please convert the checkpoint again and then rebuild? If you still run into the same issue, can you share which version of TRT-LLM you are using?
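For anyone answering the version question above, here is a minimal sketch that reports the installed version from Python package metadata. It assumes the pip distribution is named tensorrt_llm; the helper names installed_version and version_report are made up for illustration and are not part of TRT-LLM.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


def version_report(dist_name: str = "tensorrt_llm") -> str:
    """Build a one-line report suitable for pasting into an issue."""
    # Assumption: "tensorrt_llm" is the pip distribution name in this environment.
    found = installed_version(dist_name)
    if found is None:
        return f"{dist_name}: not installed in this environment"
    return f"{dist_name}: {found}"


if __name__ == "__main__":
    print(version_report())
```

Run it inside the same environment (or container) that runs the checkpoint conversion and trtllm-build, so that the reported version matches the one that produced the checkpoint.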
Hello @rakib-hasan |
Thank you for developing TRT-LLM. It's helping me a lot.
I'm trying to use Medusa with TRT-LLM, following this page.
It works fine with Vicuna 7B and its Medusa heads, with no errors at all.
However, with Vicuna 33B and its trained heads, the following error occurs when executing trtllm-build.
Converting the checkpoint with Medusa completed with the following result: