Support for Mistral Nemo #1985
Comments
@byshiue Looking forward to any progress |
Hello @byshiue, it seems the Mistral 7B model is already supported (see TensorRT-LLM/examples/llama/README.md, line 1072 in 5ddb6bf).
If the model architecture is the same, does that mean we can use the existing scripts and code for Mistral-Nemo as well? We would be happy to try it out with the existing scripts. Please let us know. cc: @AdamzNV @ncomly-nvidia as well. |
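One way to test the "same architecture" assumption before reusing the Mistral scripts is to diff the `config.json` files of the two checkpoints. The following is only a minimal sketch: `load_config` and `diff_configs` are hypothetical helpers, not part of TensorRT-LLM, and the two dictionaries hold made-up toy values rather than the real Mistral 7B or Mistral Nemo settings.

```python
import json
from pathlib import Path

_MISSING = "<missing>"  # placeholder for a key present in only one config


def load_config(path):
    """Read a Hugging Face style config.json into a dict (hypothetical helper)."""
    return json.loads(Path(path).read_text())


def diff_configs(base, other):
    """Return {key: (base_value, other_value)} for every key whose value differs."""
    keys = sorted(set(base) | set(other))
    return {
        key: (base.get(key, _MISSING), other.get(key, _MISSING))
        for key in keys
        if base.get(key, _MISSING) != other.get(key, _MISSING)
    }


# Toy values for illustration only; in practice, pass the dicts returned by
# load_config("<checkpoint dir>/config.json") for each model.
config_a = {
    "architectures": ["MistralForCausalLM"],
    "hidden_size": 64,
    "num_attention_heads": 8,
}
config_b = {
    "architectures": ["MistralForCausalLM"],
    "hidden_size": 64,
    "num_attention_heads": 8,
    "head_dim": 4,
}

differences = diff_configs(config_a, config_b)
for key, (a_value, b_value) in differences.items():
    print(f"{key}: {a_value!r} -> {b_value!r}")
```

A matching `architectures` entry does not by itself guarantee that a conversion script will work: any field that appears in the diff (for example an explicitly set head dimension) is something the script may need to handle.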
@byshiue @AdamzNV @ncomly-nvidia Can you help solve this problem? Yesterday I tried to use the Mistral method directly to convert and build the Mistral Nemo 12B engine, but an error occurred during the conversion phase. I used the SmoothQuant conversion method. The conversion script and error log follow. CC: @hongjunchoi92 Convert script: Error log: |
Hello everyone! Same issue here. Any news about the integration of this model? The logs are the following (
|
@nv-guomingz Could you please take a look? Thanks |
Hi @eleapttn, we've fixed this issue internally, and the corresponding fix will be pushed to the main branch in the coming weekly update. |
Hi @QiJune, @nv-guomingz, |
This is working in 0.12. Good job! |
As more and more new models enter the market, we have prepared comprehensive instructions for TRT-LLM developers on adapting to new models of interest. We encourage our community developers to expand the range of supported models, fostering an open ecosystem with rapid iterations. Please try following these instructions and let us know if you encounter any issues during the adaptation process. We greatly appreciate your dedication. |
https://mistral.ai/news/mistral-nemo/
Will Mistral Nemo models be supported in TensorRT-LLM in the near future?