Mamba2 Codestral generation example fails to load mismatching state dict #32561
Comments
Thanks for the issue, taking a look!
You are using
It's copied directly from the docs on the site, which I suppose makes this a documentation error. I suspected it would be something this simple, but I was just doing a very quick test out of curiosity and didn't have time to dig into it immediately.
I thought I remembered checking that, actually, so I just took another look at my test notebook. It appears that Mamba2 isn't available on Colab without a
No worries, you're right, I'll update the docs right away! As for Colab, yes, it should be available soon though :)
System Info
Google Colab, transformers 4.42.4 (default Colab version) and 4.44.0 (after --upgrade)
transformers version: 4.42.4
Who can help?
@ArthurZucker @molbap
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
Copied directly from the documentation.
It first warns
You are using a model of type mamba2 to instantiate a model of type mamba. This is not supported for all configurations of models and can yield errors.
It then errors with
etc. for all layers.
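For anyone hitting the same warning: it suggests the checkpoint's config declares model type mamba2 while the class doing the loading expects mamba, which would also explain a state dict mismatch on every layer. Below is a minimal sketch of that comparison in plain Python. The helper name `check_model_type` and the trimmed-down config contents are hypothetical, made up for illustration; this is not how transformers performs the check internally.

```python
import json


def check_model_type(config_json: str, expected_model_type: str) -> bool:
    """Return True when the checkpoint's declared model_type matches the
    model type the loading class expects (hypothetical helper)."""
    declared = json.loads(config_json).get("model_type")
    return declared == expected_model_type


# Hypothetical, trimmed-down config.json contents for a Mamba2 checkpoint.
checkpoint_config = '{"model_type": "mamba2"}'

# A class built for "mamba" does not match a "mamba2" checkpoint.
print(check_model_type(checkpoint_config, "mamba"))   # False
# A class built for "mamba2" does.
print(check_model_type(checkpoint_config, "mamba2"))  # True
```

With transformers installed, the two values being compared would presumably come from `AutoConfig.from_pretrained(...).model_type` for the checkpoint and from the model class's `config_class.model_type`. Checking that they agree before calling `from_pretrained` should catch this kind of mismatch early.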
Expected behavior
It should work as documented.