[BUG]: Performance degradation after updating to Mojo v 0.4.0 #1030
Comments
See if this PR helps: tairov/llama2.mojo#46. For me, on a macOS M2 Max, it more than doubled performance; it might have similar results on Linux.
Thanks @jackos, I merged your PR, though I decided to keep
No worries, thanks @tairov. Investigating what's happening here; thanks for raising this.
I can also report performance degradation with the new changes to all my toy examples. Unfortunately, I lost my earlier 0.3.1 install in the update and cannot get back those numbers (#1032 would be nice). Even earlier the vanilla
Did we ever get to the bottom of this?
@Mogball Building and testing older versions on my M2 max shows improved performance with global runtime:
There were other changes made to
EDIT: Might be specific to Linux, testing now
Resolved in next release
Bug description
After updating to Mojo version 0.4.0, we adapted the llama2.mojo code to the latest changes, primarily those related to the global Runtime. The master branch of llama2.mojo is now compatible with Mojo 0.4.0.
But with this upgrade we noticed some performance degradation on smaller baby llama models like stories15M.bin:
v0.3.1 achieved 420 to 450 tok/s
v0.4.0 achieved 390 to 410 tok/s
Mojo v. 0.3.1
Mojo v. 0.4.0
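To put the tok/s ranges quoted in this description in perspective, here is a rough sketch of the relative slowdown they imply. Only the four range endpoints come from the report. Using midpoints as the "typical" value is an assumption made for illustration, and the helper names are hypothetical.

```python
# Throughput ranges (tok/s) as reported in this issue for stories15M.bin.
REPORTED = {
    "0.3.1": (420.0, 450.0),
    "0.4.0": (390.0, 410.0),
}


def midpoint(bounds):
    """Middle of a (low, high) range; an assumed stand-in for a typical run."""
    low, high = bounds
    return (low + high) / 2.0


def relative_drop(old, new):
    """Fractional slowdown going from `old` to `new` throughput."""
    return (old - new) / old


old_low, old_high = REPORTED["0.3.1"]
new_low, new_high = REPORTED["0.4.0"]

# Smallest drop: slowest 0.3.1 run versus fastest 0.4.0 run.
best_case = relative_drop(old_low, new_high)
# Largest drop: fastest 0.3.1 run versus slowest 0.4.0 run.
worst_case = relative_drop(old_high, new_low)
# Midpoint-to-midpoint comparison (assumption, see note above).
mid_case = relative_drop(midpoint(REPORTED["0.3.1"]), midpoint(REPORTED["0.4.0"]))

print(f"midpoint drop: {mid_case:.1%}")
print(f"range: {best_case:.1%} to {worst_case:.1%}")
```

Under these assumptions the regression comes out at roughly 8% midpoint to midpoint, and anywhere from about 2% to about 13% depending on which runs are compared.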
Steps to reproduce
System information