-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why triton convert to float8e5 will cause local memory read/write #4769
Comments
can you provide a simple kernel example? |
Likely a register spilling problem |
Check your register usage of this kernel |
this PR might fix it. Can you try: |
Thanks Thomas, I will try it :D |
this commit still has local memory write |
It has to do with |
IMO it's probably caused by nvptx doesn't handle 8-bit vector type well. Let me trigger a discussion and get back to you |
Update: @ThomasRaoux has a workaround now, and we will probably land his code after he is back from vacation. |
Thanks! |
Hi keren, has this code been updated on main branch ? |
Probably yes? I'm not 100% sure |
It isn't fixed yet, but will be fixed by llvm/llvm-project#113928 |
Good to know. Thanks! |
Fixes triton-lang#4769 (cherry picked from commit f484cb8)
Fixes triton-lang#4769 (cherry picked from commit f484cb8)
Fixes triton-lang#4769 (cherry picked from commit f484cb8)
Fixes triton-lang#4769 (cherry picked from commit f484cb8)
Fixes triton-lang#4769 (cherry picked from commit f484cb8)
I just write a kernel and it contains a
x.to(tl.float8e5)
, in ncu I found it cause local memory read/storeThe text was updated successfully, but these errors were encountered: