-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] rs_op_selector does not allow to use mixed FP8 types for A and B #1268
Comments
I also noticed that the option of FP8 operand and FP16 accumulator is excluded in
Was this intentional or is this an oversight? |
hi @ex-rzr , we support mixed fp8 types in the RS op selector now: cutlass/include/cute/arch/mma_sm90.hpp Line 996 in acba5be
Is this what you wanted? Can we close the issue? |
@thakkarV cutlass/include/cute/arch/mma_sm90.hpp Line 857 in acba5be
|
sorry about that. I noticed the assert after replying earlier to the thread. This will get fixed with 3.4 tagging in the next few days. |
Fixed witb 3.4. Closing. |
Describe the bug
rs_op_selector
does not allow to use mixed FP8 types for A and B.There is a static assert that prevents this (it is placed correctly in
ss_op_selector
, each branch has its own check if needed).See https://github.com/NVIDIA/cutlass/blob/main/include/cute/arch/mma_sm90.hpp#L857
Steps/Code to reproduce bug
Expected behavior
The same behavior as in
ss_op_selector
.The text was updated successfully, but these errors were encountered: