You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I also noticed that the option of FP8 operand and FP16 accumulator is excluded in ss_op_selector and rs_op_selector, even though this is accommodated by the PTX ISA and otherwise has MMA_Traits structs defined for it (F16E4M3E4M3 etc.). This is done by the following code:
#1646
Closed
ylz1104 opened this issue
Jul 22, 2024
· 1 comment
I also noticed that the option of FP8 operand and FP16 accumulator is excluded in `ss_op_selector` and `rs_op_selector`, even though this is accommodated by the PTX ISA and otherwise has `MMA_Traits` structs defined for it (F16E4M3E4M3 etc.). This is done by the following code:
// FP16 accumulator
if constexpr (is_same_v<ElementC, half_t>) {
static_assert(is_same_v<ElementA, half_t>, "Element types for AB must be half if ElementC is half.");
static_assert(is_same_v<ElementB, half_t>, "Element types for AB must be half if ElementC is half.");
Was this intentional or is this an oversight?
Originally posted by @jayhshah in #1268 (comment)
The text was updated successfully, but these errors were encountered: