Revert "Optimize fp8
linalg_ext.attention
by rework Q@K scaling"
#30
Job | Run time |
---|---|
8s | |
8m 54s | |
9m 2s |
fp8
linalg_ext.attention
by rework Q@K scaling"
#30
Job | Run time |
---|---|
8s | |
8m 54s | |
9m 2s |