Skip to content

persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k #300

persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k

persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k #300

Annotations

2 errors

Integration-Tests-AMD (self-hosted, gfx942)

cancelled Dec 13, 2024 in 2m 37s