Skip to content

persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k #300

persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k

persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k #300