persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k #300
Annotations
1 warning
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
Loading