Add cutlass 3xTF32, DMMA based L2/cosine distance kernels for SM 8.0 or higher #939

Merged
rapids-bot[bot] merged 28 commits into rapidsai:branch-22.12 from mdoijade:cutlass_dist_kernels on Nov 16, 2022

Commits