Skip to content

Actions: NVIDIA/cutlass

Auto Assign New Issues to Triage Project

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
587 workflow runs
587 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[QST] why have Int<2>{} in coalesce_x function when last shape value equal to constant one.
Auto Assign New Issues to Triage Project #963: Issue #2023 opened by Shan19900305
January 5, 2025 16:53 12s
January 5, 2025 16:53 12s
[QST] why the implementation of f16xs8 mixed gemm is different between TRT-LLM and native cutlass mixed gemm example?
Auto Assign New Issues to Triage Project #962: Issue #2022 opened by danielhua23
January 5, 2025 13:16 10s
January 5, 2025 13:16 10s
[QST] The code location where the shared memory write by swizzled layout occurs in cutlass 2.x?
Auto Assign New Issues to Triage Project #961: Issue #2019 opened by danielhua23
January 1, 2025 03:08 15s
January 1, 2025 03:08 15s
[QST] Why it won't OOB in tiled_copy pipeline
Auto Assign New Issues to Triage Project #960: Issue #2018 opened by ZhZhang711
December 31, 2024 09:09 14s
December 31, 2024 09:09 14s
[BUG] Memory corruption/undefined behavior on GemmUniversal in 3.4.0 - 3.6.0 🐛
Auto Assign New Issues to Triage Project #959: Issue #2017 opened by warpuv
December 28, 2024 17:47 11s
December 28, 2024 17:47 11s
[QST] Where is CuTe ValLayout in TiledMMA as of CUTLASS 3.4.0
Auto Assign New Issues to Triage Project #958: Issue #2016 opened by ZhZhang711
December 27, 2024 10:55 15s
December 27, 2024 10:55 15s
[QST]Why Does CUTLASS Use 3-4-3 Swizzle?
Auto Assign New Issues to Triage Project #957: Issue #2015 opened by ziyuhuang123
December 27, 2024 04:02 11s
December 27, 2024 04:02 11s
[BUG] Precision issue with python cutlass gemm
Auto Assign New Issues to Triage Project #956: Issue #2014 opened by MinghaoYan
December 26, 2024 19:03 17s
December 26, 2024 19:03 17s
[BUG] Where is 3.6.0 release?
Auto Assign New Issues to Triage Project #955: Issue #2012 opened by ankutalev
December 25, 2024 10:00 13s
December 25, 2024 10:00 13s
[QST] why kElementsPerAccess > 1 is not permanent in default_mma_sm80_core.h
Auto Assign New Issues to Triage Project #954: Issue #2011 opened by danielhua23
December 23, 2024 12:33 12s
December 23, 2024 12:33 12s
[BUG] [QST] Regression - why Sm90RowBroadcast in 3.5.1 stops support smem usage?
Auto Assign New Issues to Triage Project #953: Issue #2010 opened by ankutalev
December 23, 2024 10:11 14s
December 23, 2024 10:11 14s
[BUG] Removal of OpMultiplyAdd template substitutions from mma_sm80.h in 3.5.1
Auto Assign New Issues to Triage Project #952: Issue #2009 opened by ankutalev
December 23, 2024 10:03 16s
December 23, 2024 10:03 16s
[QST]How Does TMA Work in CUTLASS for Writing from Shared Memory to Global Memory?
Auto Assign New Issues to Triage Project #951: Issue #2008 opened by ziyuhuang123
December 23, 2024 07:13 13s
December 23, 2024 07:13 13s
[QST] How to Let __launch_bounds__ and setmaxnreg Work with Each Other?
Auto Assign New Issues to Triage Project #950: Issue #2007 opened by Maximilianxu
December 23, 2024 03:12 10s
December 23, 2024 03:12 10s
[BUG] wmma should be enabled w/ clang.
Auto Assign New Issues to Triage Project #949: Issue #2006 opened by Artem-B
December 20, 2024 19:03 15s
December 20, 2024 19:03 15s
[QST] Significant GFLOPs variations due to different input initialization behaviours
Auto Assign New Issues to Triage Project #948: Issue #2004 opened by yuukidach
December 20, 2024 17:22 11s
December 20, 2024 17:22 11s
[BUG] Unaligned access in test/unit/gemm/threadblock/batched_gemv.cu
Auto Assign New Issues to Triage Project #947: Issue #2003 opened by Artem-B
December 19, 2024 22:40 11s
December 19, 2024 22:40 11s
[QST]Behavior of TMA Store and Wait Mechanism in CUTLASS
Auto Assign New Issues to Triage Project #946: Issue #2002 opened by ziyuhuang123
December 19, 2024 14:20 11s
December 19, 2024 14:20 11s
[QST] When to use MainloopSm90TmaGmmaWarpSpecializedFP8?
Auto Assign New Issues to Triage Project #945: Issue #2001 opened by ginowu
December 19, 2024 08:46 14s
December 19, 2024 08:46 14s
[Proposal] layout deduction ambiguity of Nested Layout Access Problem
Auto Assign New Issues to Triage Project #944: Issue #2000 opened by yiakwy-xpu-ml-framework-team
December 18, 2024 15:21 12s
December 18, 2024 15:21 12s
[QST]Is the Key Difference Between mbarrier and barrier Their Handling of Producer-Consumer Count?
Auto Assign New Issues to Triage Project #943: Issue #1999 opened by ziyuhuang123
December 18, 2024 11:20 11s
December 18, 2024 11:20 11s
[QST]How to Handle Synchronization with Different Thread Counts for Producer and Consumer in CUTLASS?
Auto Assign New Issues to Triage Project #942: Issue #1998 opened by ziyuhuang123
December 18, 2024 11:03 13s
December 18, 2024 11:03 13s
[BUG] calling cast_smem_ptr_to_uint(device fn) from make_gmma_desc(host device fn) is not allowed
Auto Assign New Issues to Triage Project #941: Issue #1997 opened by lygztq
December 18, 2024 09:36 12s
December 18, 2024 09:36 12s
[QST] Gemm got 'incomplete type is not allowed' when use Sm90
Auto Assign New Issues to Triage Project #940: Issue #1996 opened by TopIdiot
December 18, 2024 08:43 11s
December 18, 2024 08:43 11s
[QST]How to Use and Evaluate Prefetch in CUTLASS?
Auto Assign New Issues to Triage Project #939: Issue #1995 opened by ziyuhuang123
December 18, 2024 04:14 14s
December 18, 2024 04:14 14s