[QST] How Does TMA Work in CUTLASS for Writing from Shared Memory to Global Memory? #2008

ziyuhuang123 · 2024-12-23T07:13:50Z

Could you explain how TMA works in CUTLASS? For example, when writing from the shared-memory tensor `sS` to the global-memory tensor `gD`, the data appears to be written sequentially, i.e., `sS[i]` maps directly to `gD[i]`. Is this the correct behavior?
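
To make the question concrete, here is a minimal host-side C++ sketch of the two readings I have in mind. This is not CUTLASS or TMA code: `copy_linear`, `copy_by_coord`, and the row stride `ld_g` are hypothetical names used only for illustration, and the tile is assumed to be row-major in shared memory.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (not a CUTLASS API): copy a rows x cols tile by linear
// index, so element i of sS lands at element i of gD.
void copy_linear(const std::vector<float>& sS, std::vector<float>& gD,
                 std::size_t rows, std::size_t cols) {
    for (std::size_t i = 0; i < rows * cols; ++i) {
        gD[i] = sS[i];
    }
}

// Hypothetical helper (not a CUTLASS API): copy by logical coordinate, so
// sS(r, c) lands at gD(r, c). The global tensor is assumed to have its own
// row stride ld_g, which may be larger than the tile width.
void copy_by_coord(const std::vector<float>& sS, std::vector<float>& gD,
                   std::size_t rows, std::size_t cols, std::size_t ld_g) {
    for (std::size_t r = 0; r < rows; ++r) {
        for (std::size_t c = 0; c < cols; ++c) {
            gD[r * ld_g + c] = sS[r * cols + c];
        }
    }
}
```

In this sketch the two copies produce the same result only when `ld_g == cols`, that is, when the destination tile is contiguous in global memory with the same layout as the shared-memory tile. My question is which of these two behaviors the TMA store corresponds to.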

