Pull requests: NVIDIA/Fuser

Pull requests list

Add missing handle for EmbeddingFwdOp
#4019 opened Mar 6, 2025 by naoyam
Indexing for TMem ld and st
#4017 opened Mar 6, 2025 by zasdfgbnm
Tensor memory 32x32b data path pattern matching
#4015 opened Mar 5, 2025 by zasdfgbnm
use torch.nn.functional.rms_norm
#4011 opened Mar 5, 2025 by liqiangxl
WIP
#4006 opened Mar 4, 2025 by liqiangxl Draft
update the default to 20
#4002 opened Mar 3, 2025 by crcrpar
Remove MmaOp::AxisMapping
#3995 opened Feb 28, 2025 by jacobhinkle Draft
redo register sharing PR-3972
#3993 opened Feb 28, 2025 by liqiangxl Draft
add packed warp reduction
#3959 opened Feb 25, 2025 by liqiangxl Draft
Debug zip
#3939 opened Feb 21, 2025 by zasdfgbnm
Multidimensional mesh
#3937 opened Feb 21, 2025 by cowanmeg
Expose backend type to python
#3928 opened Feb 20, 2025 by samnordmann
avoid ublk tma out bound access
#3917 opened Feb 18, 2025 by liqiangxl Draft