-
Notifications
You must be signed in to change notification settings - Fork 515
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Passing TMA descriptors through grid constant (#3066)
Summary: X-link: facebookresearch/FBGEMM#163 Pull Request resolved: #3066 Improving the TMA kernel by passing the TMA descriptors through grid constant. Grid constant (D61692148) significantly reduces kernel invocation overhead. Also enables bias for the TMA kernel. Reviewed By: sfzhu93 Differential Revision: D61799463
- Loading branch information
1 parent
225ac16
commit 0b6a4aa
Showing
1 changed file
with
214 additions
and
83 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters