-
Notifications
You must be signed in to change notification settings - Fork 515
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Enable torch.compile compatibility for triton fp8 rowwise gemm (#2978)
Summary: Pull Request resolved: #2978 X-link: facebookresearch/FBGEMM#74 This diff adds custom op wrappers around `quantize_fp8_row` and `matmul_fp8_row`. This should make them opaque to torch.compile and prevent issues where dynamo tries to trace triton code that is meant to be precompiled. I also add registration for fake kernels so that torch.compile can properly pass faketensors through the ops. Reviewed By: henrylhtsang Differential Revision: D61216580 fbshipit-source-id: c96cba96775656213c27f3fe36a20325376ca082
- Loading branch information
1 parent
425d1ac
commit cdb290a
Showing
2 changed files
with
76 additions
and
7 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters