-
Notifications
You must be signed in to change notification settings - Fork 631
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Revert "Revert "[LLVMGPU][ROCm] Add MFMA_F32_16x16x4_F32 instruction"… #17921
Conversation
…iree-org#17894)" This reverts commit 02c2000.
836a821
to
2ee6d68
Compare
// TODO: Generalize to other dimensions. | ||
// Try to search for pad value and check only filter dimension is blocked. | ||
SmallVector<std::array<int64_t, 3>> mnkPaddingCandidates; | ||
for (const GPUMatmulShapeType &intrinsic : intrinsics) { | ||
|
||
if (inpElemType != intrinsic.aType && kernelElemType != intrinsic.bType) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this should be ||
no? because if either of this is not true then we want to skip.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, thanks for correcting.
if lhs_rhs_type.value.upper() not in schedule.intrinsic: | ||
# Search for the lhs_rhs type in the first part of intrinsic | ||
# e.g., MFMA_F32_16x16x4_F32 -> MFMA_F32 | ||
if lhs_rhs_type.value.upper() not in schedule.intrinsic[:8]: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would this work with MFMA_F8E4M3FNUZ
intrinsic?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The current logic won't. I've updated the script to find the right pattern. PTAL.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
iree-org#17921) … (iree-org#17894)" This reverts commit 02c2000. Signed-off-by: Lubo Litchev <[email protected]>
… (#17894)"
This reverts commit 02c2000.