You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
VectorDistribute does this after vectorization here with the LLVMGPUCastTypeToFitMMA pass.
For TileAndFuse I think ideally we would want to do something similar when we make the multi_mma op which I did in this branch
However, doing it early like this introduced two issues that I noticed at the FuseAndHoist pass
The extf leads to a materialization of empty+fill+extf which should be just an empty
The truncf doesnt seem to be tiled along with the multi_mma
Here is a dump
The end result is that it failed to bufferize.
The text was updated successfully, but these errors were encountered:
The TileandFuse Matmul config heursictics has this code snippet
However, it cannot actually upcast becuase we will fail to make a multi-mma op here
https://github.com/iree-org/iree/blob/main/compiler/src/iree/compiler/Codegen/Dialect/GPU/Transforms/Transforms.cpp#L446-L447
as we dont have upcast + multi_mma + down cast ability.
VectorDistribute does this after vectorization here with the
LLVMGPUCastTypeToFitMMA
pass.For TileAndFuse I think ideally we would want to do something similar when we make the multi_mma op which I did in this branch
However, doing it early like this introduced two issues that I noticed at the FuseAndHoist pass
Here is a dump
The end result is that it failed to bufferize.
The text was updated successfully, but these errors were encountered: