-
Notifications
You must be signed in to change notification settings - Fork 641
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[GPUHeuristic] Modify schedule generator to consider distribution of tranfer_read layout anchor #17636
Merged
raikonenfnu
merged 7 commits into
iree-org:main
from
raikonenfnu:constraintTransferRead
Jun 12, 2024
Merged
[GPUHeuristic] Modify schedule generator to consider distribution of tranfer_read layout anchor #17636
Changes from 6 commits
Commits
Show all changes
7 commits
Select commit
Hold shift + click to select a range
b34d72f
Modify schedule generator to take into account distribution of tranfe…
raikonenfnu a762d17
Clarify magic number
raikonenfnu 0d83334
Generalize distributable/good transfer_read layout.
raikonenfnu a6a65fd
Clean up todo notes that is done.
raikonenfnu b606ec7
Simplify transpose detector
raikonenfnu f4125a6
fix minor bug
raikonenfnu 1d6ba84
NIT affinemap
raikonenfnu File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -913,13 +913,6 @@ LogicalResult setCooperativeMatrixConfig( | |
int64_t sharedMemoryLimitInBytes = | ||
targetEnv.getResourceLimits().getMaxComputeSharedMemorySize(); | ||
|
||
FailureOr<GPUMMASchedule> schedule = | ||
deduceMMASchedule(problem, intrinsics, seeds, sharedMemoryLimitInBytes); | ||
if (failed(schedule)) | ||
return failure(); | ||
|
||
auto pipeline = CodeGenPipeline::SPIRVCooperativeMatrixVectorize; | ||
|
||
std::optional<int64_t> subgroupSize = limits.getSubgroupSize(); | ||
// AMD RDNA architectures supports both wave32 and wave64 modes. Prefer to use | ||
// wave32 mode for better performance. | ||
|
@@ -928,6 +921,23 @@ LogicalResult setCooperativeMatrixConfig( | |
subgroupSize = *minSize; | ||
} | ||
|
||
// Infer if lhs or rhs is transposed to help generate better schedule. | ||
auto maps = op.getIndexingMapsArray(); | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. also here |
||
bool transposedLhs = | ||
kIndex != | ||
llvm::cast<AffineDimExpr>(maps[0].getResults().back()).getPosition(); | ||
bool transposedRhs = | ||
nIndex != | ||
llvm::cast<AffineDimExpr>(maps[1].getResults().back()).getPosition(); | ||
|
||
FailureOr<GPUMMASchedule> schedule = | ||
deduceMMASchedule(problem, intrinsics, seeds, sharedMemoryLimitInBytes, | ||
*subgroupSize, transposedLhs, transposedRhs); | ||
if (failed(schedule)) | ||
return failure(); | ||
|
||
auto pipeline = CodeGenPipeline::SPIRVCooperativeMatrixVectorize; | ||
|
||
std::array<int64_t, 3> workgroupSize{schedule->nWarpCount * *subgroupSize, | ||
schedule->mWarpCount, 1}; | ||
|
||
|
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks2 done :)