Reapply "[Codegen][GPU] Add range information to GPU dispatch IDs" (#19361) #19372

krzysz00 · 2024-12-04T17:15:24Z

This reverts commit cb5be1d.

Compaled to the previous revision, this one works around a correctness
bug in dataflow analysis that's being fixed by removing the analysis
after SCF->CF.

First, this patch implements InferIntRangeInterface for hal.interface.workgroup.{size,id,count} using a local upper_bound attribute.

Then, it adds a -iree-codegen-gpu-propagate-dispatch-size-bounds pass that adds these upper_bounds identifiers to the interface.workgroup operations and to gpu.thread_id based on static information available late in the codegen pipeline.

Then, it uses -optimize-int-arithmetic to optimize indexing after -lower-affine, getting rid of a bunch of "if the input's negative" logic that isn't actually needed in many of our kernels.

It also ensures that these upper_bound values propagate to LLVM.

benvanik · 2024-12-12T01:11:29Z

compiler/src/iree/compiler/Dialect/HAL/IR/HALOps.td

-def HAL_InterfaceWorkgroupIDOp : HAL_PureOp<"interface.workgroup.id", [
-  DeclareOpInterfaceMethods<OpAsmOpInterface, ["getAsmResultNames"]>,
-]> {
+class HAL_InterfaceWorkgroupOp<string mnemonic, list<Trait> traits = []>


nice cleanup!

benvanik

LGTM for HAL changes! may want @MaheshRavishankar or someone to take a peek at the codegen side.

qedawkins

IIRC this got reverted for a regression. If the regression is fixed LGTM!

kuhar

Could explain what fixed the regression that caused the initial revert? It would be good to include this in the PR description.

…ree-org#19361) This reverts commit cb5be1d. Compaled to the previous revision, this one works around a correctness bug in dataflow analysis that's being fixed by removing the analysis after SCF->CF. --- First, this patch implements InferIntRangeInterface for hal.interface.workgroup.{size,id,count} using a local upper_bound attribute. Then, it adds a -iree-codegen-gpu-propagate-dispatch-size-bounds pass that adds these upper_bounds identifiers to the interface.workgroup operations and to gpu.thread_id based on static information available late in the codegen pipeline. Then, it uses -optimize-int-arithmetic to optimize indexing after -lower-affine, getting rid of a bunch of "if the input's negative" logic that isn't actually needed in many of our kernels. It also ensures that these upper_bound values propagate to LLVM.

krzysz00 force-pushed the gpu-id-ranges-2 branch from a6ba1ba to 780b2ab Compare December 10, 2024 22:15

krzysz00 marked this pull request as ready for review December 11, 2024 23:49

krzysz00 requested review from antiagainst, MaheshRavishankar, kuhar, qedawkins, Groverkss and benvanik as code owners December 11, 2024 23:49

krzysz00 mentioned this pull request Dec 12, 2024

[Codegen][GPU] Let integer range optimization narrow GPU computations to i32 #19473

Draft

benvanik reviewed Dec 12, 2024

View reviewed changes

benvanik approved these changes Dec 12, 2024

View reviewed changes

qedawkins approved these changes Dec 12, 2024

View reviewed changes

krzysz00 force-pushed the gpu-id-ranges-2 branch from 780b2ab to dbba7fa Compare December 12, 2024 16:59

kuhar reviewed Dec 12, 2024

View reviewed changes

krzysz00 force-pushed the gpu-id-ranges-2 branch from dbba7fa to 7edb543 Compare December 12, 2024 17:12

krzysz00 merged commit 63cdc7d into iree-org:main Dec 13, 2024
39 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reapply "[Codegen][GPU] Add range information to GPU dispatch IDs" (#19361) #19372

Reapply "[Codegen][GPU] Add range information to GPU dispatch IDs" (#19361) #19372

krzysz00 commented Dec 4, 2024 •

edited

Loading

benvanik Dec 12, 2024

benvanik left a comment

qedawkins left a comment

kuhar left a comment

Reapply "[Codegen][GPU] Add range information to GPU dispatch IDs" (#19361) #19372

Reapply "[Codegen][GPU] Add range information to GPU dispatch IDs" (#19361) #19372

Conversation

krzysz00 commented Dec 4, 2024 • edited Loading

benvanik Dec 12, 2024

Choose a reason for hiding this comment

benvanik left a comment

Choose a reason for hiding this comment

qedawkins left a comment

Choose a reason for hiding this comment

kuhar left a comment

Choose a reason for hiding this comment

krzysz00 commented Dec 4, 2024 •

edited

Loading