Revert "[Codegen][GPU] Add range information to GPU dispatch IDs" #19361

krzysz00 · 2024-12-03T21:37:38Z

Potential regression and I'm not in today to debug it, reverting

)" This reverts commit 0f21989.

…ee-org#19361) Reverts iree-org#17707 Potential regression and I'm not in today to debug it, reverting Signed-off-by: Giacomo Serafini <[email protected]>

…ree-org#19361) This reverts commit cb5be1d. --- First, this patch implements InferIntRangeInterface for hal.interface.workgroup.{size,id,count} using a local upper_bound attribute. Then, it adds a -iree-codegen-gpu-propagate-dispatch-size-bounds pass that adds these upper_bounds identifiers to the interface.workgroup operations and to gpu.thread_id based on static information available late in the codegen pipeline. Then, it uses -optimize-int-arithmetic to optimize indexing after -lower-affine, getting rid of a bunch of "if the input's negative" logic that isn't actually needed in many of our kernels. It also ensures that these upper_bound values propagate to LLVM.

…ree-org#19361) This reverts commit cb5be1d. Compaled to the previous revision, this one works around a correctness bug in dataflow analysis that's being fixed by removing the analysis after SCF->CF. --- First, this patch implements InferIntRangeInterface for hal.interface.workgroup.{size,id,count} using a local upper_bound attribute. Then, it adds a -iree-codegen-gpu-propagate-dispatch-size-bounds pass that adds these upper_bounds identifiers to the interface.workgroup operations and to gpu.thread_id based on static information available late in the codegen pipeline. Then, it uses -optimize-int-arithmetic to optimize indexing after -lower-affine, getting rid of a bunch of "if the input's negative" logic that isn't actually needed in many of our kernels. It also ensures that these upper_bound values propagate to LLVM.

…19361) (#19372) This reverts commit cb5be1d. Compaled to the previous revision, this one works around a correctness bug in dataflow analysis that's being fixed by removing the analysis after SCF->CF. --- First, this patch implements InferIntRangeInterface for hal.interface.workgroup.{size,id,count} using a local upper_bound attribute. Then, it adds a -iree-codegen-gpu-propagate-dispatch-size-bounds pass that adds these upper_bounds identifiers to the interface.workgroup operations and to gpu.thread_id based on static information available late in the codegen pipeline. Then, it uses -optimize-int-arithmetic to optimize indexing after -lower-affine, getting rid of a bunch of "if the input's negative" logic that isn't actually needed in many of our kernels. It also ensures that these upper_bound values propagate to LLVM.

Revert "[Codegen][GPU] Add range information to GPU dispatch IDs (#17707

aad2779

)" This reverts commit 0f21989.

krzysz00 requested review from antiagainst, MaheshRavishankar, kuhar, qedawkins, Groverkss and benvanik as code owners December 3, 2024 21:37

kuhar approved these changes Dec 3, 2024

View reviewed changes

kuhar enabled auto-merge (squash) December 3, 2024 21:56

kuhar merged commit cb5be1d into main Dec 3, 2024
40 of 41 checks passed

kuhar deleted the revert-17707-annotate-gpu-ranges branch December 3, 2024 21:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert "[Codegen][GPU] Add range information to GPU dispatch IDs" #19361

Revert "[Codegen][GPU] Add range information to GPU dispatch IDs" #19361

krzysz00 commented Dec 3, 2024

Revert "[Codegen][GPU] Add range information to GPU dispatch IDs" #19361

Revert "[Codegen][GPU] Add range information to GPU dispatch IDs" #19361

Conversation

krzysz00 commented Dec 3, 2024