
[custom op] Generalize shape library logic to work with dtypes #1594

Merged
merged 10 commits into llvm:main on Dec 13, 2022

Conversation

ramiro050 (Collaborator)

This commit generalizes the shape library logic so that dtype rules
for ops can also be expressed using the same mechanism. In other
words, each op can now have a shape function and a dtype function
specified in Python that are imported during lowering to calculate the
shapes and dtypes throughout a program. For more information about how
to specify a dtype function, see the updated
docs/adding_a_shape_and_dtype_function.md.

For those not familiar with how the shape library works, the file
docs/calculations_lib.md provides an overview.
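
As a rough illustration (the function names, signatures, and style below are assumptions for this sketch, not copied from the patch), a shape function maps argument shapes to result shapes, and a dtype function maps argument ranks and dtypes to a result dtype:

```python
from typing import List

# Hypothetical shape function for an elementwise op: the result shape
# equals the input shape.
def aten_tanh_shape(self: List[int]) -> List[int]:
    return self

# Hypothetical dtype function for the same op: a single-operand
# elementwise op simply propagates its input dtype (dtypes are passed
# around as integer codes).
def aten_tanh_dtype(self_rank: int, self_dtype: int) -> int:
    return self_dtype
```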

To make reviewing a bit easier, I suggest the following review
order:

  1. Get familiar with the overall architecture by reading
    docs/calculations_lib.md

  2. New op declarations

    • include/torch-mlir/Dialect/Torch/IR/TorchOps.td
    • lib/Dialect/Torch/IR/TorchOps.cpp
  3. New passes

    • include/torch-mlir/Dialect/Torch/Transforms/Passes.td
    • include/torch-mlir/Dialect/Torch/Transforms/Passes.h
    • lib/Dialect/Torch/Transforms/Passes.cpp
    • lib/Dialect/Torch/Transforms/ReifyDtypeCalculations.cpp
    • lib/Dialect/Torch/Transforms/SimplifyDtypeCalculations.cpp
    • lib/Dialect/Torch/Transforms/ReifyCalculationsUtils.cpp
    • lib/Dialect/Torch/Transforms/ReifyCalculationsUtils.h
    • lib/Dialect/Torch/Transforms/SimplifyCalculationsUtils.cpp
    • lib/Dialect/Torch/Transforms/SimplifyCalculationsUtils.h

    The *Utils.* files include logic that is shared by dtype and
    shape passes.

  4. Tests

    • test/Dialect/Torch/ops.mlir
    • test/Dialect/Torch/reify-dtype-calculations.mlir
    • test/Dialect/Torch/simplify-dtype-calculations.mlir
  5. Introduce torch_mlir_promote_dtypes (a sketch of how a dtype function might use such a helper follows this list)

    • python/torch_mlir/dialects/torch/importer/jit_ir/csrc/node_importer.cpp
    • python/torch_mlir/dialects/torch/importer/jit_ir/build_tools/library_generator.py
  6. Simple refactoring/generalizing

    • include/torch-mlir/Dialect/Torch/Utils/Utils.h
    • lib/Dialect/Torch/Utils/Utils.cpp
    • lib/Dialect/Torch/Transforms/DropCalculations.cpp
    • lib/Dialect/Torch/Transforms/RefineTypes.cpp
    • lib/Dialect/Torch/Transforms/ReifyShapeCalculations.cpp
    • lib/Dialect/Torch/Transforms/SimplifyShapeCalculations.cpp
    • python/torch_mlir/dialects/torch/importer/jit_ir/build_tools/registry.py
    • python/torch_mlir/dialects/torch/importer/jit_ir/build_tools/testing_framework.py
    • python/torch_mlir/dialects/torch/importer/jit_ir/build_tools/calculations_lib_gen.py
    • lib/Dialect/Torch/Transforms/CalculationsLibrary.cpp
  7. The rest of the files contain minor changes:

    • Replacing shape with calculations in names
    • Replacing m_TorchConstantIntList with
      m_TorchListOfConstantInts (needed to avoid ambiguity with the new
      pattern m_TorchListOfOptionalConstantInts)
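
For item 5, here is a self-contained toy of what a promotion helper lets a dtype function do. Everything below (names, signatures, and dtype codes) is an assumption for illustration; it is a simplified stand-in, not the actual torch_mlir_promote_dtypes implementation:

```python
from typing import List

# Illustrative integer dtype codes in the style of torch's ScalarType
# enum, chosen so that "wider" types compare greater (assumed values).
INT32, INT64, FLOAT32, FLOAT64 = 3, 4, 6, 7

def toy_promote_dtypes(ranks: List[int], dtypes: List[int]) -> int:
    # Simplified stand-in: return the widest dtype. PyTorch's real
    # promotion rules also weigh scalar operands (rank 0) differently
    # from tensor operands, which is why ranks are passed in at all.
    return max(dtypes)

# Hypothetical dtype function for a binary op, written against the toy
# helper above.
def aten_add_tensor_dtype(self_rank: int, self_dtype: int,
                          other_rank: int, other_dtype: int) -> int:
    return toy_promote_dtypes([self_rank, other_rank],
                              [self_dtype, other_dtype])
```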

ramiro050 requested a review from silvasean November 16, 2022 03:06
@silvasean (Contributor) left a comment

First round of comments. Overall looks good. Will probably have a few more nits after these are addressed but not much more than that.

Review threads (now resolved) were opened on:

  • docs/adding_a_shape_and_dtype_function.md
  • .github/workflows/RollPyTorch.yml
  • lib/Dialect/Torch/Transforms/ReifyCalculationsUtils.cpp
  • lib/Dialect/Torch/Transforms/ReifyDtypeCalculations.cpp
  • test/Dialect/Torch/simplify-dtype-calculations.mlir
@ramiro050 (Collaborator, Author)

@silvasean, I've addressed the comments in separate commits. Also, I had to make this change to get all tests to pass: a24b7c6. The commit message (pasted below) has the explanation:

This commit adds back the ops that use the new dtype refinement
pass. This is needed to avoid the catch-22 that results from ops in
the new dtype refinement pass needing dtype information from ops in
the RefineTypes pass, and ops in the RefineTypes pass needing dtype
information from ops in the new dtype refinement pass.

The reason this catch-22 problem is not handled by the iterative
application of passes when lowering to the backend contract is because
the DecomposeComplexOps pass is not currently designed to run more
than once, since every op with a decomposition gets marked illegal
after the pass is done. Marking ops as legal results in no
decomposition patterns being applied because the graph is already in a
legal state.

Adding back the ops to RefineTypes seems like the simplest solution to
this problem while the rest of the ops get relocated to use the new
dtype refinement pass.
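
As a loose sketch of the iteration being described (the names and control flow below are assumptions for illustration, not the actual C++ implementation):

```python
# Toy rendering of the iterative lowering to the backend contract.
def lower_to_backend_contract(module, passes, satisfies, max_iterations=10):
    """Repeatedly run the simplification pipeline until the module
    satisfies the backend contract, or give up."""
    for _ in range(max_iterations):
        for apply_pass in passes:
            module = apply_pass(module)
        if satisfies(module):
            return module
    raise RuntimeError("failed to reach the backend contract")

# The catch-22: if the decomposition pass marks every op that has a
# decomposition as illegal after its first run, iterations 2..N can no
# longer decompose ops whose dtypes were only refined after iteration 1,
# so the loop never converges for those ops.
```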

ramiro050 requested a review from silvasean November 20, 2022 22:15
@silvasean (Contributor)

> This commit adds back the ops that use the new dtype refinement
> pass. [...] Adding back the ops to RefineTypes seems like the simplest
> solution to this problem while the rest of the ops get relocated to
> use the new dtype refinement pass.

I think that before we land this, we should fix things so that the iterative lowering behaves as intended here (perhaps as easy as switching DecomposeComplexOps to use the greedy rewriter? and then perhaps moving the final legality check into satisfiesBackendContract somehow?). That seems like an independently useful improvement as well as the right thing to do here.

I would generally bias pretty heavily against keeping the old path in RefineTypes alive unless absolutely necessary -- when we migrate an op, we should delete the old support. That guarantees monotonic progress towards the new system by making the new system load-bearing. If the old system still works for these ops, it is easy to add support for ops in the new system while the e2e tests still end up using the old system for some reason, so we aren't even testing the new code -- and thus we accumulate buggy, untested code in the new system.

@ramiro050 (Collaborator, Author)

I've replaced two of the ops I was using for e2e testing this patch (AtenAddScalarOp and AtenAddTensorOp) with ops that are less commonly encountered in large workloads: 634dd9a 11ec64d. The new ops maintain the same level of test coverage as the previous ones did.

While running DecomposeComplexOps multiple times did fix the catch-22 I was encountering before, I would still have needed to increase the number of iterations run in lowerToBackendContractPass to get tests like Resnet18 and mobilenet to pass, since they use a lot of adds, which would noticeably affect the time it takes torch-mlir to lower to a particular backend.

The best approach for moving those common ops into the abstract_lib_gen.py file is to do a good chunk of them in a single go, which should be straightforward.

ramiro050 requested a review from silvasean December 13, 2022 00:05
@silvasean (Contributor) left a comment

Nice :) Let's do this!

ramiro050 merged commit a710237 into llvm:main Dec 13, 2022
ramiro050 deleted the custom-op-dtypes branch December 13, 2022 16:25
PriyaBSavithiri pushed a commit to PriyaBSavithiri/mcw-torch-mlir that referenced this pull request Dec 15, 2022
[custom op] Generalize shape library logic to work with dtypes (#1594)