
Add dtype functions for floating point ops #1813

Merged: 1 commit merged into llvm:dtype-functions-staging on Jan 20, 2023

Conversation

li-plus (Collaborator) commented on Jan 19, 2023

This PR is part of #1807. It adds dtype functions for floating point ops that always return a tensor of dtype float32, except when the input dtype is float64, float16, or bfloat16.
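For reference, here is a minimal eager-PyTorch sketch of the dtype rule these functions encode, assuming the usual float-only-op behavior in which float64, float16, and bfloat16 inputs keep their dtype; torch.sqrt is used only as a stand-in for one such op:

import torch

def fp_only_result_dtype(input_dtype: torch.dtype) -> torch.dtype:
    # float64, float16, and bfloat16 inputs keep their dtype; every other
    # input dtype (ints, bool, float32) produces a float32 result.
    if input_dtype in (torch.float64, torch.float16, torch.bfloat16):
        return input_dtype
    return torch.float32

for dtype in (torch.bool, torch.int64, torch.float32, torch.float64, torch.bfloat16):
    x = torch.ones(2, dtype=dtype)
    assert torch.sqrt(x).dtype == fp_only_result_dtype(dtype)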

@ramiro050 would you review this PR?

Update: the CI is crashing at RandnDtypeDeviceModule_basic in the torchdynamo config, probably caused by the latest PyTorch, as mentioned in #1792. I think we should rebase dtype-functions-staging onto the main branch to make the CI pass.

Update: rebasing on main still doesn't help. I have temporarily disabled this test in the torchdynamo config by passing --crashing_tests_to_not_attempt_to_run_and_a_bug_is_filed RandnDtypeDeviceModule_basic. The test case decomposes into the series of ops shown below; dtype functions for some of these ops still live in RefineTypes while others are already in the abstract interp library, and the interleaving breaks compilation. I will re-enable the test once the migration is finished.

module attributes {torch.debug_module_name = "_lambda"} {
  func.func @forward() -> !torch.vtensor<[],f64> {
    %false = torch.constant.bool false
    %none = torch.constant.none
    %true = torch.constant.bool true
    %int2097152 = torch.constant.int 2097152
    %int4 = torch.constant.int 4
    %int512 = torch.constant.int 512
    %int1024 = torch.constant.int 1024
    %float1.000000e00 = torch.constant.float 1.000000e+00
    %int2097151 = torch.constant.int 2097151
    %int1 = torch.constant.int 1
    %float0.000000e00 = torch.constant.float 0.000000e+00
    %float-2.000000e00 = torch.constant.float -2.000000e+00
    %float6.283180e00 = torch.constant.float 6.283180e+00
    %int2 = torch.constant.int 2
    %int0 = torch.constant.int 0
    %cpu = torch.constant.device "cpu"
    %0 = torch.prim.ListConstruct %int4, %int512, %int1024 : (!torch.int, !torch.int, !torch.int) -> !torch.list<int>
    %1 = torch.aten.empty.memory_format %0, %none, %none, %cpu, %none, %none : !torch.list<int>, !torch.none, !torch.none, !torch.Device, !torch.none, !torch.none -> !torch.vtensor<[4,512,1024],f64>
    %2 = torch.aten.empty.memory_format %0, %none, %none, %cpu, %none, %none : !torch.list<int>, !torch.none, !torch.none, !torch.Device, !torch.none, !torch.none -> !torch.vtensor<[4,512,1024],f64>
    %3 = torch.aten.uniform %1, %float0.000000e00, %float1.000000e00, %none : !torch.vtensor<[4,512,1024],f64>, !torch.float, !torch.float, !torch.none -> !torch.vtensor<[4,512,1024],f64>
    %4 = torch.aten.uniform %2, %float0.000000e00, %float1.000000e00, %none : !torch.vtensor<[4,512,1024],f64>, !torch.float, !torch.float, !torch.none -> !torch.vtensor<[4,512,1024],f64>
    %5 = torch.aten.log %3 : !torch.vtensor<[4,512,1024],f64> -> !torch.vtensor<[4,512,1024],f64>
    %6 = torch.aten.mul.Scalar %5, %float-2.000000e00 : !torch.vtensor<[4,512,1024],f64>, !torch.float -> !torch.vtensor<[4,512,1024],f64>
    %7 = torch.aten.sqrt %6 : !torch.vtensor<[4,512,1024],f64> -> !torch.vtensor<[4,512,1024],f64>
    %8 = torch.aten.mul.Scalar %4, %float6.283180e00 : !torch.vtensor<[4,512,1024],f64>, !torch.float -> !torch.vtensor<[4,512,1024],f64>
    %9 = torch.aten.cos %8 : !torch.vtensor<[4,512,1024],f64> -> !torch.vtensor<[4,512,1024],f64>
    %10 = torch.aten.mul.Tensor %7, %9 : !torch.vtensor<[4,512,1024],f64>, !torch.vtensor<[4,512,1024],f64> -> !torch.vtensor<[4,512,1024],f64>
    %11 = torch.prim.ListConstruct %int0, %int1, %int2 : (!torch.int, !torch.int, !torch.int) -> !torch.list<int>
    %12 = torch.aten.sum.dim_IntList %10, %11, %true, %none : !torch.vtensor<[4,512,1024],f64>, !torch.list<int>, !torch.bool, !torch.none -> !torch.vtensor<[1,1,1],f64>
    %13 = torch.aten.div.Scalar %12, %int2097152 : !torch.vtensor<[1,1,1],f64>, !torch.int -> !torch.vtensor<[1,1,1],f64>
    %14 = torch.aten.sub.Tensor %10, %13, %float1.000000e00 : !torch.vtensor<[4,512,1024],f64>, !torch.vtensor<[1,1,1],f64>, !torch.float -> !torch.vtensor<[4,512,1024],f64>
    %15 = torch.aten.mul.Tensor %14, %14 : !torch.vtensor<[4,512,1024],f64>, !torch.vtensor<[4,512,1024],f64> -> !torch.vtensor<[4,512,1024],f64>
    %16 = torch.aten.sum.dim_IntList %15, %11, %false, %none : !torch.vtensor<[4,512,1024],f64>, !torch.list<int>, !torch.bool, !torch.none -> !torch.vtensor<[],f64>
    torch.runtime.assert %true, "correction value should be less than or equal to productDimSize + 1"
    %17 = torch.aten.div.Scalar %16, %int2097151 : !torch.vtensor<[],f64>, !torch.int -> !torch.vtensor<[],f64>
    %18 = torch.aten.sqrt %17 : !torch.vtensor<[],f64> -> !torch.vtensor<[],f64>
    return %18 : !torch.vtensor<[],f64>
  }
}

ramiro050 self-requested a review on January 19, 2023 at 19:29.
ramiro050 (Collaborator) left a comment


Thanks! LGTM

Invocation(ZeroDTensorWithDtype(torch.bool), *args),
]

def _get_invocations_for_fp_only_op_with_tensor_arg_followed_by(*args):
ramiro050 (Collaborator) commented:

I will be making a PR soon that improves the testing helper functions quite a bit, so if you're working on another set of ops, I would wait until that lands

li-plus (Collaborator, Author) replied:

Awesome! I haven't started working on the next task. Will wait for your PR before further development.
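For context, the hunk this thread refers to ends one invocation-list helper and begins _get_invocations_for_fp_only_op_with_tensor_arg_followed_by. A hedged sketch of what such a helper might look like follows; Invocation and ZeroDTensorWithDtype are names taken from the hunk and assumed to be in scope, and the body is illustrative rather than the PR's actual implementation:

import torch

def _get_invocations_for_fp_only_op_with_tensor_arg_followed_by(*args):
    # Illustrative only: exercise the generated dtype function on a spread of
    # input dtypes (bool, integral, and several floating-point widths).
    return [
        Invocation(ZeroDTensorWithDtype(torch.float32), *args),
        Invocation(ZeroDTensorWithDtype(torch.float64), *args),
        Invocation(ZeroDTensorWithDtype(torch.bfloat16), *args),
        Invocation(ZeroDTensorWithDtype(torch.int64), *args),
        Invocation(ZeroDTensorWithDtype(torch.bool), *args),
    ]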

@@ -277,7 +277,7 @@ function test_in_tree() {
   python -m e2e_testing.main --config=lazy_tensor_core -v

   echo ":::: Run TorchDynamo e2e integration tests"
-  python -m e2e_testing.main --config=torchdynamo -v
+  python -m e2e_testing.main --config=torchdynamo -v --crashing_tests_to_not_attempt_to_run_and_a_bug_is_filed RandnDtypeDeviceModule_basic
ramiro050 (Collaborator) commented:

This is fine, thanks!

ramiro050 merged commit 83d4e89 into llvm:dtype-functions-staging on Jan 20, 2023.
li-plus deleted the dtype-float32 branch on January 21, 2023.
ramiro050 pushed a commit to ramiro050/torch-mlir that referenced this pull request Jan 27, 2023
ramiro050 added a commit to ramiro050/torch-mlir that referenced this pull request May 9, 2023
commit bafe339
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Mon May 8 21:26:56 2023 +0000

    Add dtype functions for aten.atan and prims.squeeze

commit bebf695
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Mon May 8 21:26:10 2023 +0000

    Remove duplicate code from merge with main

commit 0d11895
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Fri May 5 21:39:02 2023 +0000

    Update LLVM tag

commit 73d5c07
Merge: 899d8bc eaaaeb6
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Fri May 5 21:30:09 2023 +0000

    Merge remote-tracking branch 'upstream/main' into merge-main

commit 899d8bc
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Mon Mar 13 21:39:14 2023 +0000

    Add dtype functions for `aten.ge.Tensor` and `aten.le.Tensor`

commit f58f9c2
Merge: ce7abf4 4912c39
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Mon Mar 13 21:32:00 2023 +0000

    Merge branch 'main' into merge-main

commit ce7abf4
Author: Jiahao Li <[email protected]>
Date:   Wed Feb 22 06:54:41 2023 +0800

    Add dtype functions for ops that take dtype from 2nd operand (llvm#1891)

commit 63945a2
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Mon Feb 13 17:56:09 2023 -0800

    Change dtype functions interface to take ints tuple for each tensor (llvm#1865)

    The original design for the dtype functions outlined in
    llvm#1462 was unable to properly
    handle ops that take optional tensors as an input when the optional
    tensor has a value of None. By the time the op gets imported into
    torch-mlir, if an optional value is None, all information about the
    original type is lost from the op type signature, preventing
    torch-mlir from knowing if a value of None was from an optional tensor
    or not, which was crucial in the original design since each tensor
    argument must be turned into two separate arguments for the dtype
    function.

    This commit changes the interface to dtype functions such that each
    tensor turns into a tuple of two ints, the first representing the rank
    of the tensor and the second the dtype of the tensor. Since now there
    is a one-to-one correspondence between the operands of an op and the
    operands of its dtype function, there is no ambiguity about which
    operand of the op corresponds with which operand of the dtype
    function.

    To test the implementation, this commit defines dtype functions for
    the convolution ops, all of which take one optional tensor as an
    argument.
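A minimal sketch of a dtype-function signature under this (rank, dtype) interface, using hypothetical names and a simplified result rule (this is not the actual torch-mlir convolution dtype function):

from typing import Optional, Tuple

def conv2d_dtype(input_rank_dtype: Tuple[int, int],
                 weight_rank_dtype: Tuple[int, int],
                 bias_rank_dtype: Optional[Tuple[int, int]] = None) -> int:
    # Each tensor operand arrives as (rank, dtype); an optional tensor whose
    # value is None simply arrives as None, so there is no ambiguity.
    input_rank, input_dtype = input_rank_dtype
    # Simplified rule for illustration: the result follows the input dtype.
    return input_dtype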

commit 981ac88
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Wed Feb 1 22:30:27 2023 +0000

    Add dtype functions for two tensor promotion ops (llvm#1831)

    This commit adds dtype functions for ops in RefineTypes under the
    category of "Promote the two dtypes". The only ops not added here are
    convolution ops, since they take an optional tensor argument, and the
    dtype pipeline currently does not correctly handle that case. I will
    add a follow up patch fixing this.

    This commit also adds two helper functions that perform a very
    thorough testing of dtype functions. The helper function
    `_check_two_tensor_op` is able to independently test invalid input
    dtypes and invalid output dtypes.

    Lastly, this commit also XFAILs "MobilenetV3Module_basic".
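As a quick eager-mode illustration of the "promote the two dtypes" rule those functions encode (standard PyTorch type promotion, not torch-mlir code):

import torch

# Mixed integer/float operands promote to the floating-point dtype; two
# floating-point operands promote to the wider one.
assert torch.promote_types(torch.int64, torch.float32) == torch.float32
assert torch.promote_types(torch.float16, torch.float64) == torch.float64
assert (torch.ones(2, dtype=torch.int32) + torch.ones(2, dtype=torch.float32)).dtype == torch.float32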

commit 83d4e89
Author: Jiahao Li <[email protected]>
Date:   Sat Jan 21 02:39:41 2023 +0800

    Add dtype functions for floating point ops (llvm#1813)

commit 8cae5ba
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Mon Jan 16 14:32:23 2023 -0800

    Add dtype functions for comparison ops (llvm#1806)

    This commit adds dtype functions for comparison ops that always return
    a tensor of dtype `i1`.
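A one-line eager-mode check of that rule (torch.bool lowers to i1), purely as an illustration:

import torch

# Comparison ops always produce a boolean (i1) tensor, regardless of input dtype.
assert (torch.ones(2, dtype=torch.float64) > torch.zeros(2, dtype=torch.float64)).dtype == torch.bool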

commit 5b77c15
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Mon Jan 16 20:27:49 2023 +0000

    Add CI to `dtype-functions-staging` branch

commit ac94ba2
Author: Ramiro Leal-Cavazos <[email protected]>
Date:   Thu Jan 12 22:41:04 2023 +0000

    Move dtype functions into their own section in lib gen file

    In order to easily keep track of the dtype functions that have been
    moved to `abstract_interp_lib_gen.py` and make it easier to add new
    ones, this commit groups all the dtype functions together, rather than
    having them interspersed between the shape functions.
ramiro050 added a commit to ramiro050/torch-mlir that referenced this pull request May 9, 2023