Rewrite dtype functions for softmax ops in Python #1987
base: dtype-functions-staging
Conversation
@ramiro050 these are all one-liners; do you think they require tests?
Yes, every dtype function should be tested. What's nice is that the testing code is also a one-liner (two-liner tops) for these ops 🙂 See below for examples of testing ops with one tensor input and with two tensor inputs:
torch-mlir/python/torch_mlir/dialects/torch/importer/jit_ir/build_tools/abstract_interp_lib_gen.py, lines 1305 to 1312 in d3a49fd
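For instance, using the Invocation and TensorOfShape helpers from abstract_interp_lib_gen.py, a minimal test for one of the softmax dtype functions could look roughly like this (a sketch: the op name, decorator arguments, and body are illustrative rather than the exact code in this patch):

@check_dtype_function([
    Invocation(TensorOfShape(2, 3, 4, dtype=torch.float32), dim=0, half_to_float=False)])
def aten〇_softmax〡dtype(self_rank_dtype: Tuple[int, int], dim: int, half_to_float: bool) -> int:
    # With half_to_float=False the op keeps the input dtype, so the dtype
    # function just returns it (the half_to_float=True promotion case is
    # ignored in this sketch).
    self_rank, self_dtype = self_rank_dtype
    return self_dtype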
The CI is failing on the refine-types lit tests. Since we are slowly removing logic from …
fd8d8af to 54d7f28
@ramiro050 It's failing only on dynamo; is this a known issue? Here's the IR before the failure:
@@ -2232,6 +2232,36 @@ def aten〇native_batch_norm〡dtype(input_rank_dtype: Tuple[int, int], weight_r
    assert is_float_dtype(input_dtype)
    return input_dtype, input_dtype, input_dtype

@check_dtype_function([
    Invocation(TensorOfShape(2, 3, 4, dtype=torch.float32), dim=0, half_to_float=False)])
Can you use the helper functions for testing? These create invocations that make very thorough checks of the dtype functions. Same comment applies to the rest of the functions. See:
torch-mlir/python/torch_mlir/dialects/torch/importer/jit_ir/build_tools/abstract_interp_lib_gen.py
Lines 1305 to 1312 in d3a49fd
@check_dtype_function(_check_tensors_with_the_same_dtype(num_of_tensors=1, other=0.0) +
                      _check_tensors_with_the_same_dtype(num_of_tensors=1, other=0))
def aten〇eq〇Scalar〡dtype(self_rank_dtype: Tuple[int, int], other: Union[int, float]) -> int:
    return torch.bool

@check_dtype_function(_check_two_tensor_op())
def aten〇eq〇Tensor〡dtype(self_rank_dtype: Tuple[int, int], other_rank_dtype: Tuple[int, int]) -> int:
    return torch.bool
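Applied to the softmax change above, the helper-based test could look roughly like this (a sketch, assuming _check_tensors_with_the_same_dtype forwards the extra keyword arguments to the op as in the eq examples; the exact arguments may need adjusting per softmax variant):

@check_dtype_function(
    _check_tensors_with_the_same_dtype(num_of_tensors=1, dim=0, half_to_float=False))
def aten〇_softmax〡dtype(self_rank_dtype: Tuple[int, int], dim: int, half_to_float: bool) -> int:
    # Sketch only: returns the input dtype, matching the half_to_float=False case.
    self_rank, self_dtype = self_rank_dtype
    return self_dtype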
@@ -619,19 +614,17 @@ void TypeAnalysis::visitOperation(Operation *op,
        AtenBitwiseNotOp, AtenToPrimDeviceOp, AtenCpuOp, AtenContiguousOp,
        AtenDetachOp, AtenMaskedFill_ScalarOp, AtenCopyOp, AtenCumsumOp,
        AtenLayerNormOp, AtenClampOp, AtenClampMinOp, AtenClampMaxOp,
        AtenNegOp, AtenFloorOp, Aten_SoftmaxBackwardDataOp, AtenDropoutOp,
        AtenTanhBackwardOp, AtenHardtanhBackwardOp,
There is already a patch that will handle the ops here: #1895. Can you undo these changes to avoid conflicts?
@check_dtype_function([
    Invocation(TensorOfShape(2, 3, 4, dtype=torch.float32), TensorOfShape(2, 3, 4, dtype=torch.float32), dim=0, input_dtype=torch.float32)])
def aten〇_softmax_backward_data〡dtype(grad_output_rank_dtype: Tuple[int, int], output_rank_dtype: Tuple[int, int], dim: int, input_dtype: int) -> int:
    return grad_output_rank_dtype[1]
I think the reason the test is failing is that there is currently no support for indexing tuples in dtype functions. Can you change this to grad_output_rank, grad_output_dtype = grad_output_rank_dtype, like the other functions here, to see if that fixes it?
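In other words, a sketch of the suggested edit (only the tuple indexing changes; the decorator and signature stay the same):

def aten〇_softmax_backward_data〡dtype(grad_output_rank_dtype: Tuple[int, int], output_rank_dtype: Tuple[int, int], dim: int, input_dtype: int) -> int:
    # Unpack the (rank, dtype) tuple instead of indexing it, since tuple
    # indexing is not currently supported in dtype functions.
    grad_output_rank, grad_output_dtype = grad_output_rank_dtype
    return grad_output_dtype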
Hey George, any updates on the dtype functions?