[NNAPI QDQ] Add QDQReshape op support #10533
Conversation
if (IsQuantizedOp(node_unit)) {
  AddQuantizationScaleAndZeroPointToSkip(model_builder, *node_unit.Inputs()[0].quant_param);   // x_scale, x_zp
  AddQuantizationScaleAndZeroPointToSkip(model_builder, *node_unit.Outputs()[0].quant_param);  // y_scale, y_zp
} else {
The perm initializer always needs to be ignored here.
    initializers, node_unit.Inputs()[0], node_unit.ModelPath(), x_scale, x_zero_point));
ORT_RETURN_IF_ERROR(IsValidInputQuantizedType(model_builder, input, x_scale, x_zero_point));
}

return AddReshapeOperator(model_builder, node_unit, input, shape);
You need to pass the x_scale and x_zero_point into AddReshapeOperator; otherwise they will be set to 0.
if (!GetType(node_unit.Inputs()[0].node_arg, input_type))
  return false;

if (input_type != ONNX_NAMESPACE::TensorProto_DataType_FLOAT &&
These are not required here; this list is only for these operators:
onnxruntime/onnxruntime/core/providers/nnapi/nnapi_builtin/builders/helper.cc
Lines 286 to 293 in bfb20b3
static const std::unordered_set<std::string> internal_quantized_op_types =
    {
        "Transpose",
        "Resize",
        "Concat",
        "MaxPool",
    };
@@ -867,10 +867,20 @@ class ReshapeOpBuilder : public BaseOpBuilder {
   Status AddToModelBuilderImpl(ModelBuilder& model_builder, const NodeUnit& node_unit) const override;
   static bool CanSkipReshape(const ModelBuilder& model_builder, const NodeUnit& node_unit,
                              size_t input_rank, size_t output_rank);
   static bool IsQuantizedOp(const NodeUnit& node_unit) ORT_MUST_USE_RESULT;  // TODO, see if we want to move this to BaseOpBuilder
Yes. @gwang-msft, do you think we should move it to BaseOpBuilder now, or keep it at the individual op builder level since we don't have that many QDQ ops supported yet?
We can move it to BaseOpBuilder as a virtual function that returns false by default; each individual builder will override it if necessary.
Same for BaseOpSupportChecker.
@@ -971,11 +982,11 @@ void ReshapeOpBuilder::AddInitializersToSkip(ModelBuilder& model_builder, const
   // Add new shape
   Shape shape_dimen = {static_cast<uint32_t>(shape.size())};
   std::string shape_name = model_builder.GetUniqueName(node_unit.Name() + input + "newshape");
-  OperandType shape_operand_type(Type::TENSOR_INT32, shape_dimen);
+  OperandType shape_operand_type(Type::TENSOR_INT32, shape_dimen, scale, zero_point);
This is the new shape input of the Reshape op; it does not need scale and zero_point.
@@ -947,7 +957,8 @@ void ReshapeOpBuilder::AddInitializersToSkip(ModelBuilder& model_builder, const
 /* static */ Status ReshapeOpBuilder::AddReshapeOperator(ModelBuilder& model_builder,
                                                          const NodeUnit& node_unit,
                                                          const std::string& input,
-                                                         const std::vector<int32_t>& shape) {
+                                                         const std::vector<int32_t>& shape,
+                                                         float scale, int32_t zero_point) {
It will be easier to get the scale and zero_point from the input inside this function instead of passing them in; use something like this:
+ // For reshape, the output type should be the same as the input type except the shape is different
+ auto output_operand_type = operand_types.at(input);
+ output_operand_type.SetDimensions(shaper[output]);
+
// Since Reshape is not running using hardware in NNAPI for some CPU (e.g. Qualcomm SD for now)
// We will try to see if we can skip the Reshape to prevent context switching between
// NNAPI CPU impl and NNAPI hardware accelerator impl
if (CanSkipReshape(model_builder, node_unit, input_rank, output_rank)) {
// Since reshape can be skipped, only register the dimension and type, with same index and new name
- const OperandType output_operand_type(operand_types.at(input).type, shaper[output], scale, zero_point);
model_builder.RegisterOperand(output, operand_indices.at(input), output_operand_type, false);
} else {
// We still need to perform a reshape here
// Add new shape
Shape shape_dimen = {static_cast<uint32_t>(shape.size())};
std::string shape_name = model_builder.GetUniqueName(node_unit.Name() + input + "newshape");
- OperandType shape_operand_type(Type::TENSOR_INT32, shape_dimen, scale, zero_point);
+ OperandType shape_operand_type(Type::TENSOR_INT32, shape_dimen);
ORT_RETURN_IF_ERROR(model_builder.AddOperandFromPersistMemoryBuffer(shape_name, shape.data(), shape_operand_type));
input_indices.push_back(operand_indices.at(shape_name));
-
- const OperandType output_operand_type(operand_types.at(input).type, shaper[output], scale, zero_point);
ORT_RETURN_IF_ERROR(model_builder.AddOperation(ANEURALNETWORKS_RESHAPE, input_indices, {output}, {output_operand_type}, {false}));
}
Description
Add QDQ Reshape op support and a basic test case.
Motivation and Context
More operator support.