feature/trt engine op test #11182
Conversation
```diff
@@ -14,7 +14,7 @@
 #pragma once

-#ifdef PADDLE_WITH_CUDA
+#if PADDLE_WITH_CUDA
```
This should be `#ifdef` here.
```diff
 #include "paddle/fluid/inference/tensorrt/convert/op_converter.h"
 #include "paddle/fluid/inference/tensorrt/convert/ut_helper.h"

+USE_CPU_ONLY_OP(tensorrt_engine);
```
Why is tensorrt_engine_op CPU-only? Shouldn't it be GPU-only?
The tensorrt engine op's kernel logic runs on the CPU, and it then triggers execution on the GPU.
```diff
@@ -34,12 +35,15 @@ class OpConverter {

   // Converter logic for an op.
   virtual void operator()(const framework::proto::OpDesc& op,
-                          const framework::Scope& scope) {}
+                          const framework::Scope& scope,
+                          bool test_mode = false) {}

   // Convert a single fluid operaotr and add the corresponding layer to TRT.
```
Typo: `operaotr` → `operator`.
```diff
@@ -37,12 +36,18 @@ class MulOpConverter : public OpConverter {
       engine_, MatrixMultiply, *const_cast<nvinfer1::ITensor*>(input1), false,
       *const_cast<nvinfer1::ITensor*>(input2), false);

-  engine_->DeclareOutput(layer, 0, op_desc.Output("Out")[0]);
+  auto output_name = op_desc.Output("Out")[0];
+  engine_->SetITensor(output_name, layer->getOutput(0));
```
- Why is the output name unknown during unit testing (test_mode)? Can this be removed in a later PR?
- If it is only needed for unit tests, could it be renamed to unittest_mode? Otherwise it reads like an inference test phase, and since TRT only runs the forward pass, that would be confusing.
This avoids hard-coding the output name here.
LGTM
NEXT STEP:
Write a tool that executes a larger model and outputs benchmark results.