[amd] convolution kernel didn't reuse the algorithm founded. #11203

dzhwinter · 2018-06-05T11:08:16Z

My PR fix the issue above https://github.com/dzhwinter/Paddle/tree/review_conv2d_1
The cudnn op is run on Cuda device, so its inputs/outputs must stay at Cuda device. In ROCm#16, it use CPU Tensor to store the algorithm selected, but our framework will automatically transform it into a temporary GPU Tensor. As a result, inside cudnn op, it can not get the real persistent Tensor.

If we allocated output and input in GPU, and copy the result to CPU, then we will get the correct result.

dzhwinter · 2018-06-05T11:09:30Z

The result

I0605 10:56:09.376890 34846 tensor_util.cu:22] TensorCopy 3 from CUDAPlace(0) to CPUPlace
I0605 10:56:09.376969 34846 conv_cudnn_op.cu.cc:151] Find Kernel:  load  0x7f0044659080 kernel :3

dzhwinter · 2018-06-21T09:44:25Z

在backward op里，是可以利用前向op的所有input, output的。需要定制一下GradOpMaker (Python端用来创建OpDesc).

class Conv2DGradMaker : public framework::SingleGradOpDescMaker {
 public:
  using framework::SingleGradOpDescMaker::SingleGradOpDescMaker;

 protected:
  std::unique_ptr<framework::OpDesc> Apply() const override {
    auto* op = new framework::OpDesc();
    op->SetType("conv2d_grad");
    op->SetInput("Input", Input("Input"));
    op->SetInput("Filter", Input("Filter"));
    op->SetInput("Algorithm", Input("Algorithm"));
    op->SetInput(framework::GradVarName("Output"), OutputGrad("Output"));

    op->SetAttrMap(Attrs());

    op->SetOutput("AlgorithmOut", Output("AlgorithmOut"));
    op->SetOutput(framework::GradVarName("Input"), InputGrad("Input"));
    op->SetOutput(framework::GradVarName("Filter"), InputGrad("Filter"));

    return std::unique_ptr<framework::OpDesc>(op);
  }
};

dzhwinter · 2018-06-21T09:45:07Z

code 在https://github.com/dzhwinter/Paddle/tree/review_conv2d_1

shanyi15 · 2018-08-15T11:01:42Z

您好，此issue在近一个月内暂无更新，我们将于今天内关闭。若在关闭后您仍需跟进提问，可重新开启此问题，我们将在24小时内回复您。因关闭带来的不便我们深表歉意，请您谅解~感谢您对PaddlePaddle的支持!
Hello, this issue has not been updated in the past month. We will close it today for the sake of other user‘s experience. If you still need to follow up on this question after closing, please feel free to reopen it. In that case, we will get back to you within 24 hours. We apologize for the inconvenience caused by the closure and thank you so much for your support of PaddlePaddle Group!

shanyi15 closed this as completed Aug 15, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[amd] convolution kernel didn't reuse the algorithm founded. #11203

[amd] convolution kernel didn't reuse the algorithm founded. #11203

dzhwinter commented Jun 5, 2018

dzhwinter commented Jun 5, 2018

dzhwinter commented Jun 21, 2018 •

edited

Loading

dzhwinter commented Jun 21, 2018

shanyi15 commented Aug 15, 2018

[amd] convolution kernel didn't reuse the algorithm founded. #11203

[amd] convolution kernel didn't reuse the algorithm founded. #11203

Comments

dzhwinter commented Jun 5, 2018

dzhwinter commented Jun 5, 2018

dzhwinter commented Jun 21, 2018 • edited Loading

dzhwinter commented Jun 21, 2018

shanyi15 commented Aug 15, 2018

dzhwinter commented Jun 21, 2018 •

edited

Loading