
Implement cuDNN activation functions. #33

Merged
merged 5 commits into feiwang3311:master on Oct 21, 2018

Conversation

dan-zheng (Collaborator)

  • Add wrapper functions for cudnnActivationForward and
    cudnnActivationBackward.
  • Implement relu/tanh/sigmoid (original and gradient functions) using cuDNN, add tests.

TODO:

  • Support activations for non-4D tensors. This will likely require shape
    padding with "1" dimensions to work with the cuDNN API.

- Add wrapper functions for `cudnnActivationForward` and
  `cudnnActivationBackward`.
- Move `relu` to be backend-defined.
- Implement `relu` and `relu_grad` using cuDNN, add tests.
- Minor: change out-of-bounds indexing via `Dimensions.apply` to clearly
  throw `IndexOutOfBoundsException`.

TODO:
- Support `relu` for non-4D tensors. This will likely require shape
  padding with "1" dimensions to work with the cuDNN API.
- Implement other activation functions.
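
For orientation, here is a minimal C sketch of what the generated cuDNN forward call boils down to, assuming cuDNN is installed and the tensor data is already on the GPU; the function name relu_forward, the error-checking macro definition, and the pointer names are illustrative, not taken from this PR. The 4-D descriptor is also why non-4D shapes need padding with "1" dimensions.

#include <cudnn.h>
#include <stdio.h>
#include <stdlib.h>

// Minimal error-checking macro in the spirit of the CUDNN_CALL used by the
// generated code (assumed here, not copied from the PR).
#define CUDNN_CALL(f) do { \
    cudnnStatus_t s = (f); \
    if (s != CUDNN_STATUS_SUCCESS) { \
      fprintf(stderr, "cuDNN error %d at %s:%d\n", (int)s, __FILE__, __LINE__); \
      exit(1); \
    } \
  } while (0)

// Apply ReLU out-of-place to a 4-D float tensor resident on the GPU.
void relu_forward(cudnnHandle_t handle, const float* x, float* y,
                  int n, int c, int h, int w) {
  cudnnTensorDescriptor_t x_desc;
  CUDNN_CALL(cudnnCreateTensorDescriptor(&x_desc));
  // cuDNN wants a 4-D descriptor; a 2-D tensor of shape (n, c) would be
  // described as (n, c, 1, 1).
  CUDNN_CALL(cudnnSetTensor4dDescriptor(x_desc, CUDNN_TENSOR_NCHW,
                                        CUDNN_DATA_FLOAT, n, c, h, w));

  cudnnActivationDescriptor_t act_desc;
  CUDNN_CALL(cudnnCreateActivationDescriptor(&act_desc));
  CUDNN_CALL(cudnnSetActivationDescriptor(act_desc, CUDNN_ACTIVATION_RELU,
                                          CUDNN_PROPAGATE_NAN, 0.0));

  float one = 1.0f, zero = 0.0f;
  // y = relu(x); beta = 0 overwrites any previous contents of y.
  CUDNN_CALL(cudnnActivationForward(handle, act_desc,
                                    &one, x_desc, x,
                                    &zero, x_desc, y));

  CUDNN_CALL(cudnnDestroyActivationDescriptor(act_desc));
  CUDNN_CALL(cudnnDestroyTensorDescriptor(x_desc));
}

Swapping CUDNN_ACTIVATION_RELU for CUDNN_ACTIVATION_TANH or CUDNN_ACTIVATION_SIGMOID gives the other two activations.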
dan-zheng requested a review from feiwang3311 on October 21, 2018, 20:10.
}
}

override def tanh(x: Tensor) = x.map(s => Math.tanh(s).toFloat)
dan-zheng (Collaborator, Author):


It'd be good to decouple these backend functions from Tensor methods (map and add_oneMinusSquare_mult).

feiwang3311 (Owner):


Yes, I agree. We can sort these things out as we encounter them.

dan-zheng changed the title from "Start cuDNN activation function support." to "Implement cuDNN activation functions." on Oct 21, 2018.
I accidentally copied tanh's gradient.
feiwang3311 merged commit 90ca4ad into feiwang3311:master on Oct 21, 2018.
"CUDNN_CALL(cudnnActivationBackward(\n" +
" cudnnHandle, act_desc,\n" +
" ", one, ", x_desc, ", res.x.data, ", x_desc, ", res.d.data, ", x_desc, ", input.x.data, ",\n",
" ", zero, ", x_desc, ", inputGrad.data, "));\n" +
dan-zheng (Collaborator, Author) commented on Oct 21, 2018:


As suggested by @feiwang3311: rather than using beta = 0 followed by input.d += inputGrad, use beta = 1.
The same applies elsewhere (conv2d, etc.).
I'll take this on.
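
A hedged sketch of that suggestion, reusing the descriptors from the forward sketch above (the pointer names y, dy, x, dx are illustrative stand-ins for res.x.data, res.d.data, input.x.data, input.d.data): cuDNN blends its output as dx = alpha * activation_grad + beta * dx, so passing beta = 1 accumulates the result directly into the existing gradient buffer, and the temporary inputGrad plus the extra input.d += inputGrad step go away.

float one = 1.0f;
// With alpha = 1 and beta = 1, cuDNN computes
//   dx = 1 * d(activation) * dy  +  1 * dx
// i.e. the activation gradient is accumulated in place instead of being
// written to a temporary buffer and added afterwards.
CUDNN_CALL(cudnnActivationBackward(
    handle, act_desc,
    &one, x_desc, y,      /* activation output  (res.x)   */
    x_desc, dy,           /* upstream gradient  (res.d)   */
    x_desc, x,            /* activation input   (input.x) */
    &one, x_desc, dx));   /* accumulate into    (input.d) */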

TiarkRompf mentioned this pull request on Oct 22, 2018.