Commit

Merge pull request #32 from dan-zheng/cudnn-conv2d
 Implement cuDNN conv2D grad.
feiwang3311 authored Oct 19, 2018
2 parents ceab5ee + 535c1bb commit 663632d
Showing 3 changed files with 397 additions and 127 deletions.
4 changes: 4 additions & 0 deletions src/main/scala/lantern/DslAPI.scala
Expand Up @@ -497,6 +497,10 @@ trait DslGenCublas extends DslGenBase {
| data[index] = value;
|}
|
|__global__ void arrayFill(float *data, float value) {
| int tid = threadIdx.x + blockIdx.x * blockDim.x;
| data[tid] = value;
|}
|
|// From: https://github.com/pytorch/pytorch/blob/master/aten/src/THC/THCIntegerDivider.cuh
|// Result of div/mod operation stored together.
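The `arrayFill` kernel added in this hunk writes `data[tid]` for every launched thread and has no bounds check, so it appears to rely on the launch covering exactly the number of elements in the buffer. As a minimal sketch of a guarded alternative (not part of this commit), the kernel below takes an extra `size` argument and is launched with a rounded-up grid. The names `arrayFillGuarded` and `filledOnDevice`, the `size` parameter, and the block size of 256 are all assumptions made for illustration.

```cuda
#include <cuda_runtime.h>
#include <cassert>
#include <vector>

// Hypothetical guarded variant of arrayFill. The `size` parameter is an
// assumption for illustration; the kernel in the commit has no such argument.
__global__ void arrayFillGuarded(float *data, float value, int size) {
  int tid = threadIdx.x + blockIdx.x * blockDim.x;
  if (tid < size) {  // threads past the end of the buffer do nothing
    data[tid] = value;
  }
}

// Hypothetical host helper: fills `size` floats on the device with `value`
// and copies the result back to the host.
std::vector<float> filledOnDevice(int size, float value) {
  std::vector<float> host(size, 0.0f);
  if (size == 0) return host;

  float *dev = nullptr;
  cudaError_t err = cudaMalloc(&dev, size * sizeof(float));
  assert(err == cudaSuccess);

  const int threads = 256;                            // assumed block size
  const int blocks = (size + threads - 1) / threads;  // round up to cover size
  arrayFillGuarded<<<blocks, threads>>>(dev, value, size);

  // cudaMemcpy on the default stream waits for the kernel to finish.
  err = cudaMemcpy(host.data(), dev, size * sizeof(float),
                   cudaMemcpyDeviceToHost);
  assert(err == cudaSuccess);

  cudaFree(dev);
  return host;
}
```

With the guard in place, a call such as `filledOnDevice(1000, 1.5f)` launches four blocks of 256 threads, and the 24 surplus threads in the last block skip the write. The unguarded kernel in the diff stays correct only if its call sites never launch more threads than the buffer holds.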
