- cuda/convolution: workaround for 0 memory allocation (e30b59de)
- cudnnv4: passive support for cuDNNv4 (0dc46301)
- convolution: change convolution functions to require workspace (f9d40136)
- pointwise: add pointwise activation functions (cuDNN) (d74821b5)
- log_softmax: add LogSoftmax operations (86a8ae67)
- cuda:
- nn_trait: remove trait bounds for NN (9ad08d9f)
- license: change license to dual MIT/Apache-2.0 (8a940690)
- convolution: implement convolutions correctly (24b164b5)
- convolution: don't do a memAlloc for a zero size workspace (73612bb5)
- native: add support for softmax with tests and benches (14d6d1bc)
- native: fix sigmoid_grad to use x_diff instead of x for dx (c25a32aa)
- bench: add bench and perf utilities (0e2d34c6)
- native: implement Sigmoid, ReLU, tanh for Native backend (ece54e37)
- scale_params: fix ScalParams default to work on stable (43654dca)