noise shape for dropout #563
Conversation
This looks like a good idea, but it also seems like it's equivalent to a
@MikeInnes Do you mean like using a
Yes exactly. Though it might be better for
But if the
@MikeInnes I made a first version:

```julia
julia> Dropout(0.5)(randn(5,4), 1)
5×4 Array{Float64,2}:
 -0.230536  -0.0   1.02677   -0.903341
 -0.605143   0.0   0.748388   0.732854
  2.56266   -0.0  -2.79108   -1.59313
 -0.613482  -0.0   0.468957  -1.96
 -0.87279   -0.0   4.01647    0.647282

julia> Dropout(0.5)(randn(5,4,2), (1,3))
5×4×2 Array{Float64,3}:
[:, :, 1] =
 -0.0  -0.0  -1.66134    1.97335
 -0.0   0.0  -0.310311   2.57003
  0.0  -0.0   1.24803   -3.60845
 -0.0   0.0  -1.4593    -0.755723
  0.0   0.0   0.8056     4.04177

[:, :, 2] =
 -0.0   0.0  -0.532319   -0.836303
  0.0   0.0   0.867975   -0.309224
 -0.0  -0.0  -2.63861     1.14548
 -0.0   0.0  -0.0331286   2.39778
  0.0   0.0  -2.47692    -0.358082
```
bump
src/layers/normalise.jl (Outdated)

```diff
 _dropout_kernel(y::T, p, q) where {T} = y > p ? T(1 / q) : T(0)

-function (a::Dropout)(x)
+function (a::Dropout)(x, dims=0)
```
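For reference, the `_dropout_kernel` line in the diff above turns a uniform random draw into a mask value: samples above `p` are kept and rescaled by `1/q` (with `q = 1 - p`), the rest are zeroed. A minimal self-contained illustration:

```julia
# _dropout_kernel as shown in the diff: maps a uniform sample y to either
# 1/q (keep and rescale) or 0 (drop), preserving the element type T.
_dropout_kernel(y::T, p, q) where {T} = y > p ? T(1 / q) : T(0)

# With p = 0.5 and q = 1 - p = 0.5:
_dropout_kernel(0.9, 0.5, 0.5)  # 2.0 -- kept, rescaled by 1/q
_dropout_kernel(0.3, 0.5, 0.5)  # 0.0 -- dropped
```

Dividing kept entries by `q` keeps the expected activation unchanged, so no extra rescaling is needed at inference time.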
It would be nicer to use `dims = :` for all dimensions, like the reduction functions do.
Got it. What about the `dims` question discussed above? I just thought it might be more convenient to use `dims` as the broadcasted dims, but maybe it's not, and `dims` as the unbroadcasted dims is more intuitive?
Yes, it's more intuitive if it aligns with how `dims` is used everywhere else. For example, if you wanted to sum across each image you'd likewise do `sum(x, dims = (1, 2, 3))`.

It should be a keyword argument, too.
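Under the unbroadcasted reading proposed here, the mask has full extent along the dimensions in `dims` and extent 1 elsewhere, so it broadcasts across the remaining dimensions, matching the `sum(x, dims = (1, 2, 3))` analogy. A rough standalone sketch of that behavior (`dropout_sketch` is a hypothetical name, not the actual Flux implementation):

```julia
# Sketch of dims-aware dropout. The mask varies along `dims` and has size 1
# along every other dimension, so it is shared (broadcast) across those.
function dropout_sketch(x, p; dims = :)
    q = 1 - p
    masksize = dims === Colon() ? size(x) :
        ntuple(i -> i in dims ? size(x, i) : 1, ndims(x))
    mask = rand(Float64, masksize) .> p  # true with probability q
    return x .* mask ./ q                # rescale surviving entries
end

# With dims = 1 on a 5x4 input the mask is 5x1, so each row is either
# kept whole or dropped whole:
dropout_sketch(ones(5, 4), 0.5; dims = 1)
```

With `dims = :` (the default) every element gets an independent mask value, recovering ordinary dropout.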
Ok, I changed `dims` to be the unbroadcasted dims and also made it a keyword argument.
Ok, one last thing, I think the
We also need to update the
Do you mean that we should make
Yes.
@MikeInnes where should I add the docs? I can't find the old one in the
Actually, dropout is part of the docs already so that's fine. Just NEWS.md needs updating.
Co-Authored-By: Mike J Innes <[email protected]>
bors r+
563: noise shape for dropout r=MikeInnes a=chengchingwen I add the noise shape for dropout, similar to the `noise_shape` argument in [`tf.nn.dropout`](https://www.tensorflow.org/api_docs/python/tf/nn/dropout) Co-authored-by: chengchingwen <[email protected]> Co-authored-by: Peter <[email protected]>
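For comparison: TensorFlow's `noise_shape` is an explicit shape whose entries are either the input's extent or 1, while the `dims` argument here names the dimensions along which the mask varies. A hypothetical helper (`noise_shape` below is an illustrative name, not part of Flux) mapping one convention to the other:

```julia
# Hypothetical converter: given the input size and the `dims` along which
# the dropout mask varies, produce the TF-style noise shape (full extent
# along `dims`, extent 1 elsewhere).
noise_shape(sz::Dims, dims) = ntuple(i -> i in dims ? sz[i] : 1, length(sz))

noise_shape((5, 4, 2), (1, 3))  # (5, 1, 2)
```

So `dims = (1, 3)` on a `5×4×2` input corresponds to `noise_shape = [5, 1, 2]` in `tf.nn.dropout`.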
Build succeeded