added GlobalMaxPool, GlobalMeanPool, and flatten layers #950
Conversation
src/layers/conv.jl
Outdated
Transforms (w,h,c,b)-shaped input into (w*h*c,b)-shaped output,
by linearizing all values for each element in the batch.
"""
struct Flatten end
Does this actually have to be a struct, or could it just as well be a function?
I was about to say the same: we don't really need a struct here, a plain function should be sufficient.
The reason I did this is that I am treating flattening as a layer (like Conv and Pool) and not as a function (like softmax). Technically, it could also have an activation function as a property.
struct Flatten{F}
    σ::F
    function Flatten(σ::F = identity) where {F}
        return new{F}(σ)
    end
end

function (f::Flatten)(x::AbstractArray)
    σ = f.σ
    σ(reshape(x, :, size(x)[end]))
end
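For context, a sketch of how this struct-based version would sit in a model, assuming the Flatten definition above (input shape and layer sizes are illustrative, not from the PR):

using Flux

model = Chain(Conv((3, 3), 1 => 16, relu),  # (28,28,1,b) -> (26,26,16,b)
              MaxPool((2, 2)),              # -> (13,13,16,b)
              Flatten(),                    # -> (2704,b), σ defaults to identity
              Dense(13 * 13 * 16, 10),
              softmax)

x = rand(Float32, 28, 28, 1, 32)  # a batch of 32 single-channel 28x28 images
size(model(x))                    # (10, 32)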
Keras also has flattening implemented as a layer btw.
I can also change the whole part on flattening to:
flatten(x::AbstractArray) = reshape(x, :, size(x)[end])
Would that be the way to go?
A layer is a function. Since this layer does not need to store any parameters, a regular function is enough. Layers with internal parameters are callable structs, i.e., they are functions as well.
flatten(x::AbstractArray) = reshape(x, :, size(x)[end])
Would that be the way to go?
I would say so :)
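To make that concrete, a minimal sketch of a plain function dropped straight into a Chain (layer sizes are assumptions for illustration):

using Flux

flatten(x::AbstractArray) = reshape(x, :, size(x)[end])

# a plain function composes in a Chain exactly like a callable struct:
model = Chain(Conv((3, 3), 1 => 8, relu),  # (28,28,1,b) -> (26,26,8,b)
              flatten,                     # -> (5408,b)
              Dense(26 * 26 * 8, 10))

x = rand(Float32, 28, 28, 1, 4)
size(model(x))  # (10, 4)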
I still tend more towards the solution with the struct.
The Flatten layer could have properties like an activation function, the number of parameters, a name, which dimensions it flattens, the data format, ...
It could, but your implementation does not. An activation function is just the next function in the chain, or alternatively the activation function of the preceding layer, as flatten is a linear operation. No other layers have names in Flux.
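A sketch of the equivalence being argued here: since flatten is linear, applying the activation as the next step in the chain gives the same result as fusing it into the layer (flatten as proposed above):

using Flux

flatten(x::AbstractArray) = reshape(x, :, size(x)[end])

m1 = Chain(flatten, x -> relu.(x))  # activation as the next function
m2 = x -> relu.(flatten(x))         # activation fused into one step

x = randn(Float32, 4, 4, 2, 3)
m1(x) == m2(x)  # true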
On the other hand, the Conv layers do have their activation function as a property. You can thus specify Conv((3,3), 32=>64, softmax), which is a bit cleaner than splitting them up into a layer and a function.
Maybe we need both? A flatten function that can be called whenever necessary, and a Flatten layer that calls that function but also contains other properties?
It could, but your implementation does not.
I would commit the changes where the activation function is used as a property for the Flatten layer first, before this gets merged.
Under Flux, there is nothing special about a layer and the way it's defined; a function or any transform is treated exactly the same. For the case of the layers here, since we don't really have any parameters, we should be good with simple layers (see stateless layers as an example). The function can be an input, I feel.
EDIT: If we were to add the parameters, making it a closure makes sense, and therefore, the struct.
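For what it's worth, a hypothetical sketch of that closure idea; flatten_along is an invented name, not part of this PR:

# if flattening ever needed configuration (say, which dimension is the
# batch), a closure could carry that state without defining a struct:
flatten_along(dim) = x -> reshape(x, :, size(x, dim))

f = flatten_along(4)
size(f(rand(Float32, 2, 3, 4, 5)))  # (24, 5)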
I have added a flatten function that will be called in the Flatten layer and added support for activation functions in that layer. (see 2nd commit)
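Presumably the second commit looks something like the following; this is a sketch reconstructed from the discussion above, not the exact committed code:

# standalone function, usable on its own anywhere in a Chain:
flatten(x::AbstractArray) = reshape(x, :, size(x)[end])

# layer that defers to the function, then applies its activation:
struct Flatten{F}
    σ::F
end
Flatten() = Flatten(identity)

(f::Flatten)(x::AbstractArray) = f.σ(flatten(x))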
Is this ready to get merged now?
I think a
Hi @gartangh, this PR looks good, could you rebase? Also, maybe you could remove the Flatten struct, as it seems to be the majority's opinion, and document and export the flatten function.
Hi @CarloLucibello, I rebased, removed the Flatten struct, and documented and exported the flatten function.
Could you perhaps rebase on master? I just updated the environment, since we had a fix go in in Zygote.
MeanPool
GlobalMeanPool
DepthwiseConv
ConvTranspose
CrossCor
add flatten here? Not sure if it is the right place; maybe we could use the NNlib section for the functional alternative of struct layers, but I think adding flatten here is ok for the time being.
@CarloLucibello, I rebased and added the flatten documentation tag.
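For reference, a quick sketch of the API this PR ends up exporting (layer names from the title and docs diff; output shapes assumed from the (w,h,c,b) convention used above):

using Flux

x = rand(Float32, 7, 7, 64, 8)  # (w, h, c, b)

size(GlobalMaxPool()(x))        # (1, 1, 64, 8): max over w and h
size(GlobalMeanPool()(x))       # (1, 1, 64, 8): mean over w and h
size(Flux.flatten(x))           # (3136, 8), i.e. (7*7*64, 8)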
Lgtm
nice, thanks! bors r+
Build succeeded