This repository has been archived by the owner on Feb 21, 2022. It is now read-only.

WIP: Simple Regression with ADAM #5

Draft: wants to merge 1 commit into master

Conversation

DhairyaLGandhi
Member

Using FluxML/Optimisers.jl#3 and a simple sketch of a regression task with ADAM.

This currently results in:

ERROR: Tracing Error: No IR for Tuple{typeof(getfield),Tuple{Int64,Int64},Int64}
in Tuple{typeof(step),Chain{Tuple{Dense{typeof(relu),Array{Float64,2},Array{Float64,1}},Dense{typeof(identity),Array{Float64,2},Array{Float64,1}},typeof(softmax)}}}
in Tuple{typeof(gradient),var"#7#8",Chain{Tuple{Dense{typeof(relu),Array{Float64,2},Array{Float64,1}},Dense{typeof(identity),Array{Float64,2},Array{Float64,1}},typeof(softmax)}}}
in Tuple{Zygote.var"#38#39"{typeof(∂(#7))},Float64}
in Tuple{typeof(∂(#7)),Float64}
in Tuple{typeof(∂(mse)),Float64}
in Tuple{Zygote.var"#3121#back#1219"{Zygote.var"#1215#1217"{Array{Float64,2}}},Float64}
in Tuple{Zygote.var"#1215#1217"{Array{Float64,2}},Float64}
in Tuple{typeof(fill),Float64,Tuple{Int64,Int64}}
in Tuple{Type{Array{Float64,2}},UndefInitializer,Tuple{Int64,Int64}}
in Tuple{typeof(getfield),Tuple{Int64,Int64},Int64}
Stacktrace:
 [1] trace(::Any, ::Any, ::Vararg{Any,N} where N) at /Users/dhairyagandhi/.julia/packages/Mjolnir/eyPSM/src/trace.jl:266
 [2] trace(::Any, ::Vararg{Any,N} where N) at /Users/dhairyagandhi/Downloads/new_clones/XLATools.jl/src/compile/rt.jl:39
 [3] (::XLA.XFunction)(::Any) at /Users/dhairyagandhi/Downloads/new_clones/XLATools.jl/src/compile/rt.jl:55
 [4] train2(::Chain{Tuple{Dense{typeof(relu),Array{Float64,2},Array{Float64,1}},Dense{typeof(identity),Array{Float64,2},Array{Float64,1}},typeof(softmax)}}) at ./REPL[31]:3
 [5] top-level scope at REPL[33]:1
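
For context, a rough reconstruction of the kind of script that hits this, inferred only from the types in the trace above (the model sizes and the xla compile entry point are guesses, not the actual PR code):

using Flux, XLA

m = Chain(Dense(10, 32, relu), Dense(32, 2), softmax)
x, y = rand(10, 100), rand(2, 100)

function train2(m)
  step(m) = gradient(m -> Flux.mse(m(x), y), m)
  xla(step)(m)   # tracing fails inside the fill call used by the mse adjoint
end

train2(m)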

@MikeInnes
Member

I think you may as well just edit the original mnist.jl for this.

Does this script work if you avoid the xla conversion? Presumably it needs to be modified to pass the optimiser state around.

@DhairyaLGandhi
Member Author

It does work without the call to xla

@MikeInnes
Member

Ok, this needs to change to pass the optimiser state around explicitly though, rather than using a stateful IdDict internally; otherwise it can't work with XLA (or other immutable objects like StaticArrays) in principle.
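
Roughly the distinction, as a hedged sketch of an ADAM rule in the functional style (the apply name, the eta/beta field names, and the state layout are illustrative assumptions, not the final Optimisers.jl API):

# Stateful style (what's being replaced): apply!(o, x, dx) looks up the
# per-parameter moments in an internal IdDict and mutates dx in place.
# Functional style: the state is an explicit input and output, so the rule
# can return fresh arrays and also works for immutable parameters.
function apply(o::ADAM, x, dx, (mt, vt, βp))
  β, η, ϵ = o.beta, o.eta, 1e-8             # field names assumed for illustration
  mt = β[1] .* mt .+ (1 - β[1]) .* dx       # first-moment estimate
  vt = β[2] .* vt .+ (1 - β[2]) .* dx .^ 2  # second-moment estimate
  dx′ = η .* mt ./ (1 - βp[1]) ./ (sqrt.(vt ./ (1 - βp[2])) .+ ϵ)
  return dx′, (mt, vt, βp .* β)             # step to take, plus the new state
end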

@DhairyaLGandhi
Member Author

Yeah, I have removed the abstract parts from the PR and am reworking that bit to remove the IdDict. In this case, is there any preference for what init(::ADAM, x) should return?

@MikeInnes
Member

MikeInnes commented Jun 17, 2020

init should just return whatever state would normally be stored in the IdDict, e.g. a zero array of some kind.

Our functional training loop has the broad structure of

opt = ADAM(...)
st = state(opt, m)
for _ in _
  dm = gradient(...)
  m, st = update(opt, m, dm, st)
end

so ideally the state variable st is type stable, which might make it clearer what init should do.
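
A minimal sketch along those lines (the init/state helpers and field names here are assumptions for illustration, not the settled API):

using Functors: fmap

# Per-array state mirrors what the stateful ADAM keeps in its IdDict:
# first/second moment buffers plus the running powers of β, so the tuple
# has a fixed, inferable type.
init(o::ADAM, x::AbstractArray) = (zero(x), zero(x), o.beta)
init(o::ADAM, x) = nothing   # non-trainable leaves, e.g. activation functions

# One way to build st for a whole model: walk the leaves with fmap so the
# state tree mirrors the structure of m.
state(o::ADAM, m) = fmap(x -> init(o, x), m)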

@DhairyaLGandhi
Member Author

I've made some changes in FluxML/Optimisers.jl#3 to reflect the initialisation conditions. We still need to fix the call to (o::ADAM)(m, m̄) from its current form.

@MikeInnes
Member

While you're looking at it, it may make sense to try Optimisers.jl on the rnn example. Zygote returns Refs for mutable struct gradients, which I don't think the optimisers support yet.
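
For concreteness, a small illustration of the Ref issue (the Cell type is just a stand-in for an RNN cell; the exact wrapper may vary by Zygote version):

using Zygote

mutable struct Cell
  W::Matrix{Float64}
end
(c::Cell)(x) = c.W * x

c = Cell(randn(2, 2))
ḡ, = gradient(c -> sum(c(ones(2))), c)
# For the mutable struct, Zygote returns the gradient wrapped in a Ref,
# roughly Base.RefValue{Any}((W = ...,)), so an optimiser walking the model
# structurally needs to unwrap ḡ[] before pairing fields with gradients.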

@DhairyaLGandhi
Member Author

Yes, taking a look
