Flux.loadparams! is slow. #1764

Gregliest · 2021-11-09T19:30:25Z

According to the test case below, Flux.loadparams! is about 7x slower than creating a new Flux model and results in a lot more allocations. Is there a reason for that? The speed of loadparams! also seems to scale more poorly than creating a new model if more layers are added.

using Flux, BenchmarkTools

model = Chain(
  Dense(2, 3),
  Dense(3, 2),
  Dense(2, 2),
  Dense(2, 1)
)

p = Flux.params(model)

function buildModel(params)
  return Chain(
    Dense(params[1], params[2]),
    Dense(params[3], params[4]),
    Dense(params[5], params[6])
  )
end

@btime buildModel(p) # 691.101 ns (5 allocations: 224 bytes)
@btime Flux.loadparams!(model, p) # 4.963 μs (191 allocations: 7.08 KiB)

The text was updated successfully, but these errors were encountered:

ToucheSir · 2021-11-09T20:17:54Z

loadparams! has to iterate through the entire model structure recursively because it calls params, so the additional time + allocations are expected:

Flux.jl/src/functor.jl

Line 58 in 460f005

for (p, x) in zip(params(m), xs)

. Generally there shouldn't be a need to run it on any hot code path though, is there a reason you want to repeatedly call loadparams!?

Gregliest · 2021-11-09T22:01:44Z

Ok, that makes sense. I was thinking of using it in a Turing model to replace parameters when sampling stochastic weights, but I don't think that works with the gradient calculation.

Gregliest closed this as completed Nov 9, 2021

FelixBenning mentioned this issue May 30, 2022

Weird Side Effects of loadparams! #1979

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flux.loadparams! is slow. #1764

Flux.loadparams! is slow. #1764

Gregliest commented Nov 9, 2021

ToucheSir commented Nov 9, 2021

Gregliest commented Nov 9, 2021

Flux.loadparams! is slow. #1764

Flux.loadparams! is slow. #1764

Comments

Gregliest commented Nov 9, 2021

ToucheSir commented Nov 9, 2021

Gregliest commented Nov 9, 2021