Introduce optimizer_base_type in support of different optimizers #116

milancurcic · 2023-01-17T17:04:51Z

This is the first step toward decoupling the optimizer logic from the concrete layers.

This PR only introduces the abstract optimizer_base_type and a concrete sgd type.

The update of weights is still hardcoded in network % train and the concrete layer implementations; decoupling that remains a TODO.

In a nutshell, the idea is to have

Concrete optimizer types such as sgd, adam, etc. in nf_optimizers.f90 (and its submodules, eventually);
The type constructors would expect optimizer parameters from the user (e.g. adam(learning_rate, beta1, beta2, epsilon, ...))
Each concrete type would define an update subroutine which would expect the needed gradients (dw, db) as input, and also the weights and biases arrays as intent(out) to update.

@rweed let me know if this approach seems reasonable to you.

…ern-fortran#116)

Introduce optimizer_base_type in support of different optimizers

421895b

milancurcic merged commit edd3f70 into modern-fortran:main Jan 19, 2023

milancurcic deleted the refactor-optimizer-stub branch January 19, 2023 15:30

wilsonify pushed a commit to wilsonify/modern-fortran that referenced this pull request Jan 27, 2023

Introduce optimizer_base_type in support of different optimizers (mod…

ebf132d

…ern-fortran#116)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce optimizer_base_type in support of different optimizers #116

Introduce optimizer_base_type in support of different optimizers #116

milancurcic commented Jan 17, 2023

Introduce optimizer_base_type in support of different optimizers #116

Introduce optimizer_base_type in support of different optimizers #116

Conversation

milancurcic commented Jan 17, 2023