Refactored solvers with fluent pattern #1759

nakosung · 2015-01-20T16:06:39Z

I refactored solvers with fluent pattern to add a solver like rmsprop, which isn't included in this PR. After merging this PR, rmsprop will be proposed.

A cool thing introduced by this PR is keeping code DRY(Do not Repeat Yourself). You don't have to write similar code paths for cpu/gpu.

Modified ADAGRAD solver is like below:

op
      // compute square of gradient in update
      .sqr(param, update)
      // update history
      .add(update,history,history)
      // prepare update
      .sqrt(history,update)
      .add_scalar(delta,update)
      .div(param,update,update)      
      // scale and copy
      .axpby(local_rate,update,Dtype(0),param)

nakosung · 2015-01-21T00:33:18Z

Some tests are failed due to 'different trained weights'. It may not be an 'failure' because I've changed pow into sqr/sqrt, which can drive weights different. If maintainers decide to merge this PR, I'll update test cases.

jeffdonahue · 2015-01-21T09:22:30Z

This is a cool concept, but I'm not sure we want to adopt a new pattern to unify the math interface specifically for the solvers, as opposed to a more general unification at the level of the math interface itself, as in the device abstraction effort (#610, which should be restored eventually...). Maybe there are some interesting ideas here that could complement that effort though?

jeffdonahue · 2015-01-21T09:22:58Z

(Sorry, didn't mean to immediately close...)

shelhamer · 2015-01-22T03:07:50Z

Agreed that this is a neat concept, but it's perhaps too local when done only for the solver.

Re: RMSprop, it is a solver that is on our TODO list so we'd welcome a PR for it.

sguada · 2015-01-22T03:43:22Z

It would be better if it were part of Blob so it could be used in the
layers.

Sergio

2015-01-21 19:07 GMT-08:00 Evan Shelhamer [email protected]:

Agreed that this is a neat concept, but it's perhaps too local when done
only for the solver.

Re: RMSprop, it is a solver that is on our TODO list so we'd welcome a PR
for it.

—
Reply to this email directly or view it on GitHub
#1759 (comment).

nakosung · 2015-01-27T04:51:46Z

In fact I had implemented RMSprop in this pattern(fluent), PR for RMSprop should be re-written in the orginal caffe style.

nakosung · 2015-01-27T04:53:38Z

Applying this pattern to whole program will not be a hard thing.

shelhamer · 2015-01-29T18:09:46Z

@nakosung please PR RMSprop in the current Caffe style first. Once that is in, we can figure out a plan for the fluent style in Caffe. Thanks for both the consideration of how to make the Caffe code more readable and concise.

nakosung · 2015-02-01T13:00:58Z

@shelhamer Okay, I'll try to PR RMSprop in a separate branch, but for now it doesn't have a priority.

nakosung added 6 commits January 21, 2015 00:38

Introducing fluent pattern

a2bd5f4

unnecessary stmt removed

07fb983

pow 0.5 --> sqrt

6c2abf6

missing file

64edf78

Lint

e654fc1

test...

684ea09

jeffdonahue closed this Jan 21, 2015

jeffdonahue reopened this Jan 21, 2015

nakosung closed this Feb 1, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactored solvers with fluent pattern #1759

Refactored solvers with fluent pattern #1759

nakosung commented Jan 20, 2015

nakosung commented Jan 21, 2015

jeffdonahue commented Jan 21, 2015

jeffdonahue commented Jan 21, 2015

shelhamer commented Jan 22, 2015

sguada commented Jan 22, 2015

nakosung commented Jan 27, 2015

nakosung commented Jan 27, 2015

shelhamer commented Jan 29, 2015

nakosung commented Feb 1, 2015

Refactored solvers with fluent pattern #1759

Refactored solvers with fluent pattern #1759

Conversation

nakosung commented Jan 20, 2015

nakosung commented Jan 21, 2015

jeffdonahue commented Jan 21, 2015

jeffdonahue commented Jan 21, 2015

shelhamer commented Jan 22, 2015

sguada commented Jan 22, 2015

nakosung commented Jan 27, 2015

nakosung commented Jan 27, 2015

shelhamer commented Jan 29, 2015

nakosung commented Feb 1, 2015