BatchNormsync with Adam Optimizer #35
Is the bnsync code written specifically for the SGD optimizer? The loss does not converge if I train the same model with the Adam optimizer.

Comments
@tarun005 Have you tested with the SGD optimizer? Does it drive the training process to convergence?
Yes, the model converges with SGD, but the same model does not converge if I replace SGD with Adam.
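For concreteness, here is a minimal sketch of the comparison being described (not code from this repo; the toy model, data, and hyperparameters are made-up placeholders). The only difference between the two runs is the optimizer line:

```python
# Minimal sketch of the SGD-vs-Adam comparison; nn.BatchNorm2d stands in
# for the repo's synchronized BN layer, and the toy data is made up.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, padding=1),
    nn.BatchNorm2d(8),   # swap in the repo's sync BN module here in practice
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(8, 10),
)
criterion = nn.CrossEntropyLoss()

# Run 1: SGD, reported above to converge.
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
# Run 2: Adam, reported not to converge with the same model.
# optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

x = torch.randn(16, 3, 32, 32)
y = torch.randint(0, 10, (16,))
for _ in range(10):
    optimizer.zero_grad()
    loss = criterion(model(x), y)
    loss.backward()
    optimizer.step()
```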
@tarun005 Although I suppose that BN should be irrelevant to the optimization method, when I used the syncbn by just adding the folder …
Agreed that BN shouldn't depend on the optimization method, but I have read somewhere that Adam requires global statistics at every iteration, so the implementation of BNsync given here could be an issue.
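To make the "global statistics" point concrete, here is a rough sketch (not the code in this repo) of how a synchronized BN forward pass typically aggregates per-channel statistics across processes with an all-reduce, so every replica normalizes with the statistics of the full global batch:

```python
# Rough sketch of sync-BN statistics aggregation; assumes torch.distributed
# has already been initialized (e.g. under DistributedDataParallel).
import torch
import torch.distributed as dist

def global_batch_stats(x: torch.Tensor):
    """x: local (N, C, H, W) mini-batch; returns per-channel mean and variance
    computed over the batches of all processes combined."""
    count = torch.tensor([x.numel() / x.size(1)], device=x.device)
    total = x.sum(dim=(0, 2, 3))            # per-channel sum
    total_sq = (x * x).sum(dim=(0, 2, 3))   # per-channel sum of squares

    # Sum the partial statistics across every process in the group.
    for t in (count, total, total_sq):
        dist.all_reduce(t, op=dist.ReduceOp.SUM)

    mean = total / count
    var = total_sq / count - mean * mean
    return mean, var
```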
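One way to check whether the custom BNsync implementation, rather than Adam itself, is responsible would be to convert the model to PyTorch's built-in synchronized BN and rerun the Adam training. This is a hedged suggestion, assuming a reasonably recent PyTorch and a model built from standard BatchNorm*d layers that the helper can convert; it is not part of this repo:

```python
# If Adam converges with the native SyncBatchNorm but not with the repo's
# BNsync, the custom implementation is the likely culprit.
import torch.nn as nn

def use_native_sync_bn(model: nn.Module) -> nn.Module:
    # Converts all BatchNorm*d layers in the model to torch.nn.SyncBatchNorm;
    # the converted model must be trained under torch.distributed (e.g. DDP).
    return nn.SyncBatchNorm.convert_sync_batchnorm(model)
```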