
Learning Rate Decay Method #1

Open
kuan-wang opened this issue Mar 9, 2018 · 5 comments

Comments

@kuan-wang

Hello, that's really a good result :) Can you share the training method you used to get MobileNetV2 that high (e.g., the default learning rate and the weight-decay setting)? Thanks~
I am currently training the model with SGD, keeping the other hyper-parameters the same (except that I use batch size 256), but only get Top-1: 71.17%.

@ericsun99
Owner

I trained with SGD and used some data augmentation methods.

@SophieZhou

@ericsun99 Which data augmentation did you use during your training? Using crop, flip, and resize, I only got about 68% top-1 accuracy and about 88% top-5.
I evaluate the model with a center crop.
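For reference, the center-crop evaluation mentioned above can be sketched as follows. This assumes the standard ImageNet protocol (resize the shorter side to 256, then take the central 224×224 crop); the function below just computes the crop box coordinates and is not from this repo:

```python
def center_crop_box(width, height, crop=224):
    """Return the (left, top, right, bottom) box of a centered `crop` x `crop`
    region inside a `width` x `height` image (standard center-crop eval)."""
    left = (width - crop) // 2
    top = (height - crop) // 2
    return left, top, left + crop, top + crop

# e.g. an image resized so its shorter side is 256:
print(center_crop_box(256, 341))  # → (16, 58, 240, 282)
```

With torchvision this is typically expressed as `transforms.Compose([transforms.Resize(256), transforms.CenterCrop(224), ...])`.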

@SophieZhou

@ericsun99 @THUKey In your evaluation, do you test the model with ten-crop?

@Coderx7

Coderx7 commented Aug 17, 2018

@THUKey, @ericsun99 could you please share your hyperparameters?

@CF2220160244

Hello @THUKey. I train with SGD on a 1080 Ti, batch size = 96, lr = 0.045, weight decay = 0.00004, and decay the lr by a factor of 0.98 each epoch. After 2 days I only get 67%. Could you tell me your GPU, batch size, and initial lr to help me?
I am a student at Beijing Institute of Technology. Thank you very much!
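As a side note, the schedule described above (start at 0.045, multiply by 0.98 every epoch, as in the MobileNetV2 paper) can be sketched in plain Python; this is just an illustration of the arithmetic, not code from this repo:

```python
# Exponential learning-rate decay: lr(t) = base_lr * decay^t.
# base_lr=0.045 and decay=0.98 are the values quoted in the comment above.
def lr_at_epoch(epoch, base_lr=0.045, decay=0.98):
    """Learning rate after `epoch` decay steps."""
    return base_lr * decay ** epoch

for epoch in (0, 50, 100, 150):
    print(f"epoch {epoch:3d}: lr = {lr_at_epoch(epoch):.6f}")
```

In PyTorch the same schedule would typically be set up with `torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.98)`, stepped once per epoch.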
