EfficientNet implementation #796

leonid-pishchulin · 2019-06-05T20:45:43Z

Hey, are there any EfficientNet (https://arxiv.org/abs/1905.11946) implementations available? If not, is somebody working on it?

KellenSunderland · 2019-06-05T21:03:25Z

Just for some background: we're mostly interested so no one duplicates effort.

zhreshold · 2019-06-07T18:47:59Z

here's one candidate @sufeidechabei
To summarize, it might be easy to write the definitions(https://github.com/mnikitin/EfficientNet/blob/master/efficientnet_model.py), but training part is unpredictable yet due to the missing training hyper-parameters in paper.

Should we split the risk by training the network simultanously?

bermanmaxim · 2019-06-10T22:54:25Z

Note that https://github.com/tensorflow/tpu/blob/master/models/official/efficientnet/main.py has the training script that seems to have been used and provides hyperparameters.

bermanmaxim · 2019-06-10T22:58:10Z

Also there might be some differences with the paper, author says "source code is correct": see tensorflow/tpu#383, tensorflow/tpu#390

ryanjay0 · 2019-06-11T00:05:01Z

They never said "source code is correct" about tpu issue tensorflow/tpu#390. Did they? Seems like a much larger discrepancy than the padding issues in tensorflow/tpu#383

bermanmaxim · 2019-06-11T00:18:15Z

True, what I meant is that since the author said source code is correct on tensorflow/tpu#383 I was assuming the code is what they actually used, including concerning tensorflow/tpu#390. But you are right that this resolution discrepancy is a big difference 🤔

bermanmaxim · 2019-06-18T22:08:07Z

Regarding the training of efficientnet, see the remarks of Ross Wightman here: https://forums.fast.ai/t/efficientnet/46978/67 ; it might be that keeping an exponential moving average of the weights during training, for use at testing, helps this family of models a lot.

bermanmaxim · 2019-06-20T01:03:06Z

Similar discussion over here: pytorch/vision#980

hetong007 · 2019-06-24T19:13:17Z

@sufeidechabei please check your PR with the resources above.

github-actions · 2021-05-24T06:43:44Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

github-actions bot added the Stale label May 24, 2021

github-actions bot closed this as completed May 31, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EfficientNet implementation #796

EfficientNet implementation #796

leonid-pishchulin commented Jun 5, 2019 •

edited

Loading

KellenSunderland commented Jun 5, 2019

zhreshold commented Jun 7, 2019

bermanmaxim commented Jun 10, 2019 •

edited

Loading

bermanmaxim commented Jun 10, 2019

ryanjay0 commented Jun 11, 2019 •

edited

Loading

bermanmaxim commented Jun 11, 2019 •

edited

Loading

bermanmaxim commented Jun 18, 2019

bermanmaxim commented Jun 20, 2019

hetong007 commented Jun 24, 2019

github-actions bot commented May 24, 2021

EfficientNet implementation #796

EfficientNet implementation #796

Comments

leonid-pishchulin commented Jun 5, 2019 • edited Loading

KellenSunderland commented Jun 5, 2019

zhreshold commented Jun 7, 2019

bermanmaxim commented Jun 10, 2019 • edited Loading

bermanmaxim commented Jun 10, 2019

ryanjay0 commented Jun 11, 2019 • edited Loading

bermanmaxim commented Jun 11, 2019 • edited Loading

bermanmaxim commented Jun 18, 2019

bermanmaxim commented Jun 20, 2019

hetong007 commented Jun 24, 2019

github-actions bot commented May 24, 2021

leonid-pishchulin commented Jun 5, 2019 •

edited

Loading

bermanmaxim commented Jun 10, 2019 •

edited

Loading

ryanjay0 commented Jun 11, 2019 •

edited

Loading

bermanmaxim commented Jun 11, 2019 •

edited

Loading