Add EfficientNet Architecture in TorchVision #4293

datumbox · 2021-08-19T10:14:42Z

Fixes #980

The implementation follows a similar approach as MobileNetV3 and reuses components from existing models.

The weights are ported from the repos of @rwightman and @lukemelas.

rvandeghen · 2021-08-23T08:02:14Z

Hi,

Do you know when this will be released ?

datumbox · 2021-08-23T10:08:37Z

@rvandeghen No concrete date yet but as you can see I'm working on it. It's still early days and the architecture needs to be verified for all variants, add pre-trained weights etc.

Subscribe to this PR to get notified when it's merged.

datumbox · 2021-08-25T14:13:12Z

The failing tests are not related.

rwightman · 2021-08-25T16:18:16Z

@datumbox thanks for putting the mention of weight sources in doc file and weight links comment, the B0-B4 took some time on my part to train. The B0, B3, and B4 that I trained are all better than the original TF baseline, RandAugment, and AdvProp + AA weights on ImageNet-1k, the B1 and B2 are earlier attempts that are decent, but not to the same level, I didn't follow up on since I rarely use those model sizes.

Re those B5-B7 weights, I believe those are just the weights from the original tensorflow models (https://github.com/tensorflow/tpu/tree/master/models/official/efficientnet, equivalent to my tf_ weights) as ported by Luke. They need assymetric TF 'SAME' padding for full performance and will suffer reduced accuracy without. That drop will vary as you change the input size and have different padding offsets through the model relative to what SAME would produce. That can be more problematic using them as backbones for say obj detection / segmentation, esp if you freeze some of the layers.

I'm always open to training B5-B7/B8 but would need some sort of business arrangement + someone elses compute resources as it'd would be a significant commitment in time and compute to iterate and produce good weights.

datumbox · 2021-08-25T16:35:08Z

@rwightman No worries, thanks for your work on it. :)

Indeed B5-B7 require the "same" padding. We were debating whether to add code to handle this in our implementation but we decided not to because a) the degradation is very small (0.1-0.2 points), b) it's slower and c) as you said it can be resolved by retraining the models on the future. For now, users who want to faithfully reproduce the models can use your implementation.

Concerning B1 and B2, yes I noticed that they can be further optimized but I think the models are good enough for now. We are happy to review them once the #3911 epic is completed as it will add lots of standard utils in TorchVision to achieve SOTA results. Some of them are added directly in PyTorch (LabelSmoothing, Warmup schedulers etc), so it might be feasible to reduce the amount of code you maintain on timm and help you focus on bringing more models.

fmassa

Looks great, thanks!

fmassa · 2021-08-26T09:22:00Z

references/classification/train.py

+    resize_size, crop_size = 256, 224
+    interpolation = InterpolationMode.BILINEAR
+    if args.model == 'inception_v3':
+        resize_size, crop_size = 342, 299
+    elif args.model.startswith('efficientnet_'):
+        sizes = {
+            'b0': (256, 224), 'b1': (256, 240), 'b2': (288, 288), 'b3': (320, 300),
+            'b4': (384, 380), 'b5': (456, 456), 'b6': (528, 528), 'b7': (600, 600),
+        }
+        e_type = args.model.replace('efficientnet_', '')
+        resize_size, crop_size = sizes[e_type]
+        interpolation = InterpolationMode.BICUBIC


Discussed with @datumbox on chat, I think it would be good in the future to factor this out somewhere else, maybe as a set of custom preset transforms

fmassa · 2021-08-26T09:47:03Z

torchvision/models/efficientnet.py

+from .._internally_replaced_utils import load_state_dict_from_url
+from torchvision.ops import StochasticDepth
+
+from torchvision.models.mobilenetv2 import ConvBNActivation, _make_divisible


nit: we might want to put some of those helper functions elsewhere in the future. ConvBNActivation could even be in torchvision.ops

Agreed. I want to defer this until Batteries Included is completed to see how many ops/layers we need and then refactor.

Possibly, A right place for helpers could be torchvision.layers or torchvision.nn.
One small reason why I think ops might not be good place is to distinguish post and preprocessing operations such as nms, IoU, box operations from generic layers that build models.

Torchtext does something similar https://github.com/pytorch/text/tree/main/torchtext/nn

There are a few candidates. such as ConvBNActivation, ConvActBN, squeezeExcite, MLP , to name a few.

Yeah we need to talk about this. I might also need to move the StochasticDepth layer from ops for exactly the same reason. Do you want to open an issue with the potential things we want to share across models along with their location? I remember there was an old issue asking about having nn but I would probably open a new one with increased scope (sharing blocks across models).

Yes, I will open a new issue and also list down potential things we would like to share 😃

Summary: * Adding code skeleton * Adding MBConvConfig. * Extend SqueezeExcitation to support custom min_value and activation. * Implement MBConv. * Replace stochastic_depth with operator. * Adding the rest of the EfficientNet implementation * Update torchvision/models/efficientnet.py * Replacing 1st activation of SE with SiLU. * Adding efficientnet_b3. * Replace mobilenetv3 assets with custom. * Switch to standard sigmoid and reconfiguring BN. * Reconfiguration of efficientnet. * Add repr * Add weights. * Update weights. * Adding B5-B7 weights. * Update docs and hubconf. * Fix doc link. * Fix typo on comment. Reviewed By: fmassa Differential Revision: D30793344 fbshipit-source-id: 74b5fed89fd251372d17234d33984b71abd1a860

datumbox added module: models new feature labels Aug 19, 2021

facebook-github-bot added the cla signed label Aug 19, 2021

datumbox marked this pull request as draft August 19, 2021 10:14

datumbox added enhancement topic: classification and removed new feature labels Aug 19, 2021

Adding code skeleton

411ce25

datumbox force-pushed the models/efficientnet branch from 4e5957f to 411ce25 Compare August 19, 2021 10:52

datumbox added 2 commits August 19, 2021 18:16

Adding MBConvConfig.

447a336

Extend SqueezeExcitation to support custom min_value and activation.

e173b8f

datumbox force-pushed the models/efficientnet branch 2 times, most recently from 95dedaf to edbe693 Compare August 20, 2021 07:59

Implement MBConv.

bb1bb17

datumbox force-pushed the models/efficientnet branch from edbe693 to bb1bb17 Compare August 20, 2021 08:00

datumbox and others added 4 commits August 20, 2021 14:21

Merge branch 'master' into models/efficientnet

fb2087e

Replace stochastic_depth with operator.

d15bf78

Adding the rest of the EfficientNet implementation

b78399b

Update torchvision/models/efficientnet.py

990826b

datumbox added 3 commits August 23, 2021 11:35

Merge branch 'main' into models/efficientnet

3bf8fbc

Replacing 1st activation of SE with SiLU.

697eee9

Adding efficientnet_b3.

8ff7604

datumbox force-pushed the models/efficientnet branch from 75c132a to 8ff7604 Compare August 23, 2021 15:03

datumbox mentioned this pull request Aug 23, 2021

RegNet in torchvision ? #2655

Closed

datumbox added 2 commits August 23, 2021 19:26

Replace mobilenetv3 assets with custom.

ca9e619

Switch to standard sigmoid and reconfiguring BN.

627dbe5

datumbox added 2 commits August 24, 2021 17:05

Reconfiguration of efficientnet.

4fc26bc

Add repr

14ce91f

datumbox force-pushed the models/efficientnet branch from 12fa4d2 to 14ce91f Compare August 24, 2021 17:49

datumbox added 5 commits August 24, 2021 18:51

Add weights.

0dca77d

Merge branch 'main' into models/efficientnet

4735fb6

Update weights.

d2bfd63

Adding B5-B7 weights.

8330fab

Update docs and hubconf.

901b282

datumbox marked this pull request as ready for review August 25, 2021 13:27

datumbox requested a review from fmassa August 25, 2021 13:27

datumbox force-pushed the models/efficientnet branch from def1c52 to 86f812a Compare August 25, 2021 15:28

Fix doc link.

7f8dae3

datumbox force-pushed the models/efficientnet branch from 86f812a to 7f8dae3 Compare August 25, 2021 15:42

fmassa approved these changes Aug 26, 2021

View reviewed changes

datumbox and others added 2 commits August 26, 2021 11:03

Fix typo on comment.

210b3e2

Merge branch 'main' into models/efficientnet

d6dd9df

datumbox merged commit 37a9ee5 into pytorch:main Aug 26, 2021

datumbox deleted the models/efficientnet branch August 26, 2021 10:03

oke-aditya mentioned this pull request Aug 29, 2021

[RFC] API For Common Layers In Torchvision #4333

Open

datumbox mentioned this pull request Sep 4, 2021

[RFC] TorchVision with Batteries included - Phase 1 #3911

Closed

16 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add EfficientNet Architecture in TorchVision #4293

Add EfficientNet Architecture in TorchVision #4293

datumbox commented Aug 19, 2021 •

edited

Loading

rvandeghen commented Aug 23, 2021

datumbox commented Aug 23, 2021

datumbox commented Aug 25, 2021

rwightman commented Aug 25, 2021 •

edited

Loading

datumbox commented Aug 25, 2021

fmassa left a comment

fmassa Aug 26, 2021

fmassa Aug 26, 2021

datumbox Aug 26, 2021

oke-aditya Aug 26, 2021

datumbox Aug 26, 2021 •

edited

Loading

oke-aditya Aug 26, 2021

Add EfficientNet Architecture in TorchVision #4293

Add EfficientNet Architecture in TorchVision #4293

Conversation

datumbox commented Aug 19, 2021 • edited Loading

rvandeghen commented Aug 23, 2021

datumbox commented Aug 23, 2021

datumbox commented Aug 25, 2021

rwightman commented Aug 25, 2021 • edited Loading

datumbox commented Aug 25, 2021

fmassa left a comment

Choose a reason for hiding this comment

fmassa Aug 26, 2021

Choose a reason for hiding this comment

fmassa Aug 26, 2021

Choose a reason for hiding this comment

datumbox Aug 26, 2021

Choose a reason for hiding this comment

oke-aditya Aug 26, 2021

Choose a reason for hiding this comment

datumbox Aug 26, 2021 • edited Loading

Choose a reason for hiding this comment

oke-aditya Aug 26, 2021

Choose a reason for hiding this comment

datumbox commented Aug 19, 2021 •

edited

Loading

rwightman commented Aug 25, 2021 •

edited

Loading

datumbox Aug 26, 2021 •

edited

Loading