
CIFAR100 has different architecture from official implementation #13

Closed
zijian-hu opened this issue Sep 9, 2020 · 1 comment

@zijian-hu

In your implementation, WRN-28-10 is used, which has about 36M parameters.
Your model definition:

FixMatch-pytorch/train.py

Lines 165 to 169 in 9044f2e

elif args.dataset == 'cifar100':
    args.num_classes = 100
    if args.arch == 'wideresnet':
        args.model_depth = 28
        args.model_width = 10

I used the following code to get the number of parameters

wrn = build_wideresnet(depth=28, widen_factor=10, dropout=0, num_classes=100)

print(f"# params: {sum(p.numel() for p in wrn.parameters()):,}")

which gives the following output:
[screenshot: the script's output, reporting about 36M parameters]

In the official TensorFlow implementation, a WRN with about 23M parameters is used for CIFAR100 (see below image).
[screenshot: parameter count from the official TensorFlow repo, about 23M]
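For reference, the gap between the two counts can be reproduced analytically without building the model. Below is a self-contained sketch that counts parameters of a standard pre-activation WRN-28-k; it assumes bias-free 3x3 convolutions, a 1x1 projection shortcut on the first block of each group, and affine batch-norm layers (the usual WideResNet conventions — the exact repo code may differ slightly):

```python
# Hypothetical stand-alone parameter counter for a pre-activation
# WideResNet-28-k. Assumes bias-free convs and 1x1 projection
# shortcuts, matching common WRN implementations.

def wrn_params(depth: int, k: int, num_classes: int) -> int:
    n = (depth - 4) // 6            # blocks per group (4 for depth 28)
    widths = [16, 16 * k, 32 * k, 64 * k]

    conv = lambda cin, cout, ksize: cin * cout * ksize * ksize  # no bias
    bn = lambda c: 2 * c                                        # gamma + beta

    total = conv(3, widths[0], 3)   # initial 3x3 conv
    cin = widths[0]
    for cout in widths[1:]:
        for block in range(n):
            total += bn(cin) + conv(cin, cout, 3)    # BN-ReLU-conv
            total += bn(cout) + conv(cout, cout, 3)  # BN-ReLU-conv
            if block == 0 and cin != cout:
                total += conv(cin, cout, 1)          # projection shortcut
            cin = cout
    total += bn(cin)                          # final BN
    total += cin * num_classes + num_classes  # linear classifier
    return total

print(f"WRN-28-10: {wrn_params(28, 10, 100):,}")  # ~36.5M
print(f"WRN-28-8:  {wrn_params(28, 8, 100):,}")   # ~23.4M
```

Under these assumptions, widen factor 10 gives roughly 36.5M parameters, while factor 8 gives roughly 23.4M — consistent with the ~23M figure from the official TensorFlow configuration, which suggests the intended CIFAR-100 architecture is WRN-28-8 rather than WRN-28-10.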

Notes

The CLI args for the official code can be found in this issue.

@zijian-hu zijian-hu changed the title Architecture used for CIFAR100 is different from official implementation CIFAR100 has different architecture from official implementation Sep 9, 2020
@kekmodel
Owner

kekmodel commented Nov 9, 2020

fixed! thanks!

@kekmodel kekmodel closed this as completed Nov 9, 2020