Questions on the communication size of AlexNet (on CIFAR 10) and LeNet (on MNIST) in Table 2 of the paper #8

SkyTu · 2022-11-17T01:55:08Z

We noticed that CIFAR-10 samples have shape 32x32x3, while MNIST samples have shape 28x28x1. Moreover, AlexNet has many more parameters than LeNet. Given these two observations, we would expect AlexNet on CIFAR-10 to need more communication. Why, then, under the same protocol (e.g., Falcon), is the communication size of LeNet training on MNIST 485.90 GB, while that of AlexNet training on CIFAR-10 is only 382.18 GB? Is there any optimization used in AlexNet?

ElleryQu · 2023-06-03T09:51:58Z

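Both AlexNet and VGG16 use average pooling (avgpool) instead of max pooling (maxpool). I guess that's the reason: max pooling needs secure comparisons, which are communication-heavy, whereas average pooling is essentially linear.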
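
To illustrate the pooling point, here is a minimal sketch of why an average-style pool can be cheap under secret sharing. It assumes plain additive sharing over the ring Z_{2^32}, which is a simplification and not Falcon's actual replicated sharing; the helper names `share`, `sum_pool_2x2`, and `reconstruct` are hypothetical. It uses sum pooling (average pooling before the final scaling by a public constant) so the arithmetic stays exact.

```python
import numpy as np

MOD = 2**32  # ring size; an assumption made for this sketch


def share(x, n_parties=3, seed=0):
    """Split an integer array into n additive shares modulo MOD."""
    rng = np.random.default_rng(seed)
    shares = [rng.integers(0, MOD, size=x.shape, dtype=np.int64)
              for _ in range(n_parties - 1)]
    # The last share is chosen so that all shares sum to x modulo MOD.
    last = (x.astype(np.int64) - sum(shares)) % MOD
    return shares + [last]


def sum_pool_2x2(a):
    """Sum over non-overlapping 2x2 windows of an (H, W) array, modulo MOD."""
    h, w = a.shape
    return a.reshape(h // 2, 2, w // 2, 2).sum(axis=(1, 3)) % MOD


def reconstruct(shares):
    """Recombine additive shares into the plaintext value."""
    return sum(shares) % MOD


x = np.arange(16).reshape(4, 4)

# Each party pools its own share locally; no messages are exchanged.
pooled_shares = [sum_pool_2x2(s) for s in share(x)]

# The locally pooled shares reconstruct to the pooled plaintext.
print(reconstruct(pooled_shares))
print(sum_pool_2x2(x))
```

The same trick does not work for max pooling: the maximum of the shares is not a share of the maximum, so the parties have to run an interactive comparison protocol for every window. That is a plausible source of the extra communication, though the actual per-layer breakdown would need to be checked against the code.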
