Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions on the communication size of AlexNet (on CIFAR 10) and LeNet (on MNIST) in Table 2 of the paper #8

Open
SkyTu opened this issue Nov 17, 2022 · 1 comment

Comments

@SkyTu
Copy link

SkyTu commented Nov 17, 2022

We noticed that the shape of data samples in CIFAR 10 is 32x32x3, while for MNIST is 28x28x1. Moreover, the number of parameters in AlexNet is much bigger than LeNet. Combining these two observations, we arrived at why, with the same protocol (e.g., Falcon), the communication size of LeNet training MNIST is 485.90 GB while the communication size of AlexNet training CIFAR 10 is 382.18GB. Is there any optimization used in AlexNet?

@ElleryQu
Copy link

ElleryQu commented Jun 3, 2023

Both AlexNet and VGG16 use avgpool instead of Maxpool, I guess that's the reason.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants