
GPU memory consumption too high? #1

Closed
jenspetersen opened this issue Dec 13, 2018 · 2 comments

jenspetersen commented Dec 13, 2018

Hey, thanks for implementing GQN in PyTorch! I can only fit batches of size 8 on my Titan Xp (12GB). Is that the same for you, or can you fit the default size of 36? In my own implementation I can manage a batch size of 36, but the results don't look too good, so I wanted to try your version :D

Best,
Jens

iShohei220 (Owner) commented Dec 13, 2018

Hi!

As you said, this implementation requires very high computational power because of its enormous number of parameters.
It is based on advice from Dr. Ali Eslami, the first author of the GQN paper, so the hyperparameter settings are exactly the same as in the original paper.
If you only have limited GPU memory, I recommend using the option --shared_core True (default: False) or --layers 8 (default: 12) in train.py to reduce the number of parameters.
Although these settings differ from the original paper, they still give good results as far as I have experimented.
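For example, a reduced-memory run could be started with either flag (a minimal sketch: the flag names are taken from the recommendation above, and any other arguments such as the dataset path are omitted):

    # share one core across generation steps instead of separate cores
    python train.py --shared_core True

    # or reduce the number of generation layers from 12 to 8
    python train.py --layers 8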

Thank you.

jenspetersen (Author) commented

Hi, yes, I totally missed that this uses separate cores by default. Thanks!
