Benchmark scripts for training time performance #139
Yes, you are right! Training speed increased once more with this PR. However, training speed may still vary on the same GPU across different PyTorch or CUDA versions. I will try to keep the performance table as up to date as possible and provide training-time evaluation scripts for verification.
Thank you for your response. I checked my code in message_passing.py. The code is still the old version, and torch embedding is not used. My environment is Ubuntu 16.04, PyTorch 1.0, CUDA 9.0, and cuDNN 7.0. For the Cora and Citeseer datasets my experimental results are similar to yours; only PubMed is different. I only modified the dataset name in gcn.py/gat.py. Do I need to modify any other code?
I added a small script in
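For readers reproducing such numbers: differences like 2.0s vs. 0.7s often come down to how the clock is read, since CUDA kernels launch asynchronously. A minimal timing harness (an illustrative sketch, not the actual script from the repository; `train_step` is a hypothetical stand-in for one optimizer step) could look like this:

```python
import time

def benchmark(train_step, epochs=200, warmup=10):
    """Time `epochs` calls of `train_step` after `warmup` untimed calls.

    On a GPU, a real benchmark would call torch.cuda.synchronize()
    right before each perf_counter() read, so that queued kernels are
    actually finished when the clock is sampled.
    """
    for _ in range(warmup):  # warm-up: memory allocation, cuDNN autotuning, etc.
        train_step()
    start = time.perf_counter()
    for _ in range(epochs):
        train_step()
    return time.perf_counter() - start

if __name__ == "__main__":
    # Hypothetical stand-in for one GCN training step.
    def dummy_step():
        sum(i * i for i in range(1000))

    total = benchmark(dummy_step, epochs=200)
    print(f"200 epochs took {total:.3f}s")
```

Without the synchronization, `perf_counter()` can return before the GPU work completes, which makes runs look faster than they are and explains part of the variance across setups.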
Thank you for your response. Another question is about distributed computation. In the examples the datasets are small, but in real cases a dataset may be too large for one worker, so how to split the graph into subgraphs may be a problem. Do you have any suggestions? Or does PyG have the potential to support distributed computation?
Hi, you can always use more workers if this proves to be beneficial. We support distributed training via
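To make the "split the graph into subgraphs" concern concrete, here is a minimal sketch (plain Python, not a PyG API) of one naive strategy: assign nodes to workers by modulo hashing, then separate each worker's incident edges into local edges and cut edges that cross worker boundaries. Cut edges are exactly what makes distributed graph training hard, since their messages require communication:

```python
def partition_nodes(num_nodes, num_workers):
    """Assign each node id to a worker by simple modulo hashing."""
    return {n: n % num_workers for n in range(num_nodes)}

def local_and_cut_edges(edges, assignment, worker):
    """Split the edges incident to `worker` into local edges (both
    endpoints owned by `worker`) and cut edges (one endpoint owned
    by another worker, requiring cross-worker communication)."""
    local, cut = [], []
    for u, v in edges:
        owners = (assignment[u], assignment[v])
        if owners == (worker, worker):
            local.append((u, v))
        elif worker in owners:
            cut.append((u, v))
    return local, cut

if __name__ == "__main__":
    edges = [(0, 2), (0, 1), (1, 3)]
    assignment = partition_nodes(4, num_workers=2)
    print(local_and_cut_edges(edges, assignment, worker=0))
```

Real systems replace the modulo hash with partitioners that minimize the number of cut edges, but the bookkeeping shape is the same.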
I mean, if the graph is too large for one GPU to store, how do we store it?
Sadly, we do not currently support giant graph processing. Giant graphs are usually processed via sampling techniques. This is a rather difficult but important feature for PyG, and it is definitely on my to-do list. I will close this request in favour of #64.
❓ Questions & Help
Hi,
I ran gcn.py in the /examples folder and changed the dataset name to "PubMed". The website reports a training time of about 2.0s for this dataset with gcn.py, but on my server it only needs about 0.7s. The training time of gat.py is about 3s, not 12s. My GPU is a GTX 1080 Ti, 200 epochs. Do you know the reason?