
Is this a bug? - unnecessary CPU/GPU copying in supporters.py just for aggregating loss #12408

Closed · isvogor-foi opened this issue Mar 22, 2022 · 4 comments · Fixed by #12430
Labels: performance, question

Comments

isvogor-foi (Contributor) commented Mar 22, 2022:

I've noticed that while training, this chunk of code eats up a lot of time:

https://github.com/PyTorchLightning/pytorch-lightning/blob/0b682b807a76b732492a3d43e3208a80c80b6c2f/pytorch_lightning/trainer/supporters.py#L76-L80

I'm wondering whether this is a bug, and why we don't simply allocate the buffer on the input tensor's device:

    self.memory = torch.zeros(self.window_length, *x.shape, device=x.device)

It works, and it's much faster.
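
A minimal sketch of the pattern under discussion, using a simplified stand-in for the accumulator in supporters.py (names and structure simplified, not the exact code). The commented-out lines show the current allocate-on-CPU-then-copy behavior; the live code shows the proposed allocation on x.device:

```python
import torch

class RunningAccum:
    """Simplified windowed tensor accumulator (sketch, not the real TensorRunningAccum)."""

    def __init__(self, window_length: int):
        self.window_length = window_length
        self.memory = None
        self.current_idx = 0

    def append(self, x: torch.Tensor) -> None:
        # Current pattern (roughly): the buffer is created on the default
        # device (CPU), so a GPU loss tensor is moved across on every call:
        #   if self.memory is None:
        #       self.memory = torch.zeros(self.window_length, *x.shape)
        #   if self.memory.device != x.device:
        #       x = x.to(self.memory)  # device-to-host copy each step
        #
        # Proposed change: allocate the buffer on the incoming tensor's
        # device (and dtype), so no per-step transfer is needed.
        if self.memory is None:
            self.memory = torch.zeros(
                self.window_length, *x.shape, device=x.device, dtype=x.dtype
            )
        self.memory[self.current_idx] = x.detach()
        self.current_idx = (self.current_idx + 1) % self.window_length

    def mean(self) -> torch.Tensor:
        # Average over the window (assumes it has been filled at least once).
        return self.memory.mean(dim=0)
```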

cc @Borda @akihironitta

isvogor-foi changed the title from "Is this a bug?" to "Is this a bug? - unnecessary CPU/GPU copying in supporters.py just for aggregating loss" on Mar 22, 2022
rohitgr7 (Contributor) commented:

This will probably be removed at some point: #9372

carmocca (Contributor) commented:

This is a very old piece of code. I guess the choice was to prefer lower GPU memory usage over speed.

carmocca added the performance and question labels on Mar 22, 2022
isvogor-foi (Contributor, Author) commented:

Well, I'm not sure how much impact this has on memory, but it's an easy fix. Should I submit a PR?

I've made some measurements: in the advance function I measured how long each training step takes, then summed that up across all epochs for a simple MNIST example.

Never mind the other bars in the plot; focus on the red highlighted one. The total run time is around 105 s, of which the green bar (10 s) is the actual training and the remaining red portion is the loss update.

[Plot: runtime breakdown for the MNIST run; the green bar is the training step (~10 s), the red bar is the loss aggregation (the rest of the ~105 s total).]

Just adding device=x.device fixes this. I see no reason why this shouldn't be fixed...
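
For reference, timings like these can be collected with a small wrapper around the step being measured; this is a hedged sketch of that kind of measurement, not the instrumentation actually used (timed_step is a hypothetical helper):

```python
import time
import torch

def timed_step(step_fn, *args, **kwargs):
    """Wall-clock timing of one training step. CUDA launches are asynchronous,
    so we synchronize before and after to capture the real GPU time."""
    if torch.cuda.is_available():
        torch.cuda.synchronize()
    start = time.perf_counter()
    out = step_fn(*args, **kwargs)
    if torch.cuda.is_available():
        torch.cuda.synchronize()
    return out, time.perf_counter() - start
```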

carmocca (Contributor) commented:

Yes, go ahead!

isvogor-foi added a commit to isvogor-foi/pytorch-lightning that referenced this issue Mar 23, 2022