You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[1,2]:Traceback (most recent call last):
[1,2]: File "OptimizationDriver.py", line 325, in
[1,2]: block.run()
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/optimize/process_block.py", line 157, in run
[1,2]: fom = self.train_model()
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/optimize/process_block.py", line 127, in train_model
[1,2]: checkpoint_interval=self.checkpoint_interval)
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/mpi/manager.py", line 420, in init
[1,2]: checkpoint=checkpoint, checkpoint_interval=checkpoint_interval)
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/mpi/manager.py", line 165, in init
[1,2]: self.make_comms(comm)
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/mpi/manager.py", line 266, in make_comms
[1,2]: checkpoint=self.checkpoint, checkpoint_interval=self.checkpoint_interval
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/mpi/process.py", line 493, in init
[1,2]: checkpoint=checkpoint, checkpoint_interval=checkpoint_interval )
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/mpi/process.py", line 121, in init
[1,2]: self.train()
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/mpi/process.py", line 556, in train
[1,2]: train_metrics = self.model.train_on_batch( x=batch[0], y=batch[1] )
[1,2]: File "/nfshome/vlimant/NNLO/nnlo/train/model.py", line 314, in train_on_batch
[1,2]: pred = self.model.forward(x)
[1,2]: File "/nfshome/vlimant/NNLO/examples/example_jedi_torch.py", line 74, in forward
[1,2]: E = torch.transpose(E, 1, 2).contiguous()
[1,2]:RuntimeError: CUDA out of memory. Tried to allocate 1.67 GiB (GPU 0; 7.93 GiB total capacity; 4.98 GiB already allocated; 1.09 GiB free; 1.33 GiB cached)
The text was updated successfully, but these errors were encountered:
hard to say how to solve this, or whether it has to be solve, but running
I get
The text was updated successfully, but these errors were encountered: