Multi-GPU training #49

wk565 · 2024-09-26T04:24:29Z

When I use multi-GPU training, I encounter the following problem：

subprocess.CalledProcessError: Command '['/home/a/anaconda3/envs/mambayolo/bin/python', '-m', 'torch.distributed.run', '--nproc_per_node', '2', '--master_port', '50193', '/home/a/.config/Ultralytics/DDP/_temp_ue6wdvcg123161149577872.py']' returned non-zero exit status 1.

EthanW-coder · 2024-09-28T17:47:00Z

Can you provide more detailed information about the error report, it will help to troubleshoot the real cause of the error report.

tjumaojingjun · 2024-11-06T09:26:23Z

I have the same issue, how did you solve it?

EricLiuUCAS · 2024-12-21T10:40:06Z

me too ，out of memory？

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-GPU training #49

Multi-GPU training #49

wk565 commented Sep 26, 2024

EthanW-coder commented Sep 28, 2024

tjumaojingjun commented Nov 6, 2024

EricLiuUCAS commented Dec 21, 2024

Multi-GPU training #49

Multi-GPU training #49

Comments

wk565 commented Sep 26, 2024

EthanW-coder commented Sep 28, 2024

tjumaojingjun commented Nov 6, 2024

EricLiuUCAS commented Dec 21, 2024