-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Segmentation fault (core dumped) #21
Comments
updating gcc to 4.9 and rebuild can fix it |
Hi, Could you run the python script via gdb and show the traceback?
Also, Does this segfault when running on the CPU? Thanks! |
@youngkyoonjang Hello,I also encountered the problem, after the output of "Start training" ,the process stopped without any output. And I saw "[1] 8817 segmentation fault (core dumped) python tools/train_net.py --config-file" in the shell. |
Hi @ll490187880 , Could you try running the aforementioned commands and give the output of the stack trace? gdb python
>> run "tools/train_net.py --config-file ..." and once it crashes, run
and paste the result? |
@fmassa yeah, as follows: |
@fmassa I did it before following @senlinuc 's comments (updating gcc to 4.9 / rebuilding it): Program received signal SIGSEGV, Segmentation fault. And Yes! Segmentation fault (core dumped) happens even when I run on the CPU (python webcam.py --min-image-size 300 MODEL.DEVICE cpu). |
Thanks for the stack trace! This is a known issue with gcc < 4.9, see pytorch/pytorch#6987. While compiling the extensions you probably saw a warning like
The solution is to upgrade to gcc 4.9 or higher following the instructions in https://gist.github.com/goldsborough/d466f43e8ffc948ff92de7486c5216d6 I'm closing the issue, and I'll be adding a note on the TROUBLESHOOTING section mentioning this issue. Let us know if after updating gcc and recompiling the library (after |
I've just improved the README with instructions on how to address this issue in #38 |
#0 0x00007ffff7de6d56 in _dl_relocate_object () from /lib64/ld-linux-x86-64.so.2 |
It breaks by returning message 'Segmentation fault (core dumped)'.
I think I followed the installation guide correctly. But when I run the code (maskrcnn-benchmark/demo/webcam.py), it breaks by returning 'Segmentation fault (core dumped)'.
The specific location returning the message is
line 27 in the file 'boxlist_ops.py': keep = _box_nms(boxes, score, nms_thresh)
from
line 114 in the file 'inference.py': boxlist = boxlist_nms( .... )
from
line 138 in the file 'inference.py': sampled_boxes.append(self.forward_for_single_feature_map(a, o, b))
from
line 122 in the file 'rpn.py': boxes = self.box_selector_test(anchors, objectness, rpn_box_regression)
from
line 96 in the file 'rpn.py': return self._forward_test(anchors, objectness, rpn_box_regression)
from
line 50 in the file 'generalised_rcnn.py': proposals, proposal_losses = self.rpn(images, features,
targets)
from
line 205 in the file 'predictor.py': predictions = self.model(image_list)
from
file 'webcam.py': composite = coco_demo.run_on_opencv_image(img)
I thought it was because of my small GPU memory problem on my local machine (PC). But when I tried to run the same code on an available deep learning machine which has enough (about 11GB GPU) memory. It still returns the same message and breaks.
What is my problem? It would be grateful if someone has an answer. Thanks in advance.
Best,
Young
The text was updated successfully, but these errors were encountered: