FatalError: Segmentation fault is detected by the operating system. #3644

Closed
2 of 3 tasks
dulicui742 opened this issue Feb 7, 2024 · 6 comments
Labels
bug Something isn't working

Comments

@dulicui742

Search before asking

Describe the Bug

While running the demo at https://github.com/PaddlePaddle/PaddleSeg/blob/release/2.6/docs/quick_start_cn.md, both python train.py and python val.py run normally, but python predict.py fails with the following error:

C++ Traceback (most recent call last):

0 ImagingZipEncode
1 deflateReset


Error Message Summary:

FatalError: Segmentation fault is detected by the operating system.
[TimeInfo: *** Aborted at 1707286977 (unix time) try "date -d @1707286977" if you are using GNU date ***]
[SignalInfo: *** SIGSEGV (@0x0) received by PID 4193083 (TID 0x7f6cb3650480) from PID 0 ***]

Segmentation fault (core dumped)

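After debugging, I confirmed that the error is raised by the pred_mask.save(pred_saved_path) statement at the end of the script. Further debugging showed that the crash occurs while executing errcode, data = encoder.encode(bufsize)[1:] in ImageFile.py.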
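
To get a Python-level stack next to the C++ traceback, the standard-library faulthandler module can be enabled around the failing call. The following is a minimal sketch, not part of PaddleSeg or Pillow: run_with_fault_traces and its trace_file parameter are hypothetical names chosen for illustration, and the thread does not confirm how faulthandler interacts with Paddle's own signal handler.

```python
import faulthandler
import sys


def run_with_fault_traces(func, *args, trace_file=None, **kwargs):
    """Call func with faulthandler active so a fatal signal dumps the Python stack.

    Hypothetical debugging helper; trace_file must be a real file object
    (one with a file descriptor). It defaults to sys.stderr.
    """
    was_enabled = faulthandler.is_enabled()
    if not was_enabled:
        # On SIGSEGV and similar signals, write the traceback of every thread.
        faulthandler.enable(file=trace_file or sys.stderr, all_threads=True)
    try:
        return func(*args, **kwargs)
    finally:
        # Restore the previous state so the helper has no lasting side effect.
        if not was_enabled:
            faulthandler.disable()
```

Under these assumptions, the failing statement could be wrapped as run_with_fault_traces(pred_mask.save, pred_saved_path). The same handler can also be switched on for the whole script without code changes by running python -X faulthandler predict.py or by setting the PYTHONFAULTHANDLER environment variable.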

Environment

------------Environment Information-------------
platform: Linux-5.19.0-50-generic-x86_64-with-glibc2.35
Python: 3.9.18 (main, Sep 11 2023, 13:41:44) [GCC 11.2.0]
Paddle compiled with cuda: True
NVCC: Build cuda_11.7.r11.7/compiler.31294372_0
cudnn: 8.4
GPUs used: 1
CUDA_VISIBLE_DEVICES: 0
GPU: ['GPU 0: NVIDIA RTX', 'GPU 1: NVIDIA RTX', 'GPU 2: NVIDIA RTX', 'GPU 3: NVIDIA RTX']
GCC: gcc (Ubuntu 11.3.0-1ubuntu1~22.04.1) 11.3.0
PaddleSeg: 2.9.0
PaddlePaddle: 2.6.0
OpenCV: 4.5.5

Paddle was installed with conda:
conda install paddlepaddle-gpu==2.6.0 cudatoolkit=11.6 -c https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/Paddle/ -c conda-forge

Bug description confirmation

  • I confirm that the bug reproduction steps, code change description, and environment information have been provided, and that the problem can be reproduced.

Are you willing to submit a PR?

  • I'd like to help by submitting a PR!
@dulicui742 dulicui742 added the bug Something isn't working label Feb 7, 2024
@shiyutang
Collaborator

Hello. Based on the information you provided, this error occurs while saving the file. Please check whether pred_mask contains any invalid values.
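
One way to act on this suggestion is to summarize the mask values before saving. The sketch below is an illustration only: summarize_mask_values is a hypothetical helper, it works on a flat iterable of plain Python numbers, and the default upper bound of 255 assumes the mask is saved as an 8-bit image, which the thread does not state explicitly.

```python
import math


def summarize_mask_values(values, max_label=255):
    """Count mask values that would not fit an 8-bit label image.

    Hypothetical helper: values is a flat iterable of numbers, and
    max_label=255 is an assumption about the saved image format.
    """
    summary = {"count": 0, "non_finite": 0, "non_integer": 0,
               "out_of_range": 0, "min": None, "max": None}
    for v in values:
        summary["count"] += 1
        # NaN and infinity cannot be compared or cast meaningfully, so count and skip.
        if isinstance(v, float) and not math.isfinite(v):
            summary["non_finite"] += 1
            continue
        if v != int(v):
            summary["non_integer"] += 1
        if v < 0 or v > max_label:
            summary["out_of_range"] += 1
        summary["min"] = v if summary["min"] is None else min(summary["min"], v)
        summary["max"] = v if summary["max"] is None else max(summary["max"], v)
    return summary
```

If the prediction is available as a NumPy array named pred (an assumption), it could be passed as summarize_mask_values(pred.ravel().tolist()). A summary with zero non_finite, non_integer, and out_of_range counts would suggest the data itself is not the cause.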

@TheMattBin

I faced a similar issue when training my own model, HrSegNet. I could train it on Windows but not on Linux, and my Linux environment is the same as yours. I'm not sure what causes the issue.

@ezone1987

Same segmentation fault here.

@TingquanGao
Collaborator

Thanks for reporting this issue. As it has been inactive for a long time, we are closing it. If you have any questions, please feel free to reopen it or open a new issue, and we will follow up and resolve it.

@yhw2003

yhw2003 commented Dec 12, 2024

I ran into the same problem. The overlay image is generated normally, but the mask cannot be saved. This bug has not been resolved.

@yhw2003

yhw2003 commented Dec 12, 2024

This problem appears to have been introduced in PaddlePaddle 2.6; switching to version 2.5 works normally.
pip install paddlepaddle-gpu==2.5 -i https://pypi.tuna.tsinghua.edu.cn/simple
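
To check whether an environment falls in the version series reported here, the installed distribution version can be read with the standard-library importlib.metadata module. This is a rough sketch: both function names are hypothetical, the distribution names paddlepaddle-gpu and paddlepaddle are taken from the install commands in this thread, and treating every 2.6.x release as affected is an assumption based only on the comment above.

```python
from importlib import metadata


def paddle_version_is_suspect(version):
    """Return True if a version string belongs to the 2.6 series.

    Assumption: the whole 2.6.x series is affected; the thread only
    reports 2.6.0 as failing and 2.5 as working.
    """
    parts = version.split(".")
    return len(parts) >= 2 and parts[0] == "2" and parts[1] == "6"


def installed_paddle_version(dist_names=("paddlepaddle-gpu", "paddlepaddle")):
    """Return the version of the first installed Paddle distribution, or None."""
    for name in dist_names:
        try:
            return metadata.version(name)
        except metadata.PackageNotFoundError:
            continue
    return None
```

A script could call installed_paddle_version() at startup and print a warning when paddle_version_is_suspect returns True for the result.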


6 participants