-
Notifications
You must be signed in to change notification settings - Fork 2.5k
RuntimeError: CUDA error: a PTX JIT compilation failed #519
Comments
you probably have a version mismatch between CUDA and your environment? Can you remove the |
Ok, I removed build folder and ran Here's what I've got |
Can you try following the installation instructions in
|
I created the new conda environment and followed the instructions from #509 The setup.py imports function _find_cuda_home() from torch/utils/cpp_extension.py, where the line 38 is |
Hey, I would make sure that your installation of CUDA / nvcc are visible from the default paths. If Line 31 in 4a21d69
and recompile again. But this might be a sign that your install of CUDA is not in a standard place, or that you don't have all the environment variables setup properly |
i had this error when cupy is imported and some of its methods called, then pytorch failes. when cupy is removed from the runtime, then pytorch executes normally @fmassa |
related: pytorch/pytorch#21004 |
❓ Questions and Help
I try to run the webcam.py demo and get the following error:
...
File "/home/mvp/anaconda3/envs/mrcnn/lib/python3.6/site-packages/maskrcnn-benchmark/maskrcnn_benchmark/structures/boxlist_ops.py", line 27, in boxlist_nms
keep = _box_nms(boxes, score, nms_thresh)
RuntimeError: CUDA error: a PTX JIT compilation failed (launch_kernel at /opt/conda/conda-bld/pytorch_1549630534704/work/aten/src/ATen/native/cuda/Loops.cuh:62)
frame #0: c10::Error::Error(c10::SourceLocation, std::string const&) + 0x45 (0x7fb8b1b2fcf5 in /home/mvp/anaconda3/envs/mrcnn/lib/python3.6/site-packages/torch/lib/libc10.so)
frame #1: void at::native::gpu_index_kernel<__nv_dl_wrapper_t<__nv_dl_tag<void ()(at::TensorIterator&, c10::ArrayRef, c10::ArrayRef), &(void at::native::index_kernel_impl<at::native::OpaqueType<8> >(at::TensorIterator&, c10::ArrayRef, c10::ArrayRef)), 1u>> >(at::TensorIterator&, c10::ArrayRef, c10::ArrayRef, __nv_dl_wrapper_t<__nv_dl_tag<void ()(at::TensorIterator&, c10::ArrayRef, c10::ArrayRef), &(void at::native::index_kernel_impl<at::native::OpaqueType<8> >(at::TensorIterator&, c10::ArrayRef, c10::ArrayRef)), 1u>> const&) + 0x339 (0x7fb8b73763f9 in /home/mvp/anaconda3/envs/mrcnn/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so)
frame #2: + 0x27999b6 (0x7fb8b73719b6 in /home/mvp/anaconda3/envs/mrcnn/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so)
frame #3: + 0x279a1e5 (0x7fb8b73721e5 in /home/mvp/anaconda3/envs/mrcnn/lib/python3.6/site-packages/torch/lib/libcaffe2_gpu.so)
frame #4: + 0x6847da (0x7fb8b23c57da in /home/mvp/anaconda3/envs/mrcnn/lib/python3.6/site-packages/torch/lib/libcaffe2.so)
...
Collected environment information:
PyTorch version: 1.0.1.post2
Is debug build: No
CUDA used to build PyTorch: 10.0.130
OS: Ubuntu 18.04.2 LTS
GCC version: (Ubuntu 6.5.0-2ubuntu1~18.04) 6.5.0 20181026
CMake version: version 3.10.2
Python version: 3.6
Is CUDA available: Yes
CUDA runtime version: 10.1.105
GPU models and configuration: GPU 0: GeForce GTX 1080 Ti
Nvidia driver version: 415.27
cuDNN version: Could not collect
Versions of relevant libraries:
[pip] numpy==1.14.3
[pip] numpydoc==0.8.0
[conda] blas 1.0 mkl
[conda] mkl 2018.0.2 intel_1 intel
[conda] mkl_fft 1.0.1 np114py36_intel_0 [intel] intel
[conda] mkl_random 1.0.1 np114py36_intel_0 [intel] intel
The text was updated successfully, but these errors were encountered: