You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[W custom_kernel.cpp:149] nvrtc: error: invalid value for --gpu-architecture (-arch)
Failed to use NVRTC for JIT compilation in this Pytorch version, try another approach using CUDA compiler.. (To always disable NVRTC, please: export USE_NVRTC=0)
Traceback (most recent call last):
File "test.py", line 6, in <module>
cumsum_tutel = fast_cumsum_sub_one(matrix, dim=0) + 1
File "/home/jdhwang/.local/lib/python3.8/site-packages/tutel/jit_kernels/gating.py", line 22, in fast_cumsum_sub_one
return torch.ops.tutel_ops.cumsum(data)
File "/home/jdhwang/conda/envs/cl/lib/python3.8/site-packages/torch/_ops.py", line 502, in __call__
return self._op(*args, **kwargs or {})
RuntimeError: (0) == (cuModuleLoadDataEx(&hMod, image.c_str(), sizeof(options) / sizeof(*options), options, values)) INTERNAL ASSERT FAILED at "/tmp/pip-req-build-c9h2prbs/tutel/custom/custom_kernel.cpp":205, please report a bug to PyTorch. CHECK_EQ fails.
following #203, I exported export USE_NVRTC=1 and I am using RTX4090 with torch ('2.0.0+cu117') and Cuda 11.7 (nvcc as well).
The text was updated successfully, but these errors were encountered:
Does anyone of export USE_NVRTC=1 & export USE_NVRTC=0 work? Seems like it is environmental problem (e.g. Multi CUDA version / ..), and it isn't likely to happen if CUDA + Pytorch are in a clean docker container.
Hi,
I installed tutel via
python3 -m pip install --user --upgrade git+https://github.com/microsoft/tutel@main
I am running a test script
and facing error
following #203, I exported
export USE_NVRTC=1
and I am using RTX4090 with torch ('2.0.0+cu117') and Cuda 11.7 (nvcc as well).The text was updated successfully, but these errors were encountered: