-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CUBLAS_STATUS_EXECUTION_FAILED when calling cublasSgemm
#29
Comments
cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
cublasSgemm
Could you provide more details, e.g., where/which line does this error occur in the project? |
The error occurred in this line. |
I am getting the same issue. Tried different cuda versions no luck. Were you @Lopa07 able to fix it? Thanks! |
I was able to fix this by using cuda-toolkit 10.2 with cudnn8.7 for cuda10.2 (https://developer.nvidia.com/rdp/cudnn-archive#a-collapse870-102). Hope this helps. @Lopa07 The following version works as well as long as the appropriate cudann version is installed. This guide is helpful for cudann installation. |
Getting the error,
RuntimeError: CUDA error: CUBLAS_STATUS_EXECUTION_FAILED when calling 'cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)'
while trying the following training command:I am using python 3.7, CUDA 10.1.
The text was updated successfully, but these errors were encountered: