-
Notifications
You must be signed in to change notification settings - Fork 751
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CUDA] Failed to build crossEntropy with error in backend: Cannot select: t38: v2f16 = bitcast t37 #5969
Comments
I didn't observe compile/run errors using HIP backend. |
This looks like a bug in NVPTX LLVM target back-end. |
Would it not be an UB to typecast between |
The cuda compiler supports the conversion. |
I've run it on a titan machine with DPCPP just now: # ./crossEntropy.run
Matrix size(bs * W * H) = 128 * 81 * 8732
LossNLL_FWD CPU time(ms)=. 2.01997
LossNLL_FWD GPU time(ms)=. 2.00132
FWDBandWidth = 179.963433 (GB / s), 386722816.000000
LossNLL_BWD CPU time(ms)=. 1.82208
LossNLL_BWD GPU time(ms)=. 1.80568
BWDBandWidth = 214.169755 (GB / s), 386722816.000000
|
Make sure NVPTX backend can handle bitcasting between `float` and `<2 x half>` types. This was discovered through: intel/llvm#5969 I'm not suggesting that such bitcasts make much sense, but it feels like the compiler should not hard crash on them. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D124171
This has been merged into |
Make sure NVPTX backend can handle bitcasting between `float` and `<2 x half>` types. This was discovered through: intel/llvm#5969 I'm not suggesting that such bitcasts make much sense, but it feels like the compiler should not hard crash on them. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D124171
Describe the bug
[CUDA] Failed to build crossEntropy with error in backend: Cannot select: t38: v2f16 = bitcast t37
To Reproduce
Error message:
Environment
The text was updated successfully, but these errors were encountered: