[QST] Windows compatibility #1126
Comments
@ucgggg Thanks for reporting! Would you happen to have the opportunity to test with a newer CUDA Toolkit? Windows support is new to the CUTLASS 3.x series, so we've mainly been testing with CUDA 12.2.
@mhoemmen Thanks for your reply! I delved into the issue, and then wrote the following code:
Compile:
It works well with nvcc 12.2
@ucgggg Thanks for testing! Are you able to switch to CUDA 12.2?
@mhoemmen I tried to compile FlashAttention-2 with CUDA 12.2 and got a few different errors. But these new errors have nothing to do with CUTLASS. They happen in FlashAttention-2. CUTLASS works well with CUDA 12.2. So I think CUTLASS is compatible with CUDA 12.2, but not fully compatible with CUDA 11.8. Use this to reproduce: I just tested on Windows. I don't know if the same problem occurs on Linux.
@ucgggg Thanks so much for testing CUDA 11.8 on Windows! We test several different CUDA Toolkit versions on Linux, so I'm pretty confident that 11.8 would work for you there.
I came across the same issue when trying to build vLLM on Windows. With CUDA 11.8, the following check fails, which breaks unary operations such as
cutlass/include/cute/numeric/math.hpp Lines 63 to 67 in 5b283c8
@toothache Could you please post exactly what version of CUTLASS you used, and how you were calling CUTLASS?
I'm trying to compile the vLLM repo on Windows, and I encountered several build errors related to CUTLASS. I later discovered that the build failures were caused by the following checks when using CUDA 11.8. Switching to CUDA 12.x resolved those issues.
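For anyone who wants to fail fast instead of hitting these errors deep inside template code, here is a minimal sketch of a compile-time guard. It is not part of CUTLASS or vLLM. The helper name `toolkit_reported_ok_on_windows` is hypothetical, and the "12 or newer" threshold is only an assumption taken from the reports in this thread (11.8 fails, 12.x works), not an official support statement. `__CUDACC_VER_MAJOR__` and `__CUDACC_VER_MINOR__` are the version macros that nvcc defines.

```cpp
// Hypothetical helper, not a CUTLASS API: returns true for CUDA Toolkit
// versions that this thread reports as working with CUTLASS 3.x on Windows.
// Assumption: any 12.x toolkit is fine; the minor version is not checked.
constexpr bool toolkit_reported_ok_on_windows(int major, int /*minor*/) {
  return major >= 12;
}

// The guard only fires when compiling with nvcc on Windows. With any other
// compiler or platform the macros are undefined and nothing is checked.
#if defined(_WIN32) && defined(__CUDACC_VER_MAJOR__)
static_assert(
    toolkit_reported_ok_on_windows(__CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__),
    "CUTLASS 3.x on Windows was reported to fail with CUDA 11.8; try CUDA 12.x");
#endif
```

Including something like this ahead of the CUTLASS headers would turn the long list of overload errors into one readable message. It does not fix the underlying check.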
@toothache Thanks for reporting this! I've spawned a separate bug to track this: #1689. Please put all further discussion there. Thanks!
I'm trying to compile FlashAttention-2 on Windows, a platform it does not support yet, and I got a lot of errors. One of the errors can be reproduced simply with the following code, using only CUTLASS:
The errors are as follows:
include/cute/algorithm/functional.hpp(104): error : no instance of overloaded function "cute::abs" matches the argument list
include/cute/layout.hpp(590): error : no instance of overloaded function "cute::as_arithmetic_tuple" matches the argument list
......
Visual Studio 2022
CUDA 11.8
Windows 10
CUTLASS 3.2.1
Is there an error in the above code, or does CUTLASS not fully support Windows yet?