-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[QST] Are there plans to add specialisations for Sm90? #1123
Comments
the link points to dual gemm example not dgemm. do you want dual gemm or dgemm? |
Ah sorry, laxness on my part. I meant dual GEMM.
…On Wed, 4 Oct 2023, 15:58 Haicheng Wu, ***@***.***> wrote:
the link points to dual gemm example not dgemm. do you want dual gemm or
dgemm?
—
Reply to this email directly, view it on GitHub
<#1123 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AF7CASFCIQHJBUBAE42GUETX5V2SPAVCNFSM6AAAAAA5SXO35GVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTONBXGAZDQOBSGY>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
you should be able to still run this example on h100 without changing the source code. just use sm90 in nvcc/cmake command line. this example does not use new features of hopper. we are looking to write a hopper dedicated version using cutlass 3.x |
Note: we with @joerowell and others are working on a diverged hard fork of xformers that has little to do with the mainstream 😉 We are rewriting the orignal kernels, doing a lot of optimizations, fixing bugs, etc. |
@joerowell optimizations for dual GEMM are on our roadmap for H1'24. |
This issue has been labeled |
This issue has been labeled |
@mnicely Hi! Is any update in roadmap for dual GEMM optimizations? Will this optimization appear? Or you reprioritized this task? Or it is too difficult to optimize for hopper ? |
What is your question?
I recently tried to change the type tags on the DGEMM examples to
cutlass::arch::Sm90
, which caused a load of compile errors. This is primarily because there's certain template specialisations missing forSm90
, e.g there's no specialisation like this.Is this something that's on the plans to be added / are PRs welcome?
The text was updated successfully, but these errors were encountered: