[QST] Does CUTLASS 3.5.1 support int4 x float16 GEMMs natively? #1928

SimpleTheoryOfTypes · 2024-11-07T23:50:41Z

What is your question?
By "natively," I mean without relying on third-party implementations. If I understand correctly, FasterTransformer and TVM have already developed their own CUTLASS extensions for constructing INT4/INT8 x FLOAT16 GEMMs. Can the latest CUTLASS release already do this on its own? Thanks!

thakkarV · 2024-11-08T00:04:17Z

see example 55

SimpleTheoryOfTypes · 2024-11-08T00:07:53Z

> see example 55

Thank you so much! Sorry, I forgot to mention that my question is about int4 x fp16 GEMMs on Ampere, not Hopper. :)

thakkarV · 2024-11-08T00:15:52Z

#1084

github-actions · 2024-12-08T00:27:56Z

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

SimpleTheoryOfTypes added the "? - Needs Triage" and "question" labels Nov 7, 2024

github-actions bot added the inactive-30d label Dec 8, 2024
