Overhead of setting workspace? #23

chenhongyu2048 · 2024-10-28T08:24:48Z

Hello,
I'm currently trying to use the grouped GEMM code in my project, and I've noticed that a new workspace is allocated in every iteration (via torch::Tensor workspace = torch::empty(workspace_size, options);). That seems unnecessary, because the CUTLASS workspace is reusable. The repeated allocation also seems to hurt performance when the kernel is called frequently, such as across many MoE layers, or when MxNxK is large. Has anyone measured the effect of this?
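To make the question concrete, here is a rough sketch of the kind of reuse I mean: a grow-only cache that allocates only when the requested size exceeds what is already held. The WorkspaceCache class and its method names are hypothetical and not part of this repository, and std::vector stands in for the device buffer so the sketch compiles without libtorch; in the real code the storage would be the torch::Tensor returned by torch::empty.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical grow-only workspace cache (not part of the library).
// std::vector is a host-side stand-in for the device buffer; note that
// resize() zero-fills, whereas torch::empty leaves memory uninitialized.
class WorkspaceCache {
 public:
  // Returns a buffer of at least `bytes` bytes. A new allocation happens
  // only when the cached buffer is too small for the request.
  std::uint8_t* get(std::size_t bytes) {
    if (bytes > buffer_.size()) {
      buffer_.resize(bytes);
      ++allocations_;
    }
    return buffer_.data();
  }

  // Number of times the buffer had to grow.
  std::size_t allocations() const { return allocations_; }

  // Current size of the cached buffer in bytes.
  std::size_t capacity() const { return buffer_.size(); }

 private:
  std::vector<std::uint8_t> buffer_;
  std::size_t allocations_ = 0;
};
```

With one such cache per device, repeated calls with the same or a smaller workspace_size would skip the allocation entirely. One caveat: a single shared buffer is only safe if the kernels that use it do not run concurrently, for example on different CUDA streams.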
