Fix datatype issue with Sparse Attention softmax #363

jeffra · 2020-09-04T17:58:17Z

Fixes a dataype issue with softmax where the number of blocks being sent to the Triton kernel source was a torch.Tensor but should have been a python integer. On some environments (e.g., conda) this resulted in triton not knowing how to serialize the input (and crashing in our tests). Once switching to the correct datatype that triton expects this seems to have solved the issue.

In the future it would be great if we can add python type hints across DeepSpeed to help with these types of issues. https://docs.python.org/3/library/typing.html

Fixes a dataype issue with softmax where the number of blocks being sent to the Triton kernel source was a torch.Tensor but should have been a python integer. On some environments (e.g., conda) this resulted in triton not knowing how to serialize the input (and crashing in our tests). Once switching to the correct datatype that triton expects this seems to have solved the issue.

jeffra requested review from arashashari, awan-10, cli99, conglongli, eltonzheng, minjiaz, niumanar, RezaYazdaniAminabadi, samyam, ShadenSmith and tjruwase as code owners September 4, 2020 17:58

arashashari approved these changes Sep 4, 2020

View reviewed changes

Shaden Smith and others added 2 commits September 6, 2020 09:58

Merge branch 'master' into jeffra-patch-3

f1ee4f0

Merge branch 'master' into jeffra-patch-3

d7132c1

jeffra merged commit dca0b78 into master Sep 10, 2020

jeffra deleted the jeffra-patch-3 branch September 10, 2020 07:07

bobisapotato mentioned this pull request Jan 24, 2021

Another thing to merge. (MY EYES HURT) bobisai/DeepSpeed#1

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix datatype issue with Sparse Attention softmax #363

Fix datatype issue with Sparse Attention softmax #363

jeffra commented Sep 4, 2020 •

edited

Loading

Fix datatype issue with Sparse Attention softmax #363

Fix datatype issue with Sparse Attention softmax #363

Conversation

jeffra commented Sep 4, 2020 • edited Loading

jeffra commented Sep 4, 2020 •

edited

Loading