Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[cuTENSOR] Automatically enable/disable TensorFloat32 #40

Open
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

Linux-cpp-lisp
Copy link

(Thanks to @springer13 for making your work on a PyTorch cuTENSOR wrapper public!)

Currently, the Python cuTENSOR wrapper always uses TensorFloat32 as the compute dtype for 32-bit float tensors, which is unsupported on non-Ampere GPUs. This PR uses the PyTorch and Tensorflow configuration options for TensorFloat32 (which autodetect Ampere) to set the compute dtype to normal 32-bit float when tf32 is not supported.

@Linux-cpp-lisp Linux-cpp-lisp changed the title Automatically enable/disable TensorFloat32 [cuTENSOR] Automatically enable/disable TensorFloat32 Jul 12, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants