[cuTENSOR] Automatically enable/disable TensorFloat32 #40

Linux-cpp-lisp · 2021-07-12T21:57:22Z

(Thanks to @springer13 for making your work on a PyTorch cuTENSOR wrapper public!)

Currently, the Python cuTENSOR wrapper always uses TensorFloat32 as the compute dtype for 32-bit float tensors, which is unsupported on non-Ampere GPUs. This PR uses the PyTorch and Tensorflow configuration options for TensorFloat32 (which autodetect Ampere) to set the compute dtype to normal 32-bit float when tf32 is not supported.

springer13 and others added 5 commits June 28, 2021 09:01

cuTENSOR 1.3.1: einsum sample + python (einsum) bindings

b542aa1

Merge branch 'master' of https://github.com/NVIDIA/CUDALibrarySamples

442d715

Merge branch 'master' of https://github.com/NVIDIA/CUDALibrarySamples

ec627be

Clean-up

cccbab5

Automatically configure TensorFloat32

4fc626e

Linux-cpp-lisp changed the title ~~Automatically enable/disable TensorFloat32~~ [cuTENSOR] Automatically enable/disable TensorFloat32 Jul 12, 2021

gsakhnovsky-nvidia force-pushed the master branch from ec627be to 5c03ab7 Compare July 21, 2021 01:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[cuTENSOR] Automatically enable/disable TensorFloat32 #40

[cuTENSOR] Automatically enable/disable TensorFloat32 #40

Linux-cpp-lisp commented Jul 12, 2021

[cuTENSOR] Automatically enable/disable TensorFloat32 #40

Are you sure you want to change the base?

[cuTENSOR] Automatically enable/disable TensorFloat32 #40

Conversation

Linux-cpp-lisp commented Jul 12, 2021