Warning on float64 training with PyTorch 2.1 #306
Merged
PyTorch 2.1 has a known bug preventing training in float64: https://discuss.pytorch.org/t/tensors-of-the-same-index-must-be-on-the-same-device-and-the-same-dtype-except-step-tensors-that-can-be-cpu-and-float32-notwithstanding/190335
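For reference, a minimal sketch of the failure mode, assuming a CUDA device (where PyTorch 2.1 selects the multi-tensor "foreach" optimizer implementation by default); this snippet is not from this PR:

```python
import torch

# Assumed reproduction: stepping Adam over float64 parameters under
# PyTorch 2.1 raises the RuntimeError described in the linked thread.
model = torch.nn.Linear(8, 1, dtype=torch.float64, device="cuda")
optimizer = torch.optim.Adam(model.parameters())

x = torch.randn(16, 8, dtype=torch.float64, device="cuda")
model(x).square().mean().backward()
optimizer.step()  # RuntimeError under PyTorch 2.1
```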
There are three workarounds:

1. Warn the user about the bug (what this PR does).
2. Set `foreach = False` in the optimizer, but this downgrades the computational performance.
3. Call `torch.set_default_dtype(torch.float64)`.

As PyTorch seems to fix that in the next release, I recommend just making the first point. If it is not fixed in 2.2, then we should try the last point. A sketch of the code-level workarounds is given below.
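For concreteness, a sketch of workarounds 2 and 3 (`model` is a placeholder; neither line comes from this PR):

```python
import torch

model = torch.nn.Linear(8, 1, dtype=torch.float64)  # placeholder model

# Workaround 2: force the slower single-tensor implementation, avoiding
# the buggy foreach path at some cost in optimizer throughput.
optimizer = torch.optim.Adam(model.parameters(), foreach=False)

# Workaround 3: make float64 the process-wide default dtype, the fallback
# suggested above if the bug survives into PyTorch 2.2.
torch.set_default_dtype(torch.float64)
```

And a hypothetical version of the warning itself; the check and message below are assumptions for illustration, not the code merged in this PR:

```python
import warnings

import torch

def _warn_float64_torch_2_1(dtype: torch.dtype) -> None:
    # Hypothetical helper: warn when training in float64 under PyTorch 2.1.
    if dtype == torch.float64 and torch.__version__.startswith("2.1"):
        warnings.warn(
            "PyTorch 2.1 has a known bug that prevents training in float64; "
            "consider passing foreach=False to the optimizer."
        )
```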