Some backends, such as IREE and TOSA (per today's spec), don't support i64 or f64, or support them only at significant cost. This issue is to start collecting use cases and motivation for better supporting bit-width reduction for these targets in Torch-MLIR.
The general theme is that PyTorch's semantics are 64-bit semantics, but it is sometimes "common sense" to drop down to smaller bit widths. E.g. for a 100kB model, a user might be quite comfortable using a 32-bit integer to hold tensor sizes, even though technically they are specced as 64-bit integers.
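For concreteness, here is a minimal sketch (plain PyTorch/TorchScript, nothing Torch-MLIR-specific, with an arbitrary toy model) of where such 64-bit scalar ints show up: shape arithmetic in an otherwise tiny model.

```python
import torch

class TinyModel(torch.nn.Module):
    def forward(self, x):
        # In TorchScript, x.size(0) is an `int`, which is specced as 64-bit,
        # even though for a small model the value easily fits in 32 bits.
        n = x.size(0)
        return x.view(n, -1)

scripted = torch.jit.script(TinyModel())
print(scripted.graph)  # the size/view arithmetic traffics in 64-bit ints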
As we think about this, there are 4 basic cases that probably need separate treatment:
1. Scalar floats (!torch.float). This represents the PyTorch/TorchScript float type, which is 64-bit.
Thoughts: I don't have specific insight here, but there are likely many scenarios where 64-bit is overkill (e.g. when deploying to a microcontroller). Some scientific codes might care about certain quantities being very precise, though.
2. Scalar ints (!torch.int). This represents the PyTorch/TorchScript int type, which is specced as 64-bit in TorchScript (though the Python type is arbitrary precision).
Thoughts: Often these are used to calculate view sizes and the like, as in the sketch above. With large language models now in the hundreds of GB, we cannot arbitrarily use 32-bit indexing (though perhaps individual tensor dimensions remain in the 32-bit range?).
3. Tensors with 64-bit floating point numbers (e.g. !torch.tensor<[10,12],f64>). This represents tensor computations on f64.
Thoughts: PyTorch defaults to f32, so if a user asks for f64 they probably actually want the extra precision (?).
4. Tensors with 64-bit integers (e.g. !torch.tensor<[10,12],si64>). This is probably most common for embedding indices (see the sketch after this list).
Thoughts: Most embeddings are likely OK to index with 32-bit indices, but they seem to be getting larger and larger, and it is not out of the question to need 64-bit indices there.
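As a concrete illustration of case 4, a small sketch showing that embedding indices default to int64 in PyTorch even when the table is tiny (the 1000-row / 16-dim sizes here are arbitrary):

```python
import torch

# A small embedding table; its indices nonetheless arrive as si64 tensors.
emb = torch.nn.Embedding(num_embeddings=1000, embedding_dim=16)
idx = torch.randint(0, 1000, (8,))  # default integer dtype is torch.int64
print(idx.dtype)                    # torch.int64
out = emb(idx)
# For a 1000-row table, 32-bit indices would clearly suffice; for a
# multi-billion-row table they would not.
```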
We need to discuss this with the PyTorch devs, hear their thinking on it, and align on a solution.