
Add fp16 PyTorch models #152

Open
kkontny opened this issue Oct 10, 2022 · 3 comments

kkontny (Contributor) commented Oct 10, 2022

In PyTorch you can convert a model to fp16 with a module.half() call. I think it should be called before converting to TorchScript; see the docs: https://pytorch.org/docs/stable/generated/torch.nn.Module.html. The implementation should be quite simple.
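A minimal sketch of what this could look like, using torchvision's resnet50 only as a stand-in for the benchmark's own model wrapper (the variable names here are illustrative, not the repository's actual code):

```python
import torch
import torchvision

# Stand-in for the benchmark's model; any nn.Module works the same way.
model = torchvision.models.resnet50(pretrained=True)
model.eval()
model.half()  # cast all parameters and buffers to fp16 in place

# Convert to TorchScript only after the cast.
frozen_script = torch.jit.freeze(torch.jit.script(model))

# Inputs must also be fp16 to match the parameter dtype.
x = torch.randn(1, 3, 224, 224, dtype=torch.float16)
with torch.no_grad():
    y = frozen_script(x)
# Note: on CPUs whose PyTorch build lacks fp16 kernels for some ops,
# this can raise "not implemented for 'Half'" errors (see below).
```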

jan-grzybek-ampere (Member) commented:

@dkupnicki please take a look

dkupnicki (Collaborator) commented:

I added `self.__model.half()` before

```python
self.__frozen_script = torch.jit.freeze(torch.jit.script(self.__model))
```

and tested a few models. All of them stopped working, though with slightly different errors:

  • resnet_50_v1 and a few others threw RuntimeError: "rsqrt_cpu" not implemented for 'Half'
  • alexnet and ssd_vgg_16 threw RuntimeError: expected scalar type Float but found Half
  • roberta_base_squad threw RuntimeError: "LayerNormKernelImpl" not implemented for 'Half'

Looks like half() only works on GPUs.
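For reference, the failure can be reproduced outside the benchmark with a standalone sketch; the exact behaviour depends on the PyTorch build, since CPU fp16 kernels have been added over time:

```python
import torch

# The cast itself succeeds; the failure occurs when an op without a CPU
# fp16 kernel is executed, e.g. LayerNorm as used by roberta_base_squad.
layer_norm = torch.nn.LayerNorm(16).half()
x = torch.randn(2, 16, dtype=torch.float16)

try:
    layer_norm(x)
except RuntimeError as err:
    print(err)  # "LayerNormKernelImpl" not implemented for 'Half'
```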

kkontny (Contributor, Author) commented Oct 12, 2022

Yes, that is expected. Since x86 doesn't support fp16 natively, nobody has cared about it on CPU. However, Altra does support it natively, so once eager mode is implemented I expect it to work.
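If an eager-mode path is added to the runner, the fp16 flow would presumably reduce to something like this sketch (resnet50 again as a stand-in, and assuming a PyTorch build whose CPU kernels cover fp16 for the ops involved):

```python
import torch
import torchvision

# Plain eager-mode inference, no TorchScript involved; on hardware with
# native fp16 support (e.g. Altra) this is where the benefit would show up.
model = torchvision.models.resnet50(pretrained=True).eval().half()
x = torch.randn(1, 3, 224, 224, dtype=torch.float16)

with torch.no_grad():
    y = model(x)

print(y.dtype)  # torch.float16
```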
