
🐛 [Bug] Cannot export models to TensorRT with int8 quantization #1222

Closed · domef opened this issue Aug 2, 2022 · 15 comments

Labels: bug (Something isn't working), component: quantization (Issues re: Quantization), No Activity

Comments

domef commented Aug 2, 2022

🐛 Describe the bug

I'm trying to convert a resnet18 to TensorRT. Compilation works fine with enabled_precisions set to torch.float or torch.float16, but it fails with torch.int8.

import torch
import torchvision
import torch_tensorrt


model = torchvision.models.resnet18().eval().cuda()
model_jit = torch.jit.script(model)
# model_jit = torch.jit.trace(model, torch.rand((1, 3, 256, 256), device="cuda"))

trt_model = torch_tensorrt.ts.compile(
    model_jit,
    inputs=[torch_tensorrt.Input((1, 3, 256, 256))],
    device={
        "device_type": torch_tensorrt.DeviceType.GPU,
        "gpu_id": 0,
        "dla_core": 0,
        "allow_gpu_fallback": True,
    },
    enabled_precisions={torch.int8},
)

When using the model exported with torch.jit.script, the error is the following:

  File "test_tensorrt.py", line 10, in <module>
    trt_model = torch_tensorrt.ts.compile(
  File "/opt/conda/lib/python3.8/site-packages/torch_tensorrt/ts/_compiler.py", line 113, in compile
    compiled_cpp_mod = _C.compile_graph(module._c, _parse_compile_spec(spec))
RuntimeError: Unknown type bool encountered in graph lowering. This type is not supported in ONNX export.

When using the model exported with torch.jit.trace, the program exits with:

Segmentation fault (core dumped)

I'm using the nvidia container nvcr.io/nvidia/pytorch:22.06-py3.

Edit:

If I compile the nn.Module directly, I get the same error as with the scripted model:

model = torchvision.models.resnet18().eval().cuda()

trt_model = torch_tensorrt.compile(
    model,
    inputs=[torch_tensorrt.Input((1, 3, 256, 256))],
    device={
        "device_type": torch_tensorrt.DeviceType.GPU,
        "gpu_id": 0,
        "dla_core": 0,
        "allow_gpu_fallback": True,
    },
    enabled_precisions={torch.int8},
)

Versions

PyTorch version: 1.13.0a0+340c412
Is debug build: False
CUDA used to build PyTorch: 11.7
ROCM used to build PyTorch: N/A

OS: Ubuntu 20.04.4 LTS (x86_64)
GCC version: (Ubuntu 9.4.0-1ubuntu1~20.04.1) 9.4.0
Clang version: Could not collect
CMake version: version 3.23.2
Libc version: glibc-2.31

Python version: 3.8.13 | packaged by conda-forge | (default, Mar 25 2022, 06:04:10) [GCC 10.3.0] (64-bit runtime)
Python platform: Linux-5.13.0-52-generic-x86_64-with-glibc2.10
Is CUDA available: True
CUDA runtime version: 11.7.99
GPU models and configuration: GPU 0: NVIDIA GeForce RTX 2080 Ti
Nvidia driver version: 510.73.05
cuDNN version: Probably one of the following:
/usr/lib/x86_64-linux-gnu/libcudnn.so.8.4.1
/usr/lib/x86_64-linux-gnu/libcudnn_adv_infer.so.8.4.1
/usr/lib/x86_64-linux-gnu/libcudnn_adv_train.so.8.4.1
/usr/lib/x86_64-linux-gnu/libcudnn_cnn_infer.so.8.4.1
/usr/lib/x86_64-linux-gnu/libcudnn_cnn_train.so.8.4.1
/usr/lib/x86_64-linux-gnu/libcudnn_ops_infer.so.8.4.1
/usr/lib/x86_64-linux-gnu/libcudnn_ops_train.so.8.4.1
HIP runtime version: N/A
MIOpen runtime version: N/A
Is XNNPACK available: True

Versions of relevant libraries:
[pip3] numpy==1.22.4
[pip3] pytorch-quantization==2.1.2
[pip3] torch==1.13.0a0+340c412
[pip3] torch-tensorrt==1.1.0a0
[pip3] torchtext==0.13.0a0
[pip3] torchvision==0.13.0a0
[conda] mkl 2020.4 h726a3e6_304 conda-forge
[conda] mkl-include 2020.4 h726a3e6_304 conda-forge
[conda] numpy 1.22.4 py38h99721a1_0 conda-forge
[conda] pytorch-quantization 2.1.2 pypi_0 pypi
[conda] torch 1.13.0a0+340c412 pypi_0 pypi
[conda] torch-tensorrt 1.1.0a0 pypi_0 pypi
[conda] torchtext 0.13.0a0 pypi_0 pypi
[conda] torchvision 0.13.0a0 pypi_0 pypi

cc @jerryzh168 @jianyuh @raghuramank100 @jamesr66a @vkuzo

domef added the bug (Something isn't working) label on Aug 2, 2022
narendasan added the component: quantization (Issues re: Quantization) label on Aug 2, 2022
github-actions bot commented Nov 1, 2022

This issue has not seen activity for 90 days. Remove the stale label or comment, or it will be closed in 10 days.

chava100 commented Nov 3, 2022

I encountered the same issue. I would appreciate it if anyone has information about this error.

ousinkou commented Dec 15, 2022

I used the example from the tutorial https://pytorch.org/TensorRT/tutorials/use_from_pytorch.html# and the same problem occurred.
After I added model = model.eval() and set the input dtype to torch.half, the problem disappeared; a sketch of that change is below.
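
For reference, a minimal sketch of that workaround applied to the resnet18 repro from this issue (using this issue's model rather than the exact tutorial model is an assumption); the key changes are the explicit .eval() call before scripting and the half-precision input dtype. Note that this sidesteps the crash by compiling in fp16 rather than int8:

import torch
import torchvision
import torch_tensorrt

# Sketch of the workaround described above, not a fix for the int8 path.
# Put the model in eval mode before scripting, as suggested above.
model = torchvision.models.resnet18().eval().cuda()
model_jit = torch.jit.script(model)

# Declare the input with dtype=torch.half and enable fp16 precision only.
trt_model = torch_tensorrt.ts.compile(
    model_jit,
    inputs=[torch_tensorrt.Input((1, 3, 256, 256), dtype=torch.half)],
    enabled_precisions={torch.half},
)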

Charlyo commented Feb 19, 2023

+1, I have the same problem!

ichitaka commented:

+1


Charlyo commented Apr 18, 2023

@peri044 Could you take a look at the matter, please?

AhmetHamzaEmra commented:

I have the same issue. Any update on this?

peri044 (Collaborator) commented Jun 13, 2023

@AhmetHamzaEmra Can you provide a repro of the error and the error message? The 22.06 container is quite old; you should probably try with the main branch.

AhmetHamzaEmra commented:

> @AhmetHamzaEmra Can you provide a repro of the error and the error message? The 22.06 container is quite old; you should probably try with the main branch.

#2018

SongDabao commented:

I have the same issue. Any update on this?

github-actions bot commented Oct 5, 2023

This issue has not seen activity for 90 days. Remove the stale label or comment, or it will be closed in 10 days.

domef (Author) commented Oct 13, 2023

Any update on the int8 export?

domef (Author) commented Oct 24, 2023

Is int8 quantization now working?

iamjkh commented Jan 19, 2024

I saw the same error message as in this issue.
For int8 quantization, you have to pass a calibrator backed by a dataloader.
The following page will be helpful for this issue; a sketch of the calibrator setup is shown after the link.

https://pytorch.org/TensorRT/tutorials/ptq.html
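
For reference, a minimal sketch of that calibrator-based approach, following the PTQ tutorial linked above and adapted to the resnet18 repro from this issue; the CIFAR10 dataset, cache file path, and resize transform are placeholders for illustration:

import torch
import torchvision
import torch_tensorrt

model = torchvision.models.resnet18().eval().cuda()

# Any representative dataset works for calibration; CIFAR10 here is just a placeholder.
calib_dataset = torchvision.datasets.CIFAR10(
    root="./data",
    train=False,
    download=True,
    transform=torchvision.transforms.Compose(
        [torchvision.transforms.Resize((256, 256)), torchvision.transforms.ToTensor()]
    ),
)
calib_dataloader = torch.utils.data.DataLoader(calib_dataset, batch_size=1, shuffle=False)

# Calibrator that feeds the dataloader through the network to pick int8 scales,
# as described in the PTQ tutorial.
calibrator = torch_tensorrt.ptq.DataLoaderCalibrator(
    calib_dataloader,
    cache_file="./calibration.cache",
    use_cache=False,
    algo_type=torch_tensorrt.ptq.CalibrationAlgo.ENTROPY_CALIBRATION_2,
    device=torch.device("cuda:0"),
)

trt_model = torch_tensorrt.compile(
    model,
    inputs=[torch_tensorrt.Input((1, 3, 256, 256))],
    enabled_precisions={torch.float, torch.half, torch.int8},
    calibrator=calibrator,
    device={
        "device_type": torch_tensorrt.DeviceType.GPU,
        "gpu_id": 0,
        "dla_core": 0,
        "allow_gpu_fallback": True,
    },
)

Whether this resolves the original lowering error reported here is untested; it is simply the documented path for int8 post-training quantization with Torch-TensorRT.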

aungpaing-gw commented:

Any update on quantizing the model to int8 format?
