
Int8 calculation problem #76

Open
CoinCheung opened this issue Sep 21, 2024 · 1 comment

Comments

@CoinCheung

Hi,

I posted the error message in the TensorRT repo and they referred me to this repo, so I am opening an issue here. The problem is that when I quantize the model in PyTorch with modelopt and export it to ONNX, TensorRT fails to compile the ONNX file into a TensorRT engine.

Here is a link to an example code snippet that reproduces the error message:

NVIDIA/TensorRT#4095 (comment)

Could you please help me get this working?
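
For context, the flow in question is roughly the following (a minimal sketch only: the model, shapes, and calibration loop are placeholders, and the actual reproduction code is in the linked issue):

```python
import torch
import torchvision
import modelopt.torch.quantization as mtq

# Placeholder model; the real report uses the model from the linked issue.
model = torchvision.models.resnet18(weights=None).cuda().eval()

def forward_loop(m):
    # Feed a few representative batches so modelopt can collect
    # activation ranges for INT8 calibration (random data here only
    # to keep the sketch self-contained).
    for _ in range(8):
        m(torch.randn(4, 3, 224, 224, device="cuda"))

# Insert Q/DQ quantizers and calibrate with modelopt's default INT8 config.
model = mtq.quantize(model, mtq.INT8_DEFAULT_CFG, forward_loop)

# Export the quantized graph to ONNX; this is the file that TensorRT
# then fails to build into an engine. (Depending on the modelopt
# version, this export may need to run under modelopt's ONNX-export
# context; see the modelopt docs.)
dummy = torch.randn(1, 3, 224, 224, device="cuda")
torch.onnx.export(model, dummy, "model_int8.onnx", opset_version=17)
```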

@cjluo-omniml
Collaborator

Have you tried the ONNX PTQ workflow, which exports to ONNX first and then does the quantization? See https://github.com/NVIDIA/TensorRT-Model-Optimizer/tree/main/onnx_ptq
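
In that workflow you quantize the exported FP32 ONNX model directly. A minimal sketch, assuming the `modelopt.onnx.quantization.quantize` entry point and argument names from that repo (the paths and calibration data here are placeholders; check the linked examples for the exact arguments):

```python
import numpy as np
from modelopt.onnx.quantization import quantize

# Placeholder calibration inputs: a batch of representative samples
# with the model's input shape.
calib = np.random.rand(32, 3, 224, 224).astype(np.float32)

# Quantize the plain FP32 ONNX export (no torch-side quantization)
# and write out a Q/DQ INT8 ONNX model.
quantize(
    onnx_path="model.onnx",
    quantize_mode="int8",
    calibration_data=calib,
    output_path="model_int8.onnx",
)
```

The resulting Q/DQ model can then be built into an engine with, e.g., `trtexec --onnx=model_int8.onnx --int8 --saveEngine=model_int8.engine`.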
