
ONNX to TensorRT conversion for input layer: cast uint8 to fp32 #4131

Open
maxlacourchristensen opened this issue Sep 17, 2024 · 1 comment
Labels
triaged Issue has been triaged by maintainers

Comments

@maxlacourchristensen

My PyTorch and ONNX model has a uint8-to-fp32 cast layer that divides by 255. This cast layer is applied to the input tensor. When I convert the ONNX model to TensorRT INT8, I get the following warning:

"Missing scale and zero-point for tensor input, expect fall back to non-int8 implementation for any layer consuming or producing given tensor"

For INT8, should I remove the cast layer before exporting the ONNX model, or does TensorRT handle it itself? What is the recommended approach for the best INT8 performance?
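For context, a minimal sketch of the normalization the cast layer performs, written with NumPy (the actual model does this inside the graph as a Cast node followed by a Div; the function name here is hypothetical):

```python
import numpy as np

def normalize_uint8(img_u8: np.ndarray) -> np.ndarray:
    """Sketch of the model's input cast: uint8 -> float32, scaled to [0, 1]."""
    return img_u8.astype(np.float32) / 255.0

# Example: a 1x3 "image" row with min, mid, and max pixel values.
img = np.array([[0, 128, 255]], dtype=np.uint8)
out = normalize_uint8(img)
print(out.dtype)   # float32
print(out[0, 2])   # 1.0
```

It is this fp32 output (not the raw uint8 input) that the INT8 builder needs a scale and zero-point for, which is what the warning refers to.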

Platforms are Jetson AGX Orin, Xavier NX, and Orin NX.

@lix19937

Similar case #3959

@moraxu moraxu added Quantization: PTQ triaged Issue has been triaged by maintainers labels Sep 20, 2024