
Convert a static-size ONNX model to TensorRT for multi-batch inference #3632

Closed
ninono12345 opened this issue Jan 24, 2024 · 4 comments

@ninono12345

Description

Hello, this is not a bug report, more like asking for advice.

I have an ONNX model that's used in a tracking algorithm, but the model doesn't accept batched input: if 4 objects are being tracked, the model processes the frame 4 times instead of receiving a single tensor with a batch of 4.

My question is: is it possible to convert that ONNX model to TensorRT in a way that makes it accept multiple batches at once? For example, if the expected input is 1x3x224x224, could I send in 4x3x224x224 and infer all 4 frames with a single call to the engine?

Thank you

@ninono12345
Author

I understand that to some of you this might sound like a stupid question, but I have had so many problems just converting a model to ONNX, then to TensorRT, and making it work that my head hurts. Now I have to dig deep again just to find an answer to this. Does anybody know how to do it?

Thank you

@zerollzeng
Collaborator

For some ONNX models it's doable, and it can be done quickly with Polygraphy. See https://github.com/NVIDIA/TensorRT/tree/main/tools/Polygraphy/examples/cli/surgeon/03_modifying_input_shapes
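
The linked example drives this through Polygraphy's `surgeon sanitize` subcommand. For readers who prefer scripting it, here is a minimal sketch of the same idea using the `onnx` Python package directly; the file names and the symbolic dimension name are illustrative assumptions, not something from this thread:

```python
# Rough sketch (assumed names): make the batch dimension of an ONNX model
# symbolic so TensorRT treats it as dynamic.
import onnx

model = onnx.load("model.onnx")

# Rename dim 0 of every graph input and output to a symbolic name.
for value in list(model.graph.input) + list(model.graph.output):
    dim0 = value.type.tensor_type.shape.dim[0]
    dim0.dim_param = "batch"  # setting dim_param clears any fixed dim_value

onnx.checker.check_model(model)
onnx.save(model, "model_dynamic_batch.onnx")
```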

Some models won't work, though. For example, if your model has a Reshape node that reshapes to a fixed shape, then you cannot change the input batch/shape; the engine build would fail otherwise.
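
Once the batch dimension is dynamic, the TensorRT engine also needs an optimization profile covering the batch range. A minimal sketch with the TensorRT Python API follows; the input name `"input"` and the shape ranges are assumptions for illustration (query `network.get_input(0).name` for the real name):

```python
# Rough sketch: build a TensorRT engine whose optimization profile covers
# batch sizes 1..8 for an ONNX model with a dynamic batch dimension.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

with open("model_dynamic_batch.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise SystemExit("ONNX parse failed")

config = builder.create_builder_config()
profile = builder.create_optimization_profile()
# min / opt / max shapes for the (assumed) input "input".
profile.set_shape("input",
                  (1, 3, 224, 224),
                  (4, 3, 224, 224),
                  (8, 3, 224, 224))
config.add_optimization_profile(profile)

serialized = builder.build_serialized_network(network, config)
with open("model.engine", "wb") as f:
    f.write(serialized)
```

The same result can be had from the command line with trtexec's shape flags, e.g. `trtexec --onnx=model_dynamic_batch.onnx --minShapes=input:1x3x224x224 --optShapes=input:4x3x224x224 --maxShapes=input:8x3x224x224`.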

@zerollzeng zerollzeng self-assigned this Jan 27, 2024
@zerollzeng zerollzeng added the triaged Issue has been triaged by maintainers label Jan 27, 2024
@ninono12345
Author

@zerollzeng I found out that the model can be modified in a few places, and I managed to make it take and return multiple batches. But now I'm facing a different problem: when adding batches to a TensorRT engine, the engine's inference time slows down significantly... #3646

@zerollzeng
Collaborator

Let's discuss the new issue in #3646

I'm closing this.
