NVIDIA TensorRT is an SDK for optimizing trained deep learning models to enable high-performance inference. TensorRT contains a deep learning inference optimizer and a runtime for execution.
After you have trained your deep learning model in a framework of your choice, TensorRT enables you to run it with higher throughput and lower latency.
You need a device with an NVIDIA GPU to use this project.
First we convert the trained model (.pt extension) to a TensorRT engine (.engine extension). To do this, we first export the PyTorch model to ONNX:
```
Usage: python pt2onnx.py -m model.pt -s model.onnx [options]...

A common command: python pt2onnx.py -m some_model.pt

  -m --model  PyTorch model path (.pt). Default: model.pt
  -s --save   ONNX model save path (.onnx). Default: model.onnx
```
Now we have an ONNX model, which we can convert to TensorRT as follows:
```
Usage: python onnx2engine.py -m model.onnx -p fp16 -s model.engine [options]...

A common command: python onnx2engine.py -m some_model.onnx

  -m --model      ONNX model path (.onnx). Default: model.onnx
  -p --precision  Model quantization. Options: fp16 | fp32 | int8. Default: fp16
  -s --save       TensorRT model save path (.engine). Default: model.engine
```
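The `-p/--precision` choice determines which TensorRT builder flag the conversion enables. A small illustrative helper (the function and mapping names are assumptions, not the script's internals; real code would use `tensorrt.BuilderFlag`):

```python
# Illustration only: how an onnx2engine-style script might validate the
# -p/--precision option and pick the TensorRT builder flag to enable.
PRECISION_TO_FLAG = {
    "fp32": None,    # full precision: no extra builder flag needed
    "fp16": "FP16",  # half precision: usually much faster, minor accuracy loss
    "int8": "INT8",  # 8-bit: fastest, but requires a calibration dataset
}

def builder_flag(precision: str = "fp16"):
    """Return the builder flag name for a precision choice, or None for fp32."""
    if precision not in PRECISION_TO_FLAG:
        raise ValueError(f"unsupported precision: {precision}")
    return PRECISION_TO_FLAG[precision]
```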
If you want, you can run the TensorRT model from Python code. You can do this as follows:
```
Usage: python deploy.py -m fp16.engine -v video.mp4 -c classes.txt [options]...

A common command: python deploy.py -m some_model.engine -v some_video.mp4

  -m --model    TensorRT model path (.engine). Default: fp16.engine
  -v --video    Video path. Default: video.mp4
  -c --classes  Text file where the class names are stored. Default: classes.txt
  -o --output   Output type. Options: opencv - the program displays frames with OpenCV (cv2) | write - the result is written to an .mp4 file. Default: opencv
  -s --save     Output video path; used when --output is "write". Default: video.mp4
```
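The `-c/--classes` file is a plain text file with one class name per line. A minimal loader sketch (the helper name is an assumption, not the repo's code):

```python
import os
import tempfile

def load_classes(path="classes.txt"):
    """Read class names, one per line, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]

# Demo with a throwaway file standing in for classes.txt
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
    f.write("person\n\nbicycle\ncar\n")
names = load_classes(f.name)
os.remove(f.name)
```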
We used Streamlit for deployment. Streamlit lets you turn data scripts into shareable web apps in minutes, not weeks. It’s all Python, open source, and free. Once you’ve created an app, you can use Streamlit’s Community Cloud platform to deploy, manage, and share it.
In our case, run the following command to launch the Streamlit app:

```
streamlit run stream.py
```
The result is as follows:
- Python >= 3.8
- PyTorch >= 2
- TensorRT >= 8
- Clone repo

  ```
  git clone https://github.com/shoxa0707/Deploy-Yolov8-with-TensorRT.git
  cd Deploy-Yolov8-with-TensorRT
  ```
- Install dependent packages

  ```
  # for CUDA 11.8
  pip install torch==2.0.0+cu118 torchvision==0.15.1+cu118 torchaudio==2.0.1 --index-url https://download.pytorch.org/whl/cu118
  pip install -r requirements.txt
  ```
- Linux
- Python 3.8
- NVIDIA GPU