YOLOv4 with TensorRT engine

This package contains the yolov4_trt_node that performs the inference using NVIDIA's TensorRT engine

This package works for both YOLOv3 and YOLOv4. Do change the commands accordingly, corresponding to the YOLO model used.

Average FPS for yolov4-416 on a 640x360 input is ~ 8-9 FPS

Average FPS w maximum performace on Jetson is ~ 10-11 FPS

Setting up the environment

Install dependencies

Current Environment:

ROS == Melodic
DISTRO == Ubuntu 18.04
Jetson == Tx2
Jetpack == 4.4
CUDA == 10.2

Dependencies:

TensorRT == 6+
OpenCV == 3.x +
numpy == 1.15.1 +
Protobuf == 3.8.0
Pycuda == 2019.1.2
onnx == 1.4.1 (depends on Protobuf)

Install all dependencies with below commands

Install pycuda (takes awhile)
$ cd ${HOME}/catkin_ws/src/yolov4_trt_ros/dependencies
$ ./install_pycuda.sh

Install Protobuf (takes awhile)
$ cd ${HOME}/catkin_ws/src/yolov4_trt_ros/dependencies
$ ./install_protobuf-3.8.0.sh

Install onnx (depends on Protobuf above)
$ sudo pip3 install onnx==1.4.1

Please also install jetson-inference
Note: This package uses similar nodes to ros_deep_learning package. Please place a CATKIN_IGNORE in that package to avoid similar node name catkin_make error

Setting up the package

1. Clone project into catkin_ws and build it

$ cd ~/catkin_ws && catkin_make
$ source devel/setup.bash

vision_msgs not found error

sudo apt install ros-melodic-vision-msgs

2. Make libyolo_layer.so

$ cd ${HOME}/catkin_ws/src/yolov4_trt_ros/plugins
$ make

This will generate a libyolo_layer.so file

3. Place your yolo.weights and yolo.cfg file in the yolo folder

$ cd ${HOME}/catkin_ws/src/yolov4_trt_ros/yolo

** Please name the yolov4.weights and yolov4.cfg file as follows:

yolov4.weights
yolov4.cfg

Run the conversion script to convert to TensorRT engine file

$ ./convert_yolo_trt

Input the appropriate arguments
This conversion might take awhile
The optimised TensorRT engine would now be saved as yolov3-416.trt / yolov4-416.trt

4. Change the class labels

$ cd ${HOME}/catkin_ws/src/yolov4_trt_ros/utils
$ vim yolo_classes.py

Change the class labels to suit your model

5. Change the video_input and topic_name

$ cd ${HOME}/catkin_ws/src/yolov4_trt_ros/launch

yolov3_trt.launch : change the topic_name
yolov4_trt.launch : change the topic_name
video_source.launch : change the input format (refer to this Link
- video_source.launch requires jetson-inference to be installed
- Default input is CSI camera

Using the package

Running the package

Note: Run the launch files separately in different terminals

1. Run the video_source

# For csi input
$ roslaunch yolov4_trt_ros video_source.launch input:=csi://0

# For video input
$ roslaunch yolov4_trt_ros video_source.launch input:=/path_to_video/video.mp4

2. Run the yolo detector

# For YOLOv3
$ roslaunch yolov4_trt_ros yolov3_trt.launch

# For YOLOv4
$ roslaunch yolov4_trt_ros yolov4_trt.launch

3. For maximum performance

$ cd /usr/bin/
$ sudo ./nvpmodel -m 0	# Enable 2 Denver CPU
$ sudo ./jetson_clock	# Maximise CPU/GPU performance

Please ensure the jetson device is cooled appropriately to prevent overheating

Parameters

str model = "yolov3" or "yolov4"
str model_path = "/abs_path_to_model/"
int input_shape = 288/416/608
int category_num = 80
double conf_th = 0.5
bool show_img = True
Default Input FPS from CSI camera = 30.0

To change this, go to jetson-inference/utils/camera/gstCamera.cpp line 359 and change mOptions.frameRate = 15 to mOptions.frameRate = desired_frame_rate

wyf94/yolov4_trt_ros

YOLOv4 with TensorRT engine

Setting up the environment

Install dependencies

Current Environment:

Dependencies:

Install all dependencies with below commands

Setting up the package

1. Clone project into catkin_ws and build it

2. Make libyolo_layer.so

3. Place your yolo.weights and yolo.cfg file in the yolo folder

4. Change the class labels

5. Change the video_input and topic_name

Using the package

Running the package

1. Run the video_source

2. Run the yolo detector

3. For maximum performance

Parameters

Results obtained

1. Screenshots

Video_in: .mp4 video (640x360)

Avg FPS : ~9

Video_in: CSI_cam (1280x720)

Avg FPS : ~7

Licenses and References

1. Referenced source code from jkjung-avt and his project with tensorrt samples

2. Referenced video_input source code from dusty-nv