About trt inference

Question

About trt inference

leayz-888 opened this issue 8 months ago · 7 comments

The author of yolov9 has updated the re-parameterization code. Before converting to onnx, you can re-parameterize it first, and then modify the output to only one, which can greatly shorten the inference time.

spacewalk01 commented 8 months ago

Nice!

Answer 1 · 2024-02-27T06:27:01.000Z

I will look into your suggestion. Thank you :)

Answer 2 · 2024-02-27T06:47:24.000Z

It indeed reduces the number of parameters. I will apply it and update this repo. Thanks again.

Answer 3 · 2024-02-27T07:19:22.000Z

Applied re-parameterization!

Answer 4 · 2024-02-27T07:24:32.000Z

Applied re-parameterization!

nice! By the way, when I exported the onnx model, I only kept the final output. The generated engien file had fewer parameters. According to actual measurements, it can speed up by about 25%.

Answer 5 · 2024-02-27T07:30:47.000Z

like this:

Answer 6 · 2024-02-27T07:37:07.000Z

Indeed, I got the same result. Nice work!