Improved YOLOv6 has the same TensorRT model inference speed under INT8 and FP16 #5803
Unanswered
xiaoche-24 asked this question in Q&A
Replies: 2 comments
-
Which runtime are you using? This issue should probably be moved there.
-
I would move the issue to https://github.com/NVIDIA/TensorRT for more help. Thanks!
Ask a Question
Question
I improved YOLOv6. After converting to TensorRT, the improved model's inference is faster than the original YOLOv6 under FP32 and FP16. However, after converting to INT8, the original YOLOv6 roughly doubles in speed (FP16: 66 fps -> INT8: 122 fps), while the improved YOLOv6 barely changes (FP16: 100 fps -> INT8: 102 fps). Why is the INT8 speedup almost nonexistent? My improvement module includes Split, Concat, and DropPath operations. The ONNX opset is set to 12 because opset 13 fails: when converting an opset-13 model to TensorRT on the NX, the following error is reported:
Further information
Relevant Area:
Is this issue related to a specific model?
Model name:
Model opset: 12
Notes
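A common reason for a flat INT8 speedup like the one described above is that some of the added layers (Split/Slice, Concat, and the element-wise scaling left behind by DropPath) may not run in INT8, so TensorRT inserts Reformat layers around them and those parts of the network stay in FP16/FP32. One way to check is to dump per-layer information from the built engine. The sketch below is a diagnostic illustration, not the original poster's setup: it assumes TensorRT 8.2 or newer (the engine-inspector API), an engine built with profiling verbosity set to DETAILED, and a hypothetical engine file name `yolov6_improved_int8.engine`.

```python
# Minimal diagnostic sketch. Assumptions: TensorRT >= 8.2 (engine inspector API),
# and the engine was built with config.profiling_verbosity set to DETAILED
# (otherwise the inspector reports only layer names, not formats/tactics).
# "yolov6_improved_int8.engine" is a hypothetical placeholder path.
import tensorrt as trt

ENGINE_PATH = "yolov6_improved_int8.engine"

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)

# Deserialize the already-built INT8 engine.
with open(ENGINE_PATH, "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

# Dump per-layer information (layer names, tactics, tensor formats/datatypes).
inspector = engine.create_engine_inspector()
report = inspector.get_engine_information(trt.LayerInformationFormat.JSON)
print(report)

# Rough heuristic on the raw report text: many Reformat layers, or few Int8
# tensors around the Split/Concat/DropPath region, suggest those layers fell
# back to FP16/FP32, which would explain an INT8 engine that is barely faster
# than the FP16 one.
print("reformat layer mentions:", report.count("Reformat"))
print("int8 tensor mentions:", report.count("Int8"))
```

If the engine inspector is not available in the TensorRT version shipped on the Jetson, running `trtexec --loadEngine=<engine> --dumpProfile --separateProfileRun` gives per-layer timings instead, which usually points at the same fallback layers.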