• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2022, Vol. 44 ›› Issue (02): 312-320.

• 图形与图像 • 上一篇    下一篇

用于人体检测的YOLOv3改进及压缩算法的研究

张玉杰,董蕊   

  1. (陕西科技大学电气与控制工程学院,陕西 西安 710021)
  • 收稿日期:2020-08-16 修回日期:2020-11-12 接受日期:2022-02-25 出版日期:2022-02-25 发布日期:2022-02-18
  • 基金资助:
    陕西省科技计划(2020GY-063);西安市科技计划(2020KJRC0010)

YOLOv3 improvement and its compression algorithm for human body detection

ZHANG Yu-jie,DONG Rui   

  1. (School of Electrical and Control Engineering,Shaanxi University of Science and Technology,Xi’an 710021,China)

  • Received:2020-08-16 Revised:2020-11-12 Accepted:2022-02-25 Online:2022-02-25 Published:2022-02-18

摘要: 针对Yolov3算法应用于人体检测中的准确率低,参数量、计算量和模型体积大难以在资源有限的嵌入式平台上实现等问题,提出了YOLOv3改进及其模型压缩算法。在YOLOv3中通过引入密集连接与多分支结构,增加网络宽度和多尺度感受野,加强特征重用,提高了模型的检测精度;对改进的YOLOv3通过联合优化权重损失函数和BN层缩放因子的L1正则项等方式进行通道剪枝,从而减小了参数量和计算量,模型体积得到了大幅压缩。实验结果表明,改进后YOLOv3算法的检测精度提升了601%,模型体积减小了38.46%;经过压缩后,模型的检测精度虽然降低了3.16%,但模型体积仅为原来的3.31%,只有4.77 MB。
因此,通过改进和压缩后的YOLOv3仍然保持较高的检测精度,而且模型体积得到大幅度的压缩,为YOLOv3模型在嵌入式平台上实现人体检测提供了支撑。

关键词: YOLOv3, 人体检测, 密集连接结构, 多分支结构, 模型压缩

Abstract: Aiming at the problems of low accuracy, large amount of parameters, large amount of calculation, and large model volume, difficulty of being implemented on embedded platforms with limited resources when YOLOv3 algorithm is applied to human body detection, an improved YOLOv3 and its model compression algorithm are proposed. By introducing dense connections and multi-branch structure in YOLOv3, increasing the network width and multi-scale receptive field, and strengthening feature reuse, the accuracy of the model is improved. For the improved YOLOv3, channel pruning is performed by jointly optimizing the weight loss function and the L1 regular term of the BN layer scaling factor, thereby reducing the amount of parameters and calculations, and the volume of the model is greatly compressed. Experimental results show that the accuracy of the improved YOLOv3 algorithm is increased by 6.01%, and the model volume is reduced by 38.46%. After compression, although the accuracy of the model is reduced by 3.16%, the model volume is only 3.31% of the original, only 4.77 MB. Therefore, the improved and compressed YOLOv3 still maintains a high accuracy rate, and the model volume is greatly reduced, which provides support for the YOLOv3 model to realize human body detection in embedded deployment.


Key words: YOLOv3, human body detection, dense connection, multi-branch structure, model compression