YOLOv5s algorithm optimization based on multi-scale feature extraction

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (06): 1054-1062.

• Graphics and Images • Previous Articles Next Articles

YOLOv5s algorithm optimization based on multi-scale feature extraction

LI Xiao-lin1,2，WANG Fu-gang1,2，ZHANG Peng-fei1,2，ZHANG Lin-yu1,2

(1.School of Communication and Information Engineering,
Chongqing University of Posts and Telecommunications,Chongqing 400065;
2.Research Center of New Telecommunication Technology,
Chongqing University of Posts and Telecommunications,Chongqing 400065,China)

Received:2021-12-02 Revised:2022-05-01 Accepted:2023-06-25 Online:2023-06-25 Published:2023-06-16

Abstract

Abstract: Object detection algorithms are widely used in unmanned driving, robot vision, industrial automation and other fields, and have important research value. Among many target detection algorithm, YOLOv5s has the advantages of fast detection speed and small parameter scale, but also has the problem of low detection accuracy. Aiming at the problem that the YOLOv5s standard convolution module has weak feature extraction capabilities and feature redundancy, two convolution modules based on multi-scale feature extraction are proposed. Firstly, a multi-receptive field convolution module is proposed to improve the feature extraction ability of the model. It obtains semantic information of different granularities through convolution kernels of multiple sizes. Secondly, a feature map convolution module is proposed to improve the diversity of feature maps. It uses a small number of standard convolution kernels and grouped convolutions to reduce the mutual constraints between feature channels. Finally, some standard convolution modules of YOLOv5s are replaced by multi-receptive field convolution module and feature map convolution module, and the improved algorithm in this paper is obtained.The experimental results on Pascal VOC data set show that the improved algorithm not only improves the detection accuracy, but also maintains the real-time detection ability of YOLOv5s. mAP_0.5 and mAP_0.5:0.95 are increased by 2.4% and 4.9% respectively, which proved the effectiveness of the improved algorithm. It is further verified on DOTA data set that the improved algorithm has good generalization ability in different environments.

Key words: object detection, multi-scale feature, receptive field, feature redundancy

LI Xiao-lin, WANG Fu-gang, ZHANG Peng-fei, ZHANG Lin-yu, . YOLOv5s algorithm optimization based on multi-scale feature extraction[J]. Computer Engineering & Science, 2023, 45(06): 1054-1062.

[1]	MA Jin-lin, YAN Qi, MA Zi-ping. A multi-layer mask recognition method for Tangut characters [J]. Computer Engineering & Science, 2024, 46(12): 2227-2238.
[2]	CAO Yu-qi, XU Hui-ying, ZHU Xin-zhong, HUANG Xiao, CHEN Chen, ZHOU Si-yu, SHENG Ke. An improved fighting behavior recognition algorithm based on YOLOv8: EFD-YOLO [J]. Computer Engineering & Science, 2024, 46(10): 1825-1834.
[3]	CHEN Qing-jiang, SHAO Fei, WANG Xuan-jun. Hybrid U-shaped network and Transformer for image deblurring [J]. Computer Engineering & Science, 2024, 46(10): 1843-1851.
[4]	CHEN Lei, LIANG Zheng-you, SUN Yu, CAI Jun-min. Mobile monocular depth estimation based on multi-scale feature fusion [J]. Computer Engineering & Science, 2024, 46(09): 1616-1524.
[5]	CHEN Chen, XU Hui-ying, ZHU Xin-zhong, HUANG Xiao, SONG Jie, CAO Yu-qi, ZHOU Si-yu, SHENG Ke. FDW-YOLO:An improved indoor pedestrian fall detection algorithm based on YOLOv8 [J]. Computer Engineering & Science, 2024, 46(08): 1455-1465.
[6]	WANG Ze-yu, XU Hui-ying, ZHU Xin-zhong, LI Chen, LIU Zi-yang, WANG Zi-yi. An improved dense pedestrian detection algorithm based on YOLOv8: MER-YOLO [J]. Computer Engineering & Science, 2024, 46(06): 1050-1062.
[7]	ZHANG Wen-hao, QU Shao-jun. Retinal vessel segmentation based on multi-scale attention feature fusion network with dual-decoder structure [J]. Computer Engineering & Science, 2023, 45(12): 2175-2185.
[8]	LI Zhuo-xuan, ZHOU Ya-tong. iSFF-DBNet:An improved text detection algorithm in e-commerce images [J]. Computer Engineering & Science, 2023, 45(11): 2008-2017.
[9]	CUI Ke-bin, CUI Ye-wei. A circuit breaker moving contact tracking methods based on convolution and Transformer [J]. Computer Engineering & Science, 2023, 45(07): 1236-1244.
[10]	HUANG Xing-wei, CHEN Xi, ZHANG Su-fan. A deep learning model based on improved feature pyramid networks for small object detection [J]. Computer Engineering & Science, 2023, 45(04): 734-742.
[11]	SUN Qi, ZHAI Rui, ZUO Fang, ZHANG Yu-tao, . Facial image inpainting based on partial convolution and multi-scale feature integration [J]. Computer Engineering & Science, 2023, 45(02): 304-312.
[12]	WANG Guan-bo, ZHAO Yi-fan, LI Bo, YANG Jun-dong, DING Hong-wei. Real-time flame detection with improved YOLO v4-tiny [J]. Computer Engineering & Science, 2022, 44(12): 2196-2205.
[13]	LUO Yue-tong, DUAN Chang, JIANG Pei-feng, ZHUO Bo. An improved industrial defect data augmentation method based on pix2pix [J]. Computer Engineering & Science, 2022, 44(12): 2206-2212.
[14]	ZHANG Hai-yan, FU Ying-na, DING Gui-jiang, MENG Qing-yan. Towards Anchor-free object detection with diverse receptive fields attention feature refinement network [J]. Computer Engineering & Science, 2022, 44(11): 1995-2002.
[15]	LI Lan, LIU Jie, ZHANG Jie. A complex pedestrian detection model based on improved YOLOv4 algorithm [J]. Computer Engineering & Science, 2022, 44(08): 1449-1456.

YOLOv5s algorithm optimization based on multi-scale feature extraction

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 15

Recommended Articles 0

Metrics

Comments