基于深度卷积神经网络的小目标检测算法

计算机工程与科学

基于深度卷积神经网络的小目标检测算法

李航1,2，朱明1

(1.中国科学院长春光学精密机械与物理研究所，吉林长春 130033；2.中国科学院大学,北京 100049)

收稿日期:2019-08-29 修回日期:2019-11-26 出版日期:2020-04-25 发布日期:2020-04-25

A small object detection algorithm based

on deep convolutional neural network

LI Hang1,2，ZHU Ming1

（1.Changchun Institute of Optics,Fine Mechanics and Physics,Chinese Academy of Sciences,Changchun 130033;

2.University of Chinese Academy of Sciences,Beijing 100049,China）

Received:2019-08-29 Revised:2019-11-26 Online:2020-04-25 Published:2020-04-25

摘要/Abstract

摘要：

针对YOLO目标检测算法在小目标检测方面存在的不足，以及难以在嵌入式平台上达到实时性的问题，设计出了一种基于YOLO算法改进的dense_YOLO目标检测算法。该算法共分为2个阶段：特征提取阶段和目标检测回归阶段。在特征提取阶段，借鉴DenseNet结构的思想，设计了新的基于深度可分离卷积的slim-densenet特征提取模块，增强了小目标的特征传递，减少了参数量，加快了网络的传播速度。在目标检测阶段，提出自适应多尺度融合检测的思想，将提取到的特征进行融合，在不同的特征尺度上进行目标的分类和回归，提高了对小目标的检测准确率。实验结果表明：在嵌入式平台上，针对小目标，本文提出的dense_YOLO目标检测算法相较原YOLO算法mAP指标提高了7%，单幅图像检测时间缩短了15 ms，网络模型大小减少了90 MB，明显优于原算法。

关键词: 目标检测, 嵌入式平台, 小目标, 深度卷积神经网络, 多尺度预测

Abstract:

In view of the shortcomings of YOLO object detection algorithm in small object detection, and the difficulty of achieving real-time performance on embedded platforms, this paper designs an improved YOLO object detection algorithm, called dense_YOLO. The algorithm contains two phases: feature extraction phase and object detection regression phase. In the feature extraction phase, based on the idea of DenseNet structure, a new slim-densenet feature extraction module based on deep separable convolution is designed, which enhances the transmission of small object features and reduces the parameter quantity to accelerate the network propagation speed. In the object detection stage, the idea of adaptive multi-scale fusion detection is proposed to fuse the extracted features, and the objects are classified and regressed on different feature scales, which improves the detection accuracy of small objects. Experimental results show that, compared with the original YOLO object detection algorithm, the dense_YOLO object detection algorithm improves mAP by 7%, decreases the single picture detection time by 15 ms, and reduces the model size by 90 MB.

Key words: object detection, embedded platform, small object, convolutional neural network, multi-scale prediction

李航1,2，朱明1. 基于深度卷积神经网络的小目标检测算法[J]. 计算机工程与科学.

LI Hang1,2，ZHU Ming1.

A small object detection algorithm based

on deep convolutional neural network

[J]. Computer Engineering & Science.

[1]	张永智, 何可人, 戈珏. 改进YOLOv7网络在低空遥感图像目标检测中的应用[J]. 计算机工程与科学, 2024, 46(07): 1269-1277.
[2]	王泽宇, 徐慧英, 朱信忠, 李琛, 刘子洋, 王子奕. 基于YOLOv8改进的密集行人检测算法：MER-YOLO[J]. 计算机工程与科学, 2024, 46(06): 1050-1062.
[3]	胡昭华, 王长富, . 改进Faster R-CNN的遥感图像小目标检测算法[J]. 计算机工程与科学, 2024, 46(06): 1063-1071.
[4]	赵金源, 贾迪. 改进YOLOv5的多人姿态估计修正算法[J]. 计算机工程与科学, 2024, 46(05): 852-860.
[5]	黄珍伟, 陈伟, 王文杰, 路锦通. 基于改进 RetinaNet网络的水下机器人目标检测与实验[J]. 计算机工程与科学, 2024, 46(02): 264-271.
[6]	江志鹏, 王自全, 张永生, 于英, 程彬彬, 赵龙海, 张梦唯. 基于改进Deformable DETR的无人机视频流车辆目标检测算法[J]. 计算机工程与科学, 2024, 46(01): 91-101.
[7]	张骞, 陈紫强, 孙宗威, 赖镜安. 融合高分辨率网络的雾天目标检测算法[J]. 计算机工程与科学, 2023, 45(11): 1970-1981.
[8]	赵玥, 肖梦燕, 邱宝军, 罗军, 王小强, 罗道军. 基于机器视觉的集成电路声扫图像缺陷检测软件设计[J]. 计算机工程与科学, 2023, 45(10): 1806-1813.
[9]	余子丞, 凌捷. 基于Transformer和多特征融合的DGA域名检测方法[J]. 计算机工程与科学, 2023, 45(08): 1416-1423.
[10]	刘浩翰, 孙铖, 贺怀清, 惠康华. 基于改进YOLOv3的金属表面缺陷检测[J]. 计算机工程与科学, 2023, 45(07): 1226-1235.
[11]	李校林, 王复港, 张鹏飞, 张琳玉, . 基于多尺度特征提取的YOLOv5s算法优化[J]. 计算机工程与科学, 2023, 45(06): 1054-1062.
[12]	邓姗姗, 黄慧, 马燕. 基于改进Faster R-CNN的小目标检测算法[J]. 计算机工程与科学, 2023, 45(05): 869-877.
[13]	霍爱清, 张书涵, 杨玉艳, 胥静蓉, 王泽文. 密集交通场景中改进YOLOv3目标检测优化算法[J]. 计算机工程与科学, 2023, 45(05): 878-884.
[14]	贾志, 李茂军, 李婉婷. 基于改进YOLOv5+DeepSort算法模型的交叉路口车辆实时检测[J]. 计算机工程与科学, 2023, 45(04): 674-682.
[15]	黄星威, 陈曦, 张塑凡. 改进特征金字塔的小目标深度学习模型[J]. 计算机工程与科学, 2023, 45(04): 734-742.