基于深度神经网络的行人头部检测

计算机工程与科学

基于深度神经网络的行人头部检测

陶祝,刘正熙,熊运余,李征

(四川大学计算机学院，四川成都 610000）

收稿日期:2017-06-14 修回日期:2017-08-15 出版日期:2018-08-25 发布日期:2018-08-25
基金资助:
国家自然科学基金（61471250）

Pedestrian head detection based on deep neural networks

TAO Zhu,LIU Zhengxi,XIONG Yunyu,LI Zheng

(College of Computer,Sichuan University,Chengdu 610000,China)

Received:2017-06-14 Revised:2017-08-15 Online:2018-08-25 Published:2018-08-25

摘要/Abstract

摘要：

行人检测已成为安防、智能视频监控、景区人流量统计所依赖的核心技术，最新目标检测方法包括快速的区域卷积神经网络FastRCNN、单发多重检测器
SSD、部分形变模型DPM等，皆为对行人整体的检测。在大场景下，行人姿态各异，物体间遮挡频繁,只有通过对行人身体部分位置建模，抓住人的局部特征，才能实现准确的定位。利用FasterRCNN深度网络原型，针对行人头部建立检测模型，同时提取行人不同方向的头部特征，并加入空间金字塔池化层，保证检测速率，有效解决大场景下行人的部分遮挡问题，同时清晰地显示人群大致流动方向，相比普通的人头估计，更有利于人流量统计。

关键词: 视频分析, 行人检测, 卷积神经网络, FasterRCNN, 空间金字塔池化层

Abstract:

Pedestrian detection has become the core technology that security, intelligent video surveillance, and traffic statistics of people in the scenic area depend on. The latest object detection methods such as FastRegions with Convolution Neural Network (FastRCNN), Faster RCNN, Single Shot Multibox Detector (SSD), Deformable Part Models (DPM) are currently the classic algorithms for object detection. However, these algorithms pay more attention to detect the whole pedestrians. In large scenes, pedestrians have different postures and some of them are occluded frequently. Only modeling the position of the pedestrian’s body and grasping the local features of the pedestrians can achieve accurate positioning. The FasterRCNN deep network prototype is adopted, a detection model is built for pedestrian heads, head features in different directions are extracted at the same time, and a spatial pyramid pooling layer is added to ensure the detection rate. These can effectively solve the partial occlusion problem of pedestrians in large scenes and clearly show the general flow direction of pedestrians. The proposal is more conducive to the flow statistics than the ordinary head estimation.

Key words: video analysis, pedestrian detection, convolution neural network, Faster-RCNN, spatial pyramid pooling layer

陶祝,刘正熙,熊运余,李征. 基于深度神经网络的行人头部检测[J]. 计算机工程与科学.

TAO Zhu,LIU Zhengxi,XIONG Yunyu,LI Zheng. Pedestrian head detection based on deep neural networks[J]. Computer Engineering & Science.

[1]	肖振久, 李思琦, 曲海成. 基于多尺度特征与互监督的拥挤行人检测[J]. 计算机工程与科学, 2024, 46(07): 1278-1285.
[2]	田红鹏, 吴璟玮. RIB-NER：基于跨度的中文命名实体识别模型[J]. 计算机工程与科学, 2024, 46(07): 1311-1320.
[3]	王泽宇, 徐慧英, 朱信忠, 李琛, 刘子洋, 王子奕. 基于YOLOv8改进的密集行人检测算法：MER-YOLO[J]. 计算机工程与科学, 2024, 46(06): 1050-1062.
[4]	尹春勇, 赵峰. 基于双层注意力和深度自编码器的时间序列异常检测模型[J]. 计算机工程与科学, 2024, 46(05): 826-835.
[5]	马长林, 孙状. 基于实体知识的远程监督关系抽取[J]. 计算机工程与科学, 2024, 46(05): 945-950.
[6]	陈杰, 李程, 刘仲. 面向多核向量加速器的卷积神经网络推理和训练向量化方法[J]. 计算机工程与科学, 2024, 46(04): 580-589.
[7]	曹浩东, 汪海涛, 贺建峰. 融合序列局部信息的日期感知序列推荐算法[J]. 计算机工程与科学, 2024, 46(04): 734-742.
[8]	秦文强, 吴仲城, 张俊, 李芳, . 基于异构平台的卷积神经网络加速系统设计[J]. 计算机工程与科学, 2024, 46(01): 12-20.
[9]	周理, 赵祉乔, 潘国腾, 铁俊波, 赵王. 基于RISC-V的图卷积神经网络加速器设计[J]. 计算机工程与科学, 2023, 45(12): 2113-2120.
[10]	梁秀满, 周佳润, 杨若兰. LPD-YOLO：轻量级遮挡行人检测模型[J]. 计算机工程与科学, 2023, 45(12): 2197-2205.
[11]	余子丞, 凌捷. 基于Transformer和多特征融合的DGA域名检测方法[J]. 计算机工程与科学, 2023, 45(08): 1416-1423.
[12]	刘俊奇, 涂文轩, 祝恩. 图卷积神经网络综述[J]. 计算机工程与科学, 2023, 45(08): 1472-1481.
[13]	易啸, 马胜, 肖侬. 深度学习加速器在不同剪枝策略下的运行优化[J]. 计算机工程与科学, 2023, 45(07): 1141-1148.
[14]	曹玉东, 陈冬昊, 曹睿, 赵朗. 融合Mask R-CNN的在线多目标行人跟踪方法[J]. 计算机工程与科学, 2023, 45(07): 1216-1225.
[15]	崔克彬, 崔叶微. 基于卷积和Transformer的断路器动触头跟踪方法研究[J]. 计算机工程与科学, 2023, 45(07): 1236-1244.