• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (06): 1050-1062.

• Graphics and Images • Previous Articles     Next Articles

An improved dense pedestrian detection algorithm based on YOLOv8: MER-YOLO

WANG  Ze-yu1,XU Hui-ying1,ZHU Xin-zhong1,LI Chen1,LIU Zi-yang1,WANG Zi-yi2   

  1. (1.College of Computer Science and Technology,Zhejiang Normal University,Jinhua 321004,China;
    2.Computer Science Department,School of Engineering,The University of Manchester,Manchester M139PL,UK)

  • Received:2023-08-05 Revised:2023-10-15 Accepted:2024-06-25 Online:2024-06-25 Published:2024-06-18

Abstract: In large-scale crowded places, abnormal crowd gathering occurs from time to time, which brings certain challenges to the dense pedestrian detection technology involved in application scenarios such as autonomous driving and large-scale public place crowd monitoring systems. The new generation of dense pedestrian detection technology requires higher accuracy, smaller computing overhead, faster detection speed and more convenient deployment. In view of the above requirements, a lightweight dense pedestrian detection algorithm MER-YOLO based on YOLOv8 is proposed, which first uses MobileViT as the backbone network to improve the overall feature extraction ability of the model in pedestrian gathering areas. The EMA attention mechanism module is introduced to encode the global information, further aggregate pixel-level features through dimensional interaction, and strengthen the detection ability of small targets by combining the detection head with 160×160 scale. The use of Repulsion Loss as the bounding box loss function reduces the missed detection and misdetection of small target pedestrians under dense crowds. The experimental results show that compared with YOLOv8n, the mAP@0.5 of the MER-YOLO pedestrian detection algorithm is improved by 4.5% on the Crowd Human dataset and 2.1% on the WiderPerson dataset, while only 3.1×106 parameters and 9.8 GFLOPs, which meet the deployment requirements of low computing power and high precision.

Key words: object detection, pedestrian detection, light weight, attention mechanism

CLC Number: