• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2022, Vol. 44 ›› Issue (08): 1440-1448.

• 图形与图像 • 上一篇    下一篇

基于多尺度优化感知网络的口罩检测方法

苟淞,赵绪言,侯松,李威   

  1. (西南交通大学计算机与人工智能学院,四川 成都 611756)
  • 收稿日期:2020-12-30 修回日期:2021-03-29 接受日期:2022-08-25 出版日期:2022-08-25 发布日期:2022-08-25
  • 基金资助:
    国家自然科学基金(61772436,62001400);中央高校基本科研业务费(A0920502052001-3/251);四川省科技计划(2021YJ0364)

A mask detection method based on multi-scale optimized awareness network

GOU Song,ZHAO Xu-yan,HOU Song,LI Wei   

  1. (School of Computing and Artifical Intelligence,Southwest Jiaotong University,Chengdu 611756,China)
  • Received:2020-12-30 Revised:2021-03-29 Accepted:2022-08-25 Online:2022-08-25 Published:2022-08-25

摘要: 佩戴口罩是全球医学专家公认最有效的预防新冠肺炎感染的方法之一。基于视觉的智能口罩检测技术对于督促人们在公共场合佩戴口罩具有重要的作用。然而,目前专用口罩检测算法较为缺乏,通用目标检测算法对于多尺度、多角度和外观多样的戴口罩人脸目标识别仍然无法满足检测精度的要求。针对该问题,提出了一种基于多尺度优化感知网络的口罩检测方法——PyramidMask。首先,PyramidMask从骨干网络的不同尺度获取图像的多层特征;然后设计尺度感知分支进行不同层的高密度先验框独立预测,以端到端的方式对图像中多尺度的人脸进行精准定位和佩戴口罩检测。此外,为了提高模型对复杂环境的鲁棒性,在训练阶段以图像拼接的方式对训练样本进行数据增强。实验结果表明,在公开的口罩检测数据集上,PyramidMask优于当前主流方法。相较于基准方法,PyramidMask在检测戴与未戴口罩的召回率上分别有5.4%和12.5%的提升,精确率上分别有6.0%和4.1%的提升。

关键词: 新冠肺炎, 口罩检测, 计算机视觉, 深度学习

Abstract: Wearing a mask is recognized by global medical experts as one of the most effective ways to prevent COVID-19 infection. The vision-based intelligent mask detection plays an important role in urging people to wear masks in public. However, compared with general object detection, there are currently few studies focusing on  mask detection. To solve the problem, an optimized multi-scale awareness network, called PyramidMask, is proposed for  mask detection. Firstly, PyramidMask obtains the multi-layer features of the image from different scales of the backbone. Secondly, the scale-awareness branches are designed to perform independent predictions of different layers of high-density candidate boxes. Finally, the multi-scale faces with masks in an image is accurately detected in an end-to-end manner. In addition, in order to improve the robustness of PyramidMask under complex scenes, the training samples are augmented by image stitching in the training stage. The experimental results show that PyramidMask outperforms the state-of-the-art methods on the public mask detection dataset. Compared with the benchmark, PyramidMask improves 5.4% and 12.5% in the recall of detection with and without masks, and 6.0% and 4.1% in the precision of detection with and without masks.

Key words: COVID-19, face mask detection, computer vision, deep learning