• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (04): 665-673.

• Graphics and Images • Previous Articles     Next Articles

Research on dynamic gesture recognition based on multimodal fusion

HU Zong-cheng1,DUAN Xiao-wei2,ZHOU Ya-tong1,HE Hao1   

  1. (1.School of Electronic and Information Engineering,Hebei University of Technology,Tianjin 300401;
    2.The 29th Research Institute of China Electronics Technology Group Corporation,Chengdu 610036,China)
  • Received:2021-05-21 Revised:2021-11-02 Accepted:2023-04-25 Online:2023-04-25 Published:2023-04-13

Abstract: Aiming at the problems of low accuracy and weak robustness of dynamic gesture recognition in complex environment, a dynamic gesture recognition algorithm based on multimodal fusion, named TF-MG, is proposed. TF-MG combines the depth information and hand skeleton information, extracts the corresponding feature information using two different networks, and then fuses the extract- ed features into the classification network to realize dynamic gesture recognition. According to the depth information, the motion history image method is used to compress the motion trajectory into a single frame image, and the feature is extracted by MobileNetV2. According to the hand skeleton information, DeepGRU composed of gated recurrent units is used to extract features from the hand skeleton information. The experimental results show that, on DHG-14/28 dataset, the recognition accuracy of 14 kinds of hand gestures reaches 93.29%, and that of 28 kinds of hand gestures reaches 92.25%. Compared with other algorithms, it achieves higher recognition accuracy.

Key words: multimodality, dynamic gesture recognition, gated recurrent unit, convolutional neural network