• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (10): 1814-1821.

• Graphics and Images • Previous Articles     Next Articles

A video human behavior recognition method based on improved 3D ResNet

NIU Wei-hua1,2,ZHAI Rui-bing1   

  1. (1.Department of Computer,North China Electric Power University,Baoding 071003;
    2.Engineering Research Center of Intelligent Computing for Complex Energy Systems,
    Ministry of Education,North China Electric Power University,Baoding 071003,China)
  • Received:2022-08-07 Revised:2022-11-26 Accepted:2023-10-25 Online:2023-10-25 Published:2023-10-17

Abstract: Aiming at the temporal characteristics of human behavior in videos, a video human beha- vior recognition method is proposed that combines asymmetric convolution and CBR modules. This method uses 3D ResNet-50 as the backbone network. First, the larger convolutions in the network are changed to the concatenation of two asymmetric 3D convolutions, which deepens the local key feature extraction of the convolution layer in the horizontal and vertical directions. Secondly, CBR module is added to improve the number of network layers. The network extracts multi-angle features of images and time series from continuous video frame sequences, classifies them according to the feature data, and finally outputs the recognition results. Extensive experimental results on the benchmark dataset UCF101 show that the Top1 and Top5 accuracy of the proposed method are improved by 4.03% and 4.99%, respectively, compared with the original 3D ResNet network, and the recognition accuracy of this method is also better than other mainstream methods.

Key words: human behavior recognition, 3D convolution, 3D ResNet, asymmetric convolution, UCF101 dataset