• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (1): 130-139.

• Graphics and Images • Previous Articles     Next Articles

Video anomaly detection with improved attention hybrid auto-encoder

CHEN Zhaobo1,ZHANG Lin1,2,MA Xiaoxuan1   

  1. (1.School of Electrical and Information Engineering,Beijing University of Civil Engineering and Architecture,Beijing 102616;
    2.Beijing Key Laboratory of Intelligent Processing for Building Big Data,
    Beijing University of Civil Engineering and Architecture,Beijing 102616,China)
  • Received:2023-08-22 Revised:2024-03-12 Online:2025-01-25 Published:2025-01-18

Abstract: Video anomaly detection is one of the important research areas in computer vision, widely applied in fields such as transportation and public safety. However, the current field of video anomaly detection faces issues such as susceptibility to noise interference in individual prediction models and generalization anomalies in individual reconstruction models. To address these problems, a video anomaly detection method combining reconstruction and prediction models is proposed. A reconstruction network with an attention mechanism and a memory enhancement module is trained on normal optical flow data. The reconstructed optical flow and original video frames are then simultaneously input into a future frame prediction network, where the reconstructed optical flow serves as a conditional aid to assist the frame prediction network in better generating future frames. To extract more effective features, a residual convolutional attention module (SRCAM) is proposed to facilitate the reconstruction and prediction networks in effectively learning feature representations of latent spaces at both global and local levels, thereby enhancing the model's ability to detect anomalous events in videos and improving its robustness. Extensive experimental evaluations on two commonly used video anomaly detection datasets, UCSD Ped2 and CUHK Avenue, demonstrate the effectiveness of the proposed method.

Key words: video anomaly detection, attention mechanism, stream reconstruction, frame prediction, auto-encoder