基于条件扩散概率模型的视频异常检测

计算机工程与科学

基于条件扩散概率模型的视频异常检测

叶亚琴, 汤子健, 牛嘉诚, 张新欢

(1. 中国地质大学(武汉)计算机学院，武汉 430078；
2. 中国地质大学(武汉)湖北省智能地理信息处理重点实验室，武汉 430078)

出版日期:2025-06-12 发布日期:2025-06-12

Yaqin Ye, Zijian Tang , Jiacheng Niu, Xinhuan Zhang

(1. School of Computer Science, China University of Geosciences, Wuhan, 430078, China;
2. Hubei Key Laboratory of Intelligent Geo-Information Processing, China University of Geosciences, Wuhan, 430078, China)

Online:2025-06-12 Published:2025-06-12

摘要/Abstract

摘要： 视频异常检测在现代社会中越来越重要，由于视频中存在多样的模糊行为，并且类别无法穷举，基于单分类的方法难以界定正常和异常。针对以上问题，本文提出了基于条件扩散概率模型的视频异常检测模型CDiffuVAD。该方法首先设计了一个常态内容生成器，用于提高模型生成图像的内容准确程度。它通过记忆池增强模型对正常样本分布特征的理解,并借助扩散概率模型来学习视频数据的复杂分布。其次，设计引入了隐式运动条件来学习视频片段的时空特征，将双向光流信息作为扩散过程的隐式运动条件，并采用坐标归一化方法提供片段帧的坐标嵌入，从而实现对多帧序列数据中的运动趋势的学习拟合，缓解模型对视频中的硬正态信息敏感。最终实验表明，所提出的方法分别在Avenue数据集、ShanghaiTech数据集和UBnormal数据集上达到帧级AUC 85.7%，75.5%以及65.7%的精度，表明其可以发现正常样本中多样的特征，并在视频异常检测任务上具有有效性。

关键词: 视频异常检测, 扩散模型, 不确定性, 记忆池, 生成模型

Abstract: Video anomaly detection is becoming more and more important in modern society. Because there are various fuzzy behaviors in videos and the categories can not be exhaustive, it is difficult to define normal and abnormal based on single-classification methods. In response to the aforementioned issues, a Conditional Diffusion Probabilistic Model for Video Anomaly Detection (CDiffuVAD) is proposed. Firstly, a normal content generator is designed to improve the content accuracy of the image generated by the model. It highlights the distribution pattern of normal samples through a memory pool and leverages the diffusion probability model to learn the complex distribution of video data. Secondly, implicit motion conditions are designed and introduced to learn the spatiotemporal features of video segments. The bidirectional optical flow information is used as the implicit motion condition of the diffusion process, and the coordinate normalization method is used to provide the coordinate embedding of the segment frame, so as to realize the learning and fitting of the motion trend in the multi-frame sequence data and alleviate the sensitivity of the model to the hard normal information in the video. Finally, experiments that the proposed method achieves 85.7%, 75.5% and 65.7% accuracy of frame-level AUC on Avenue dataset, ShanghaiTech dataset and UBnormal dataset respectively, indicating that it can find diverse features in normal samples and the effectivity in video anomaly detection tasks.

Key words: video anomaly detection, diffusion model, uncertainty, memory pool, generative model

叶亚琴, 汤子健, 牛嘉诚, 张新欢. 基于条件扩散概率模型的视频异常检测[J]. 计算机工程与科学.

Yaqin Ye, Zijian Tang , Jiacheng Niu, Xinhuan Zhang. [J]. Computer Engineering & Science.

[1]	袁程胜1, 2, 陈金瑞1, 2, 徐晨维3, 刘庆程1, 2, 付章杰1, 2. 基于细粒度局部伪影的生成式图像检测[J]. 计算机工程与科学, 2025, 47(8): 1449-1458.
[2]	薛锦云1, 周智鹏1, 2, 薛慧琦3, 易心武1, 2, 李志辉1, 2, 刘智高1, 2. 基于Unity 3D的灭火器虚拟消防模拟[J]. 计算机工程与科学, 2025, 47(6): 1090-1096.
[3]	李航, 陈志刚, 王易杰, 张心宇, 雷惊鸿, 刘凌枫. 基于时空图注意力状态空间模型的人体姿态异常检测研究[J]. 计算机工程与科学, 2025, 47(10): 1830-1840.
[4]	李召恺, 马占有, 李健祥, 郭昊. 基于模糊决策过程的模糊计算树逻辑模型检测[J]. 计算机工程与科学, 2022, 44(2): 266-275.
[5]	吴翠先1,2,3，何少元1,2. 基于区间数的不确定性数据聚类算法:UD-OPTICS[J]. 计算机工程与科学, 2019, 41(7): 1303-1311.
[6]	高长元1,2，王海晶1，王京1,2. 基于改进CURE算法的不确定性移动用户数据聚类[J]. J4, 2016, 38(4): 768-774.
[7]	李贯峰，陈冬梅. 基于证据理论的不确定模式匹配方法[J]. J4, 2014, 36(6): 1108-1113.
[8]	贺怀清，王赫. 恐怖袭击事件不确定性度量及可视分析[J]. J4, 2012, 34(9): 77-82.
[9]	滕书华，鲁敏，张军，谭志国，庄钊文. 信息系统中的熵理论和信息粒度[J]. J4, 2012, 34(4): 94-101.
[10]	吴佳伟，刘国华，王梅. 匿名隐私保护模型中不确定性数据的建模问题研究[J]. J4, 2011, 33(9): 7-12.
[11]	吴涛,金义富. CDDL:动态描述逻辑的不确定性扩展[J]. J4, 2011, 33(2): 142-148.
[12]	李义杰,蒋靖,程政. 基于贝叶斯网络的软件项目进度管理模型[J]. J4, 2011, 33(11): 140-143.
[13]	张殿旭1 ,2,张怡1,刘晓阳3,曾星2,彭军4. 良性蠕虫SRF扩散模型研究[J]. J4, 2010, 32(7): 19-22.
[14]	刘祥远陈书明. 高性能VLSI设计中时钟分布网络的问题与解决方法[J]. J4, 2007, 29(6): 89-92.
[15]	李克文吴孟达张雄明. 约简的一种启发式算法[J]. J4, 2004, 26(1): 92-94.

基于条件扩散概率模型的视频异常检测

PDF

可视化

摘要/Abstract

引用本文

使用本文

相关文章 15

编辑推荐

Metrics

本文评价