Anomaly detection of stream data based on grid density stacking

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (01): 75-85.

• Computer Network and Information Security •

WU Peicheng, ZHAO Xujun, JIN Lizhong

(College of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China)

Received: 2023-06-15 Revised: 2023-10-30 Accepted: 2025-01-25 Online: 2025-01-25 Published: 2025-01-18

Funding: National Natural Science Foundation of China (61572343, U1931209); Applied Basic Research Program of Shanxi Province (20210302123223, 202103021224275)

Abstract

Abstract: Most stream data anomaly detection algorithms use a single sliding-window model. This causes a large number of data points to be processed repeatedly, and the replacement of neighbors as the window slides interferes with the evaluation of anomaly points, which reduces detection accuracy. To address these issues, a combined window model is proposed that uses several non-overlapping windows as the detection range for anomaly points. On top of this model, an anomaly detection algorithm based on grid density stacking is proposed. First, the kernel density estimation function is optimized and used to calculate the local density of data points. Second, a grid density stacking operation is proposed to measure anomalous grids. Within anomalous grids, the final anomalous data are determined by calculating an anomaly score for each data point. To improve efficiency, an adaptive pruning strategy is proposed that prunes regions where anomaly points are unlikely to appear. Experimental results show that, compared with existing stream data anomaly detection algorithms, the proposed algorithm has clear advantages in both efficiency and accuracy.

Key words: anomaly detection, stream data, kernel density estimation, grid density stacking
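
The abstract does not give the formulas, so the following is only a rough illustration of the general idea, not the authors' algorithm. It assumes a plain Gaussian kernel (the paper uses an optimized kernel density estimate that is not specified here), a fixed cell width, and a fixed threshold; all function names and parameters (`split_windows`, `cell_of`, `stacked_cell_density`, `sparse_cells`, `width`, `bandwidth`, `threshold`) are hypothetical. The adaptive pruning strategy and the per-point anomaly score are not modeled.

```python
import math
from collections import defaultdict


def split_windows(stream, size):
    """Cut the stream into consecutive, non-overlapping windows."""
    return [stream[i:i + size] for i in range(0, len(stream), size)]


def cell_of(point, width):
    """Map a point to the integer coordinates of its grid cell."""
    return tuple(math.floor(x / width) for x in point)


def gaussian_density(point, neighbours, bandwidth):
    """Plain Gaussian kernel density at `point` (assumption: the paper's
    optimized estimator is not reproduced; normalising constant omitted)."""
    if not neighbours:
        return 0.0
    total = 0.0
    for q in neighbours:
        total += math.exp(-math.dist(point, q) ** 2 / (2 * bandwidth ** 2))
    return total / len(neighbours)


def stacked_cell_density(windows, width, bandwidth):
    """Sum per-point densities into grid cells and accumulate ("stack")
    the cell totals over all windows of the combined window."""
    stacked = defaultdict(float)
    for window in windows:
        for p in window:
            stacked[cell_of(p, width)] += gaussian_density(p, window, bandwidth)
    return dict(stacked)


def sparse_cells(stacked, threshold):
    """Cells whose stacked density stays below a (hypothetical) threshold
    are treated as candidate anomalous grids."""
    return {cell for cell, density in stacked.items() if density < threshold}


# Toy stream: one dense cluster near the origin and one isolated point.
stream = [(0.1, 0.1), (0.2, 0.3), (0.4, 0.2), (9.5, 9.5),
          (0.3, 0.3), (0.2, 0.1), (0.1, 0.4), (0.3, 0.2)]
windows = split_windows(stream, size=4)
stacked = stacked_cell_density(windows, width=1.0, bandwidth=0.5)
candidates = sparse_cells(stacked, threshold=1.0)
```

In this toy run the cell containing the isolated point accumulates far less density than the cluster cell, so it is the only candidate; in the paper, points inside such candidate grids are then ranked by an anomaly score.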

WU Peicheng, ZHAO Xujun, JIN Lizhong. Anomaly detection of stream data based on grid density stacking[J]. Computer Engineering & Science, 2025, 47(01): 75-85.
