An interpolation based outlier detection method of sparse high-dimensional data

Abstract:

Data in an outlier detection problem can be viewed as a mixture of normal and abnormal points in a space. Provided that the loss of normal points is kept low, outliers usually lie in the sample sets farthest from all cluster centroids. Inspired by this idea, this paper proposes an interpolation-based outlier detection method for sparse high-dimensional data. On the basis of k-means clustering, the method applies a genetic algorithm to interpolate the original data, which addresses the tendency of sparse data to be merged during k-means clustering. Experimental results show that, compared with traditional outlier detection methods based on k-means clustering and several typical detection methods based on improved k-means clustering, the proposed method not only loses fewer normal points but also improves the accuracy and precision of detection.

Key words: sparse data, outlier detection, interpolation, clustering, genetic algorithm
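
The baseline idea stated in the abstract, that outliers tend to lie farthest from all cluster centroids, can be illustrated with a minimal sketch. It covers only the plain k-means scoring step and does not attempt the genetic-algorithm interpolation, whose details the abstract does not give. The function names `kmeans` and `farthest_points`, the deterministic initialisation from the first k points, and the toy data are all assumptions made for illustration, not the authors' implementation.

```python
import math


def kmeans(points, k, iters=50):
    """Plain Lloyd's k-means on a list of equal-length numeric tuples.

    Assumption: centroids start at the first k points so the run is
    reproducible; real implementations usually use random or k-means++ init.
    """
    centroids = [tuple(p) for p in points[:k]]
    for _ in range(iters):
        # Assignment step: attach every point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: math.dist(p, centroids[j]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j, members in enumerate(clusters):
            if members:
                mean = tuple(sum(axis) / len(members) for axis in zip(*members))
                new_centroids.append(mean)
            else:
                new_centroids.append(centroids[j])  # empty cluster: keep old centroid
        if new_centroids == centroids:
            break  # assignments are stable
        centroids = new_centroids
    return centroids


def farthest_points(points, centroids, n_outliers):
    """Return the n_outliers points with the largest distance to their nearest centroid."""
    def score(p):
        return min(math.dist(p, c) for c in centroids)
    return sorted(points, key=score, reverse=True)[:n_outliers]


# Toy data (hypothetical): two tight groups plus one isolated point.
points = [(0, 0), (10, 10), (0, 1), (1, 0), (1, 1),
          (10, 11), (11, 10), (11, 11), (5, 20)]
centroids = kmeans(points, k=2)
suspects = farthest_points(points, centroids, n_outliers=1)
```

In this toy run the isolated point is absorbed into the nearer cluster and pulls its centroid away from the dense group, which hints at why sparse points are hard for plain k-means to handle.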

CHEN Wang-hu, TIAN Zhen, ZHANG Li-zhi, LIANG Xiao-yan, GAO Ya-qiong. An interpolation based outlier detection method of sparse high-dimensional data [J]. Computer Engineering & Science.

