• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2020, Vol. 42 ›› Issue (08): 1440-1447.


A semi-supervised outlier detection model based on autoencoder and integrated learning

XIA Huo-song, SUN Ze-lin

  1. (School of Management, Wuhan Textile University, Wuhan 430073, China)

  • Received: 2019-12-31 Revised: 2020-03-11 Accepted: 2020-08-25 Online: 2020-08-25 Published: 2020-08-29

Abstract: Outlier detection is an important data mining method, used both to preprocess data and to mine information from heterogeneous data. In recent years, the curse of dimensionality has made outliers in high-dimensional data very difficult to detect. To address this problem, a semi-supervised outlier detection model based on an autoencoder and integrated learning is proposed. First, an autoencoder is used to reduce the dimensionality of the data and to increase the outlier degree of the outlying points. Second, because the iForest, LOF, and k-means algorithms are sensitive to different types of outliers, they are fused within the AdaBoost boosting framework to improve the accuracy of outlier detection. The results show that, compared with current mainstream outlier detection methods, the proposed model significantly improves detection accuracy.
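The fusion step can be illustrated with a minimal sketch of AdaBoost-style weighting over a fixed pool of detectors. The abstract does not give the details of the fusion, so this is an assumption, not the authors' implementation: the functions `adaboost_fuse` and `fused_predict` are hypothetical names, and the votes attributed to iForest, LOF, and k-means are hard-coded toy values rather than outputs of real detectors or of an autoencoder. Labels use +1 for outlier and -1 for inlier, and the small labeled set stands in for the semi-supervised signal.

```python
import math

def adaboost_fuse(detector_preds, labels):
    """Weight a fixed pool of detectors AdaBoost-style (illustrative sketch).

    detector_preds[j][i] is detector j's vote (+1 outlier, -1 inlier)
    on labeled sample i; labels[i] is the known label of that sample.
    Returns a list of (detector_index, alpha) pairs.
    """
    n = len(labels)
    weights = [1.0 / n] * n                 # uniform sample weights to start
    remaining = set(range(len(detector_preds)))
    ensemble = []
    while remaining:
        # Weighted error of every detector that has not been used yet.
        errors = {
            j: sum(w for w, p, y in zip(weights, detector_preds[j], labels) if p != y)
            for j in remaining
        }
        best = min(sorted(errors), key=errors.get)
        err = min(max(errors[best], 1e-10), 1 - 1e-10)  # keep the log finite
        if err >= 0.5:                      # no better than chance: stop
            break
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((best, alpha))
        # Raise the weight of samples this detector got wrong, then renormalize.
        weights = [
            w * math.exp(-alpha * y * p)
            for w, p, y in zip(weights, detector_preds[best], labels)
        ]
        total = sum(weights)
        weights = [w / total for w in weights]
        remaining.remove(best)
    return ensemble

def fused_predict(ensemble, votes):
    """Combine one sample's detector votes using the learned alphas."""
    score = sum(alpha * votes[j] for j, alpha in ensemble)
    return 1 if score > 0 else -1

# Toy labeled set: two outliers followed by six inliers (hypothetical data).
labels = [1, 1, -1, -1, -1, -1, -1, -1]
detector_preds = [
    [1, -1, -1, -1, -1, -1, -1, -1],   # stand-in for iForest: misses sample 1
    [-1, 1, -1, -1, -1, -1, -1, -1],   # stand-in for LOF: misses sample 0
    [1, 1, 1, -1, -1, -1, -1, -1],     # stand-in for k-means: false alarm on sample 2
]
ensemble = adaboost_fuse(detector_preds, labels)
fused = [
    fused_predict(ensemble, [preds[i] for preds in detector_preds])
    for i in range(len(labels))
]
print(ensemble)
print(fused)
```

In this toy case each detector errs on a different sample, which mirrors the abstract's point that the three algorithms are sensitive to different outlier types. In a fuller pipeline the votes would come from detectors run on the autoencoder's low-dimensional representation.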

Key words: outlier detection, boosting framework, semi-supervised, autoencoder