• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

An abnormal behavior detection method in Hadoop cluster

CAI Wu-yue1,WANG Ke2,HAO Yu-jie2,DUAN Xiao-ran2   

  1. (1.National Education Examinations Authority,Beijing 100084;
    2.School of Computer Science and Engineering,University of Electronic Science and Technology of China,Chengdu 611731,China)
     
  • Received:2017-07-03 Revised:2017-09-25 Online:2017-12-25 Published:2017-12-25

Abstract:

With the development of distributed computing technology, Hadoop, as a typical representative in the field of massive data processing, is vulnerable to hidden security threats, such as data breaches, due to weak security mechanism and lack of user activity monitoring. By combining with the characteristics of the principal component analysis, we perform parallel process through MapReduce to overcome the disadvantage of principal component analysis and improve the training efficiency. We propose an abnormal behavior detection method in Hadoop cluster, namely we compare the current user behavior patterns with historical behavior patterns to see if they match, which is taken as a metric for anomaly behavior detection. Experimental results indicate that our method can detect users' anomaly behavior effectively.

Key words: Hadoop cluster, principal component analysis, anomaly detection, MapReduce, behavior pattern