• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2012, Vol. 34 ›› Issue (3): 62-66.

• Articles •

Research on the SLIQ Parallel Algorithm Based on Cloud Computing

YANG Changchun,SHEN Xiaoling   

  1. (School of Information Science and Engineering, Changzhou University, Changzhou 213164, China)
  • Received: 2011-09-24 Revised: 2011-11-21 Online: 2012-03-26 Published: 2012-03-25

Abstract:

Cloud computing provides efficient solutions for storing and analyzing massive data. Studying data mining algorithms based on cloud computing is important from both the theoretical and the practical viewpoint. The SLIQ algorithm finds the best split point by calculating the scalability indexes one by one. As the amount of data increases, this method becomes time-consuming and the efficiency of the algorithm becomes very low. This paper studies algorithms for mining decision rules in a cloud computing environment, focusing on the MapReduce programming model. On this basis, an improved SLIQ algorithm and its procedure on MapReduce are designed in order to realize parallel data mining.

Key words: cloud computing; SLIQ; MapReduce; data mining
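
The split search summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' improved algorithm: it assumes the Gini index as the split criterion (the abstract does not name the index), a single numeric attribute, and an in-process imitation of the map and reduce steps. The function names `map_partition`, `merge`, and `best_split` are hypothetical.

```python
from collections import Counter
from functools import reduce


def gini(counts, total):
    """Gini impurity of a class histogram (assumed split criterion)."""
    if total == 0:
        return 0.0
    return 1.0 - sum((c / total) ** 2 for c in counts.values())


def map_partition(records):
    """Map step: count (attribute value, class label) pairs in one partition."""
    return Counter(records)


def merge(a, b):
    """Reduce step: merge two partial histograms."""
    return a + b


def best_split(histogram):
    """Evaluate every candidate threshold one by one, in ascending value order.

    Returns (threshold, weighted Gini), or (None, inf) if no split exists.
    """
    by_value = {}
    for (value, label), n in histogram.items():
        by_value.setdefault(value, Counter())[label] += n
    values = sorted(by_value)

    right = Counter()
    for class_counts in by_value.values():
        right.update(class_counts)
    n_total = sum(right.values())

    left, n_left = Counter(), 0
    best_threshold, best_score = None, float("inf")
    for lo, hi in zip(values, values[1:]):
        # Move all records with value `lo` from the right side to the left.
        left.update(by_value[lo])
        right.subtract(by_value[lo])
        n_left += sum(by_value[lo].values())
        n_right = n_total - n_left
        score = (n_left * gini(left, n_left)
                 + n_right * gini(right, n_right)) / n_total
        if score < best_score:
            best_threshold, best_score = (lo + hi) / 2, score
    return best_threshold, best_score


# Two partitions stand in for data blocks processed by separate mappers.
partitions = [
    [(1.0, "a"), (2.0, "a")],
    [(3.0, "b"), (4.0, "b")],
]
histogram = reduce(merge, map(map_partition, partitions), Counter())
print(best_split(histogram))  # (2.5, 0.0)
```

Only the counting is distributed in this sketch; the threshold scan still runs sequentially over the merged histogram, which is the step whose cost grows with the number of distinct attribute values.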