• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

PFPonCanTree:A parallel frequent patterns
incremental mining algorithm based on MapReduce

XIAO Wen,HU Juan,ZHOU Xiao-feng   

  1. (Wentian College,Hohai University,Maanshan 243000,China)
  • Received:2016-12-08 Revised:2017-02-15 Online:2018-01-25 Published:2018-01-25

Abstract:

Frequent pattern mining is one of the most important data mining tasks. Traditional frequent pattern mining algorithmsare executed in a "batch" mode, that is,all the data are mined in one time, so they cannotmeet the needs of the ever-growing bigdata mining. MapReduce is a popular parallel computing modeland has been widely used in the field of parallel data mining. In this paper, we migrate the traditional frequent pattern incremental mining algorithm CanTree to the MapReduce computing model,achieving a parallel frequent pattern incremental miningalgorithm. The experimental results show that the proposed algorithm achievesbetterload balancing and improvesthe execution efficiency significantly.
 

Key words: