• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2010, Vol. 32 ›› Issue (10): 108-111.doi: 10.3969/j.issn.1007130X.2010.

• 论文 • Previous Articles     Next Articles

Research on the Target Frequent Patterns Mining Algorithms

LIANG Bizhen1,LU Yueran1,GENG Lizhong2,QIN Liangxi3   

  1. (1.Department of Mathematics and Computer Information Engineering,Baise University,Baise 533000;2.School of Mechanical Engineering,Tsinghua University,Beijing 100084;3.School of Computer Science and Electronic Information,Nanning 530004,China)
  • Received:2010-03-17 Revised:2010-06-19 Online:2010-09-29 Published:2010-09-29

Abstract:

General frequent patterns mining algorithms usually produce large sets of frequent patterns, in which there are many nontarget patterns that users aren’t interested in. To exclude the nontarget patterns , users have to do the second mining. Although TFPgrowth can produce all maximum target frequent patterns , the second minning is still essential to getting the target frequent patterns from them. If we restrict the producing of the nontarget frequent patterns early in the mining process, it would improve the efficiency of the algorithm. Based on the TFPgrowth and the SFPgrowth, a target frequent patterns mining algorithm  named STFPgrowth is proposed in this paper,its efficiency can be promoted by sorting TFPtree, adopting different ways to build sub trees and sift target frequent patterns in different cases of tree nodes. STFPgrowth mines all the target frequent patterns which satisfy users’ requirements, and users need not do the second minning . The experiments show that STFPgrowth is more efficient than the TFPgrowth, and outperforms Apriori and Eclat obviously.

Key words: frequent pattern;target frequent pattern;maximum target frequent pattern;mining algorithm