• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2014, Vol. 36 ›› Issue (05): 963-970.

• 论文 • Previous Articles     Next Articles

An improved algorithm for mining maximal
frequent itemsets over data streams    

HU Jian,WU Maomao   

  1. (Institute of Information Engineering,Jiangxi University of Science and Technology,Ganzhou 341000,China)
  • Received:2012-12-03 Revised:2013-04-03 Online:2014-05-25 Published:2014-05-25

Abstract:

Based on the algorithm of DSMMFI, an improved algorithm, named DSMMFIDS (Dictionary Sequence Mining Maximal Frequent Itemsets over Data Streams), is proposed. Firstly, it stores transaction data into DSFIlist in alphabetical order. Secondly, the data are stored sequentially into the tree similar to the summary data structure. Thirdly, nonfrequent items in the tree and DSFIlist are removed, and the transaction items with the maximum count of window attenuation supports are deleted. Finally, the strategy (topdown and bottomup twoway search) is used to mine maximal frequent itemsets over data streams, and case analysis and experiments prove that the algorithm DSMMFIDS has better performance than the algorithm DSMMFI.
    

Key words: data mining;data stream;landmark windows;maximal frequent itemsets;window attenuation support count