• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

A vertical mining algorithm for high average-utility
 itemsets based on optimal upper bound

PU Rong,SHAO Jian-fei,HU Chang-li,QU Kun   

  1. (Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming 650500,China)
  • Received:2019-10-21 Revised:2019-12-11 Online:2020-05-25 Published:2020-05-25

Abstract:

Mining high average-utility itemsets is one of the hotspots in the current research. Aiming at the problem that the high average-utility itemsets mining algorithm generates a large number of meaningless candidate itemsets, which results in high memory consumption and long running time, the dMHAUI algorithm is proposed. Firstly, the algorithm defines the integration matrix Q, and proposes four compact average-utility upper bounds based on vertical database representation and three effective pru- ning strategies. Secondly, the information needed for high average-utility itemsets mining is stored in the IDUL structure tree, and the improved diffset technique is used to quickly calculate the average- utility and upper bound of itemsets. Finally, the high average-utility itemsets are obtained by recursively calling the search algorithm. Simulation results show that the dMHAUI function has better performance than the EHAUPM algorithm and the MHAI algorithm in terms of running time, join operation number and scalability.
 

Key words: pattern mining, high average-utility itemsets mining, dMHAUI algorithm, upper bound, utility mining