• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (09): 1702-1710.

• Artificial Intelligence and Data Mining •

A high utility quantitative frequent pattern mining algorithm based on related degree

WANG Hui¹, LI Yan¹, DING Ding²,³, WU Kun²,³, HUANG Ya-ping²,³

  (1. Institute of Computing Technologies, China Academy of Railway Sciences, Beijing 100081;
    2. School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044;
    3. Beijing Key Lab of Traffic Data Analysis and Mining, Beijing 100044, China)
  • Received: 2023-03-29 Revised: 2023-11-29 Accepted: 2024-09-25 Online: 2024-09-25 Published: 2024-09-23

Abstract: High utility frequent pattern mining uses item importance information to extract the more significant frequent patterns from data. Building on this, high utility quantitative frequent pattern mining further explores the quantitative relationships between data items, and has become a popular research topic in data mining. To improve the performance and practicality of such algorithms, RHUQI-Miner is proposed. First, the concept of related degree is introduced, an item related degree structure is built from it, and a pruning strategy is given that keeps frequent patterns with higher related degree, reducing redundant and invalid frequent patterns. Second, a fixed pattern length strategy adjusts the utility information of items during mining, so that the algorithm can control the length of the output frequent patterns according to the actual data, further improving performance and practicality. Experimental results show that RHUQI-Miner effectively reduces time and memory consumption during mining, and that its results can provide data support for differentiated and precise maintenance strategies.
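
To make the two ideas in the abstract concrete, the following is a minimal sketch, not RHUQI-Miner itself. The abstract does not define related degree, so the sketch assumes a Jaccard-style co-occurrence ratio between two items; it also models the fixed pattern length strategy as a simple cap on pattern size and ignores quantity ranges. All names (`related_degree`, `utility`, `mine`), the toy database, and the thresholds are hypothetical.

```python
from itertools import combinations

# Toy quantitative transaction database: item -> quantity (illustrative data).
transactions = [
    {"a": 2, "b": 1, "c": 3},
    {"a": 1, "b": 2},
    {"b": 1, "c": 1},
    {"a": 3, "c": 2},
    {"a": 1, "b": 1},
]
# Per-unit importance (external utility) of each item; illustrative values.
unit_utility = {"a": 5, "b": 2, "c": 1}


def related_degree(x, y, db):
    """ASSUMED definition: share of transactions containing x or y that contain both."""
    both = sum(1 for t in db if x in t and y in t)
    either = sum(1 for t in db if x in t or y in t)
    return both / either if either else 0.0


def utility(itemset, db):
    """Sum of quantity * unit utility over transactions containing every item."""
    return sum(
        sum(t[i] * unit_utility[i] for i in itemset)
        for t in db
        if all(i in t for i in itemset)
    )


def mine(db, min_util, min_related, max_len):
    """Brute-force enumeration with related-degree pruning and a length cap."""
    items = sorted({i for t in db for i in t})
    found = {}
    for k in range(1, max_len + 1):  # length cap stands in for fixed pattern length
        for cand in combinations(items, k):
            # Related pruning: drop candidates containing a weakly related pair.
            if any(related_degree(x, y, db) < min_related
                   for x, y in combinations(cand, 2)):
                continue
            u = utility(cand, db)
            if u >= min_util:
                found[cand] = u
    return found


result = mine(transactions, min_util=20, min_related=0.5, max_len=2)
print(result)  # {('a',): 35, ('a', 'b'): 28}
```

In this toy run the pair (a, c) has utility 30, above the threshold, but is pruned because its assumed related degree is 0.4. That is the kind of high-utility yet weakly related pattern the pruning step is meant to discard. A real miner would avoid exhaustive enumeration; the brute-force loop is used here only to keep the sketch short.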

Key words: high utility, quantitative, frequent pattern mining, related pruning, fixed pattern length