• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2020, Vol. 42 ›› Issue (12): 2259-2264.

• 人工智能与数据挖掘 • 上一篇    下一篇

基于AO算法的数据流频繁项集挖掘

文凯1,2,3,耿小海1,2,3,朱璐伟1,2,3,许萌萌1,2,3   

  1. (1.重庆邮电大学通信与信息工程学院,重庆 400065;2.重庆邮电大学通信新技术应用研究中心,重庆 400065;
    3.重庆信科设计有限公司,重庆 401121)


  • 收稿日期:2019-11-21 修回日期:2020-03-20 接受日期:2020-12-25 出版日期:2020-12-25 发布日期:2021-01-05

Frequent itemsets mining for data stream based on AO algorithm

WEN  Kai1,2,3,GENG Xiao-hai1,2,3,ZHU Lu-wei1,2,3,XU Meng-meng1,2,3   

  1. (1.School of Communication and Information Engineering,
    Chongqing University of Posts and Telecommunications,Chongqing 400065;

    2.Research Center of New Telecommunication Technology,
    Chongqing University of Posts and Telecommunications,Chongqing 400065;

    3.Chongqing Information Technology Designing Co.,Ltd.,Chongqing 401121,China)

  • Received:2019-11-21 Revised:2020-03-20 Accepted:2020-12-25 Online:2020-12-25 Published:2021-01-05

摘要: 针对传统数据流频繁项集挖掘算法在支持度更新、窗口更新方式、频繁k-项集挖掘等方面存在的一系列问题,造成空间和时间效率不高,改进研究了一种高效挖掘数据流频繁项集的AO算法。采用滑动窗口思想,对数据流分块挖掘;在满窗口有新数据流入时,采用取余插入完成数据更新;挖掘频繁k-项集采用And Operation求解支持度,并在挖掘过程结合超集检测,极大地提高了挖掘效率。 实验结果表明,该算法在时间和空间效率上均有一定的优越性。


关键词: 数据流, 超集检测, 频繁项集, 与运算

Abstract: In view of a series of problems existing in support update, window update mode and frequent k-itemset mining of traditional frequent itemset mining algorithm in data flow, which results in low efficiency of space and time, an efficient AO algorithm for mining frequent itemsets in data streams is improved. The algorithm uses the idea of sliding window to mine the data stream in blocks; when there is new data flowing in the full window, the residual insertion is used to update the data; and operation is used to solve the support degree of frequent k-itemsets, and the superset detection is combined in the mining process, which greatly improves the mining efficiency. The experimental results show that the algorithm has good superiority in both time and space efficiency.


Key words: data stream, superset checking, frequent itemsets, And Operation