• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2007, Vol. 29 ›› Issue (1): 83-85.

• 论文 • 上一篇    下一篇

关联规则挖掘算法及其应用研究

刘星沙[1] 谭利球[1] 熊拥军[2]   

  • 出版日期:2007-01-01 发布日期:2010-05-30

  • Online:2007-01-01 Published:2010-05-30

摘要:

本文提出了一种适用于数字资源访问日志数据库的关联规则挖掘改进算法,它采用事务压缩和项目压缩相结合,而候选项目集及支持度计算是在每条事务压缩后通过联接产生,候选项目集采用关键字识别,省去了Apriori算法中的剪枝和字符串模式匹配步骤,可快速得到完整的频繁模式集。该算法特别适用于数字图书馆海量数字资源的个性化信息需 求获取分析。

关键词: 关联规则 数字图书馆 个性化服务 Apriori算法 事务压缩 项目压缩

Abstract:

This paper proposes an enhanced algorithm which associates the Apriori algorithm with the transaction reduction and item reduction techniques.The cand idate set generation and the support calculation of each itemset is created after each transaction is compressed and connected. The candidate set adopts   the key word identification.The process of pruning and string pattern matching is removed from the Apriori algorithm,and it is especially suitable for   the personal services of large digital libraries to gain personal information requirements.

Key words: association rule;digital library;personal service;Apriori algorithm;transaction reduction ;item reduction