• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2011, Vol. 33 ›› Issue (9): 105-108.

• 论文 • 上一篇    下一篇

一种时态关联规则挖掘算法

李广原1,2,刘英华1,3,刘永彬1   

  1. (1.北京科技大学计算机与通信工程学院,北京100083;2.广西师范学院计算机与信息工程学院,广西 南宁 530023;3.中国青年政治学院,北京100089)
  • 收稿日期:2011-05-20 修回日期:2011-07-26 出版日期:2011-09-25 发布日期:2011-09-25
  • 作者简介:李广原(1969),男,广西上林人,博士,副教授,研究方向为数据挖掘。
  • 基金资助:

    国家自然科学基金资助项目(60875029)

An Efficient  Mining Algorithm of Temporal Association Rules

LI Guangyuan1,2,LIU Yinghua1,3,LIU Yongbin1   

  1. (1.School of Computer and Communication Engineering,University of Science and  Technology Beijing,Beijing 100083;2.〖JP2〗School of Computer and Information Engineering,Guangxi Teachers’ Education University,Nanning 〖JP〗530023;3.China Youth University for Political Sciences,Beijing 100089,China)
  • Received:2011-05-20 Revised:2011-07-26 Online:2011-09-25 Published:2011-09-25

摘要:

时态关联规则挖掘是针对在一段时间范围内的关联挖掘,在现实中有较多的应用。现有的大多数时态关联挖掘算法或者需要多次扫描数据库,或者没有考虑各个项在数据集上出现或结束时间上的不同,因而挖掘性能受到较大的制约。为此,本文提出一种增量式的面向具有不同时间出现与结束的项的时态关联规则挖掘算法。为减少存储方面的开销,只需保存已挖掘过的历史数据集中的频繁1项集。为了减少数据的扫描量,通过有效的剪枝策略,有选择性地扫描相关事务项,至多只需扫描一次完整的数据库。实验证明,该算法具有较好的挖掘性能。

关键词: 数据挖掘, 关联规则, 时态挖掘

Abstract:

Temporal association rules mining(TARM) is widely applied in many applications, it aims at mining rules within a certain interval of time. Most of the exiting algorithms for TARM need to scan several times of the database, or do not consider the different exhibition period of an individual item, so the efficiency of these algorithms are not enough. In this paper, we present a novel approach to investigating TARM, the proposed algorithm works in an incremental way which takes the different exhibition period of individual item into account, in order to reduce the cost of storage, and only the frequent 1item is stored, and efficient pruning techniques are adopted to reduce the scan times of the database, and it only needs at most one time to scan the whole data set to obtain all the temporal association rules. The  experimental results show that the proposed algorithm is efficient.

Key words: data mining;association rules;temporal mining