• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2007, Vol. 29 ›› Issue (2): 93-96.

• 论文 • 上一篇    下一篇

基于Galois联络的最小非冗余关联规则挖掘

魏长华[1] 魏敏[2] 杨伟传[1]   

  • 出版日期:2007-02-01 发布日期:2010-06-01

  • Online:2007-02-01 Published:2010-06-01

摘要:

关联规则挖掘是NP难题,关键是如何约简频繁项集。本文以Galois联络为理论基础,应用Galois联络的闭包运算及其性质定义数据库中的频繁项和封闭频繁项,提出了挖掘关联 规则生成子、精确关联规则生成基和近似关联规则本征基的概念,并由此构造最小非冗余精确关联规则和近似关联规则挖掘的MNRM算法。该算法与Apriori算法相比较,挖掘的 关联规则是最小非冗余的,降低了计算复杂度,而且规则具有不丢失任何信息、最小前件和最大后件以及对用户最实用和最相关等优点。

关键词: Galois联络 关联规则 数据挖掘 MNRM算法

Abstract:

Since the association rule's mining is NP-hard,the key is how to reduce frequent itemsets.In the paper,the generating basis for exact association rul  es and the proper basis for approximate association rules are addressed based on the Galois connection,and the Galois closure properties.This paper pres ents a new algorithm called MNRM to discover minimal non-redundant exact and approximate rules.Compared with the Apriori algorithm,these rules are the m ost non-redundant,the computing complexity is reduced.And these rules have many strongpoints,such as minimal antecedents and maximal consequents,the mos t relevant association rules,limiting the number of rules produced without information loss,the most user-useful and user-relevant.

Key words: (Galois connection, association rules, data mining, MNRM algorithm)