• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2014, Vol. 36 ›› Issue (07): 1398-1403.

• 论文 • 上一篇    下一篇

基于改进关联分类的两次学习方法

黄再祥,周忠眉,何田中   

  1. (漳州师范学院计算机科学与工程系,福建 漳州 363000)
  • 收稿日期:2012-10-31 修回日期:2013-04-03 出版日期:2014-07-25 发布日期:2014-07-25
  • 基金资助:

    国家自然科学基金资助项目(61170129);福建省自然科学基金资助项目(2013J01259)

A double learning method based on
the improved associative classification          

HUANG Zaixiang,ZHOU Zhongmei,HE Tianzhong   

  1. (Department of Computer Science and Engineering,Zhangzhou Normal University,Zhangzhou 363000,China)
  • Received:2012-10-31 Revised:2013-04-03 Online:2014-07-25 Published:2014-07-25

摘要:

关联分类通常产生大量的分类规则,导致在分类新实例时经常产生规则冲突问题。针对这种规则冲突问题,提出了一种基于改进关联分类的两次学习框架。利用频繁且互关联的项集产生分类规则改进关联分类算法,有效减少了规则数。应用改进的关联分类算法产生的一级规则一次性分离出训练集中规则冲突的所有实例。然后,在冲突实例上应用改进的关联分类算法进行第二次学习得到二级规则。分类新实例时,首先利用第一级规则进行分类。如果出现规则冲突,则利用第二级规则分类该实例。实验结果表明,基于改进关联分类的两次学习方法降低了规则冲突比率,并且显著提高了分类准确率。

关键词: 数据挖掘, 关联分类, 两次学习, 规则冲突

Abstract:

Associative classification usually generates numerous rules, resulting in rule conflicts in stage of classification. To address this problem, a double learning method based on the improved associative classification is proposed. The improved associative classification reduces the number of rules significantly by discovering the frequent and mutual associated itemsets. All training conflict instances in training set are separated by applying the first level rules generated by the improved associative classification. Then, the second level rule set is induced by applying the improved associative classification on the conflict instances. When classifying a new instance, the first level rule set is applied. If the rules are not consistent with the instance, the second rules set is used to classify this instance. The experimental results show that the double learning method based on the improved associative classification can improve the classification accuracy effectively.

Key words: data mining;associative classification;double learning;conflict rules