• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学

• 人工智能与数据挖掘 • 上一篇    

基于各类支持度阈值独立挖掘的关联改进算法

周忠眉1,2,李家辉1,2   

  1. (1.闽南师范大学计算机学院,福建 漳州 363000;2.数据科学与智能应用福建省高等学校重点实验室,福建 漳州 363000)
  • 收稿日期:2019-04-12 修回日期:2019-06-28 出版日期:2019-11-25 发布日期:2019-11-25
  • 基金资助:

    福建省自然科学基金(2018J01545)

An associative classification algorithm based on various
class-support thresholds and independent mining rules

ZHOU Zhong-mei1,2 ,LI Jia-hui1,2   

  1. (1.School of Computer Science,Minnan Normal University,Zhangzhou 363000;
    2.Key Laboratory of Data Science and Intelligence Application,Fujian Province University,Zhangzhou 363000,China)
  • Received:2019-04-12 Revised:2019-06-28 Online:2019-11-25 Published:2019-11-25

摘要:

关联分类及较多的改进算法很难同时既具有较高的整体准确率又有较好的小类分类性能。针对此问题,提出了一种基于类支持度阈值独立挖掘的关联分类改进算法—ACCS。ACCS算法的主要特点是:(1)根据训练集中各类数量大小给出每个类类支持度阈值的设定方法,并基于各类的类支持度阈值独立挖掘该类的关联分类规则,尽量使小类生成更多高置信度的规则;(2) 采用类支持度对置信度相同的规则排序,提高小类规则的优先级;(3)用综合考虑置信度和提升度的新的规则度量预测未知实例。在多个数据集上的实验结果表明,相比多种关联分类改进算法,ACCS算法有更高的整体分类准确率,且在不平衡数据上也能取得较好的小类分类性能。

 

关键词: 关联分类, 类支持度阈值, 类支持度, 分类准确率

Abstract:

Associative classification algorithm and its existing improved algorithms cannot achieve both high overall accuracy and good minority class classification. To solve this problem, we propose an improved associative classification algorithm based on various class-support thresholds (ACCS)independent mining rules. Its main featuresare: (1) ACCS sets the support threshold of each class according to the class size in the training data, and extracts the associative classification rule of each class separately based on the class-support threshold in order to get higher confidence rules of minority classes; (2) ACCS uses the class-support threshold to rank the rules with the same confidence for increasing the priority of the minority classes; (3) ACCS combines confidence and lift degrees together to predict unknown instances. The experimental results on multiple datasets show that ACCS can achieve higher overall classification accuracy than the existing associative algorithms, and can also get good minority class classification performance in imbalanced data.
 
 
 

Key words: associative classification, class support threshold, class support, classification accuracy