基于支持度和增比率的改进关联分类算法

J4 ›› 2016, Vol. 38 ›› Issue (02): 370-375.

基于支持度和增比率的改进关联分类算法

王卫平,周忠眉,郑艺峰

（闽南师范大学计算机学院,福建漳州 363000）

收稿日期:2015-04-29 修回日期:2015-07-03 出版日期:2016-02-25 发布日期:2016-02-25
基金资助:
国家自然科学基金（61170129）；闽南师范大学研究生课题基金（YJS201434）

An improved associative classification approach
based on support and enhancement ratio

WANG Weiping,ZHOU Zhongmei,ZHENG Yifeng

(School of Computer,Minnan Normal University,Zhangzhou 363000,China)

Received:2015-04-29 Revised:2015-07-03 Online:2016-02-25 Published:2016-02-25

摘要/Abstract

摘要：

：关联分类是一项重要的分类技术,目前普遍采用基于支持度和置信度的关联分类模式。但是,用支持度度量项集的分类能力过于简单,且置信度不能度量项集与类的相关性,所以利用支持度和置信度容易产生质量不好的规则。提出改进的关联分类算法—ACSER。ACSER不仅考虑项集到本类的支持度,也考虑项集到补类的支持度。首先,提取频繁增比模式作为分类候选规则集；其次,利用置信度和增比率度量规则的强度,按照其强度进行排序和剪枝；最后,选择k条最优的规则进行预测。在16个UCI数据集上的实验结果表明,改进的分类算法ACSER与传统的分类算法相比有更高的分类准确率。

关键词: 数据挖掘, 关联分类, 频繁项集, 规则强度, 分类准确率

Abstract:

Associative classification is a significant data mining technique. The schema with support and confidence is commonly employed in the stateoftheart associative classification methods. Since the classification based on support is very simple and the classification based on confidence fails to measure the correlation between itemset and class, these methods tend to generate many inferior rules. In this paper, we propose an improved associative classification approach based on support and enhancement ratio (ACSER). The ACSER considers the support of itemset both in the target class and in its complement class. Firstly, frequent enhancement ratio patterns are extracted from training data as candidate classification rules. Secondly, the ACSER ranks and prunes the extracted rules according to the rule intensity measured by confidence and enhancement ratio. Finally, the ACSER selects the best k rules to predict unknown objects. Experiments on 16 UCI datasets show that the improved approach has higher accuracy than the traditional approaches based on support and confidence.

Key words: data mining;associative classification;frequent itemset;rule intensity;classification accuracy

王卫平,周忠眉,郑艺峰. 基于支持度和增比率的改进关联分类算法[J]. J4, 2016, 38(02): 370-375.

WANG Weiping,ZHOU Zhongmei,ZHENG Yifeng. An improved associative classification approach
based on support and enhancement ratio [J]. J4, 2016, 38(02): 370-375.

编辑推荐

Metrics

阅读次数

全文

167

HTML			PDF

最新录用	在线预览	正式出版	最新录用	在线预览	正式出版
0	0	0	0	0	167

来源	本网站	其他网站

次数	155	12
比例	93%	7%

摘要

最新录用	在线预览	正式出版

0	0	86

	来源	本网站

	次数	86
	比例	100%

[1]	赵琰, 马慧芳, 王文涛, 童海斌, 贺相春. 可靠响应表示增强的知识追踪方法[J]. 计算机工程与科学, 2024, 46(03): 535-544.
[2]	雷轩, 程光, 张玉健, 郭靓, 张付存. 基于电力网络态势感知平台的告警信息关联分析[J]. 计算机工程与科学, 2023, 45(07): 1197-1208.
[3]	王晨宇, 温浩珉, 郭晟楠, 林友芳, 万怀宇, . 面向快递员揽收到达时间预测的多任务深度时空网络[J]. 计算机工程与科学, 2023, 45(01): 136-144.
[4]	程小刚, 郭韧, 周长利, . 基于理性密码学的分布式隐私保护数据挖掘框架[J]. 计算机工程与科学, 2022, 44(10): 1781-1787.
[5]	王文涛, 马慧芳, 舒跃育, 贺相春. 基于上下文表示的知识追踪方法[J]. 计算机工程与科学, 2022, 44(09): 1693-1701.
[6]	文武, 万玉辉, 文志云, . 基于正余弦算法的文本特征选择[J]. 计算机工程与科学, 2022, 44(08): 1467-1473.
[7]	刘云, 肖添. 网络日志数据中条件因果挖掘算法的优化研究[J]. 计算机工程与科学, 2021, 43(09): 1584-1590.
[8]	文凯, 许萌萌, 张许红, . 基于列表结构的加权可擦除项集挖掘算法[J]. 计算机工程与科学, 2021, 43(09): 1676-1683.
[9]	熊中敏, 汪博, 陶然, 郑宗生, 陈明, . 一种基于主属性判定的关联规则挖掘约简算法[J]. 计算机工程与科学, 2021, 43(04): 738-745.
[10]	文凯, 耿小海, 朱璐伟, 许萌萌, . 基于AO算法的数据流频繁项集挖掘[J]. 计算机工程与科学, 2020, 42(12): 2259-2264.
[11]	藏润强, 左美云, 郭鑫鑫. 基于Doc2Vec和BiLSTM的老年患者疾病预测研究[J]. 计算机工程与科学, 2020, 42(12): 2273-2279.
[12]	何望1,2，林果园1,2. 基于FP-Growth改进算法的云服务器故障数据分析[J]. 计算机工程与科学, 2020, 42(05): 770-775.
[13]	谭胜昔，贾金萍，赵斌，吉根林. 动态空间网络中的黑洞模式挖掘算法[J]. 计算机工程与科学, 2020, 42(02): 325-333.
[14]	廖纪勇，吴晟，刘爱莲. 基于布尔矩阵约简的Apriori算法改进研究[J]. 计算机工程与科学, 2019, 41(12): 2231-2238.
[15]	周忠眉1,2,李家辉1,2. 基于各类支持度阈值独立挖掘的关联改进算法[J]. 计算机工程与科学, 2019, 41(11): 2088-2094.