计算机工程与科学
• 论文 • 上一篇 下一篇
赵亮1,刘建辉2,张昭昭2
收稿日期:
修回日期:
出版日期:
发布日期:
基金资助:
国家自然基金(61440059);辽宁省自然基金(LS2013129)
ZHAO Liang1,LIU Jianhui2,ZHANG Zhaozhao2
Received:
Revised:
Online:
Published:
摘要:
K-modes算法中原有的分类变量间距离度量方法无法体现属性值之间差异,对此提出了一种基于朴素贝叶斯分类器中间运算结果的距离度量。该度量构建代表分类变量的特征向量并计算向量间的欧氏距离作为变量间的距离。将提出的距离度量代入Kmodes聚类算法并在多个UCI公共数据集上与其他度量方法进行比较,实验结果表明该距离度量更加有效。
关键词: K-modes聚类算法, 分类变量, 朴素贝叶斯分类器, 距离度量
Abstract:
The original distance measure of Kmodes clustering algorithm cannot reflect the difference between categorical variables. To overcome this drawback, we propose a new distance measure algorithm based on the intermediate result of Nave Bayes classifier. This algorithm constructs feature vectors to present categorical variables and uses the Euclidean distance of the feature vectors as distance between variables. We implement the Kmodes algorithm with the new derived measure and the experiments on extensive UCI data sets show that the proposal is more effective in comparison with other measure algorithms.
Key words: K-modes clustering algorithm, categorical variables, Nave Bayes classifier, distance measure
赵亮1,刘建辉2,张昭昭2. 基于贝叶斯距离的K-modes聚类算法[J]. 计算机工程与科学.
ZHAO Liang1,LIU Jianhui2,ZHANG Zhaozhao2. A K-modes clustering algorithm based on Bayes distance measure
0 / / 推荐
导出引用管理器 EndNote|Ris|BibTeX
链接本文: http://joces.nudt.edu.cn/CN/
http://joces.nudt.edu.cn/CN/Y2017/V39/I01/188