Computer Engineering & Science
Previous Articles Next Articles
ZHAO Liang1,LIU Jianhui2,ZHANG Zhaozhao2
Received:
Revised:
Online:
Published:
Abstract:
The original distance measure of Kmodes clustering algorithm cannot reflect the difference between categorical variables. To overcome this drawback, we propose a new distance measure algorithm based on the intermediate result of Nave Bayes classifier. This algorithm constructs feature vectors to present categorical variables and uses the Euclidean distance of the feature vectors as distance between variables. We implement the Kmodes algorithm with the new derived measure and the experiments on extensive UCI data sets show that the proposal is more effective in comparison with other measure algorithms.
Key words: K-modes clustering algorithm, categorical variables, Nave Bayes classifier, distance measure
ZHAO Liang1,LIU Jianhui2,ZHANG Zhaozhao2. A K-modes clustering algorithm based on Bayes distance measure
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2017/V39/I1/188