J4 ›› 2012, Vol. 34 ›› Issue (9): 135-142.
• 论文 • Previous Articles Next Articles
CHEN Xinquan
Received:
Revised:
Online:
Published:
Abstract:
In order to effectively preprocess some mixed data sets,this paper first gives some definitions and related properties,then presents a twophase clustering algorithm based on near neighbor connection.To improve the time efficiency of this algorithm,some improving ideas and techniques are described.Through the simulation experiments of some artificial data sets and UCI standard data sets,we can verify that this clustering algorithm can often obtain better clustering quality than the kmeans algorithm and the AP algorithm when facing to some data sets with apparent clusters.So we can say that this clustering algorithm has certain value. In the end,several research expectations are given to disinter and popularize this method.
Key words: mixed attributes;cluster feature;primary cluster;near neighbor graph
CHEN Xinquan. A TwoPhase Clustering Algorithm Based on Near Neighbor Connection for Mixed Data Set[J]. J4, 2012, 34(9): 135-142.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2012/V34/I9/135