• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2011, Vol. 33 ›› Issue (6): 138-143.

• 论文 • Previous Articles     Next Articles

An Improved SemiSupervised  K-Means Clustering Algorithm

YUAN Liyong,WANG Jiyi   

  1. (School of Information Science and Engineering,Zhejiang Normal University,Jinhua 321004,China)
  • Received:2010-07-15 Revised:2010-12-08 Online:2011-06-25 Published:2011-06-25

Abstract:

Semisupervised clustering employs a small amount of labeled data to aid unsupervised learning. For the poor ability of the clustering algorithm based on the K-means for nonspherical clusters problems, this paper presents a new idea that considers the influence of the labeled datapoints on the unlabeled datapoints in allocating category, puts forward a definition of gravitational influence degree between category and datapoint,and designs a semisupervised K-means clustering algorithm with a gravitational parameter.The experiments show that the new algorithm has better effect than the traditional semisupervised K-means clustering method in dealing with the distribution of nonspherical cluster clustering.

Key words: semisupervised clustering;constrainedKmeans;labeled data, gravitational influence;nonspherical cluster