• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (11): 2077-2083.

Previous Articles     Next Articles

A continuous K-means equivalent clustering model and its optimization algorithm

XIE Ting1,LIU Rui-hua2,WEI Zheng-yuan1   

  1. (1.School of Science,Chongqing University of Technology,Chongqing 400054;

    2.School of Artificial Intelligence,Chongqing University of Technology,Chongqing 400054,China)


  • Received:2020-07-10 Revised:2020-09-29 Accepted:2021-11-25 Online:2021-11-25 Published:2021-11-23

Abstract: As an unsupervised learning method, clustering is a significant research topic in data science. K-means is a partition-based clustering algorithm, which generally uses a heuristic algorithm to solve a discrete NP problem. In order to improve the application of K-means in big data problems, a continuous non-convex K-means equivalent clustering model is designed according to the properties of clustering matrix, and the  fast optimization algorithm of this equivalent clustering medel is given by ADMM framework. Numerical experiments show that the model and algorithm are accurate and efficient in big data clustering. In addition, the feature and equivalence of the model are discussed.

Key words: K-means, clustering, sparse, alternating direction method of multipliers