• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (10): 1838-1847.

Previous Articles     Next Articles

A self-adaptive clustering algorithm without neighborhood parameter k and cluster number c

ZHANG Bo-kai1,YANG De-gang1,2,FENG Ji1,2#br#

#br#
  

  1. (1.College of Computer and Information Science,Chongqing Normal University,Chongqing 401331;

    2.Chongqing Engineering Research Center of 

    Educational Big Data Intelligent Perception and Application,Chongqing 401331,China)

  • Received:2020-08-06 Revised:2020-11-23 Accepted:2021-10-25 Online:2021-10-25 Published:2021-10-22
  • About author:ZHANG Bo-kai ,born in 1996,MS candidate,CCF member(E9765G),his research interest includes data analysis.

Abstract: Traditional clustering methods often cannot avoid the selection of neighborhood parameters and the number of clusters. The optimal selection of these parameters in different shapes of data is hard to choose, and this choice is depending on prior knowledge. Aiming at the above parameter selection problem, this paper proposes a natural neighbors based border peeling clustering algorithm (NaN-BP), which can obtain satisfactory clustering results without setting the neighborhood parameters and the number of clusters. The core idea of the algorithm is to adaptively iterate to a logarithmic stable state and obtain neighborhood information according to the distribution characteristics of the data set, then mark and strip the boundary points according to the neighborhood information, and finally gather the core points as the center of the data cluster. Extensive comparative experiments is conducted on data sets of different scales and distributions, and satisfactory experimental results verify the adaptability and effectiveness of the algorithm.


Key words: clustering analysis, self-adaptive, natural neighbor, logarithmic steady state, core point