A clustering analysis algorithm
based on double genetic algorithm

Computer Engineering & Science

Previous Articles Next Articles

A clustering analysis algorithm

based on double genetic algorithm

WEN Jing，CAO Yan，ZHANG Lin，MU Xiang-wei

(College of Transportation Management，Dalian Maritime University，Dalian 116026，China)

Received:2015-12-17 Revised:2016-05-12 Online:2017-12-25 Published:2017-12-25

Abstract

Abstract:

There are two major factors that affect the k-means clustering effect: the number of clustering and the initial choice of the centroids. We put forward an improved k-means algorithm based on the double genetic algorithm, which uses the outer sub-genetic algorithm to control the number of clustering, and the inner sub-genetic algorithm to control the initial choice of cluster centroids, and utilizes the intra-class distance and inter-lass distance as well as the ratio between them to evaluate the clustering results. We therefore can get both the optimal number of clustering and the corresponding optimal initial cluster centroids by this improved k-means method. In addition, given the specificity of the inner and outer sub-generic algorithms, the improved k-means algorithm uses two different encoding strategies, and in order to preserve excellent individuals, it also uses the elite individuals reserved strategy. Experiments on the UCI data set verify the effectiveness of the improved k-means algorithm and it has a reference value for data mining.

Key words: double genetic, cluster analysis, k-means algorithm, layered coding, elitism preservation

WEN Jing，CAO Yan，ZHANG Lin，MU Xiang-wei.

A clustering analysis algorithm

based on double genetic algorithm

[J]. Computer Engineering & Science.

[1]	HU Xiao-yue, , WANG Qiang, Lv Fang-xu, XU Chao-long, ZHANG Jin. DSP design for 56 Gb/s high-speed SerDes receiver [J]. Computer Engineering & Science, 2024, 46(07): 1202-1209.
[2]	SHEN Guo-xin, JIANG Zhong-yun. A Canopy bisecting K-Means algorithm based on density and central index [J]. Computer Engineering & Science, 2022, 44(02): 372-380.
[3]	GAO Xing1,LIU Jian-fei1,HAO Lu-guo2,DONG Qi-qi1. A training set optimization and detection method based on YOLOv3 algorithm [J]. Computer Engineering & Science, 2020, 42(01): 103-109.
[4]	LIU Yun-peng. An empirical study of learning situation data analysis based on mobile cloud teaching platform: A case study of “dynamic website design” course [J]. Computer Engineering & Science, 2019, 41(增刊S1): 119-123.
[5]	LI Xin-jian，LIU Man-dan. Correlation measurement of campus wireless network users based on the shortest time distance [J]. Computer Engineering & Science, 2019, 41(10): 1755-1762.
[6]	DU Jia-xing,CHEN Ya-wei,ZHANG Jing. Distance rectification indoor localization based on cluster analysis optimization [J]. Computer Engineering & Science, 2018, 40(02): 246-254.
[7]	SHEN Weichao1,2，CAO Liqiang2，XIA Fang1,2. Large scale particle cluster identification and analysis [J]. J4, 2013, 35(11): 62-67.
[8]	CHEN Surong,ZHU Xiaohui. Research of K-means Algorithm by Fuzzy Logic [J]. J4, 2012, 34(12): 155-159.
[9]	. [J]. J4, 2006, 28(12): 74-76.

A clustering analysis algorithm

based on double genetic algorithm

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 9

Recommended Articles 0

Metrics

Comments