一种基于KD树子样的自动聚类方法

doi:10.3969/j.issn.1007130X.2011.

J4 ›› 2011, Vol. 33 ›› Issue (1): 166-170.doi: 10.3969/j.issn.1007130X.2011.

一种基于KD树子样的自动聚类方法

潘章明

（广东金融学院计算机科学与技术系，广东广州 510521）

收稿日期:2010-02-26 修回日期:2010-05-30 出版日期:2011-01-25 发布日期:2011-01-25
通讯作者: 潘章明 E-mail:panzhangming@163.com
作者简介:潘章明（1969），男，安徽芜湖人，硕士，讲师，研究方向为智能计算和模式识别。

An Automatic Clustering Method Using SubSampling for the KDTree

PAN Zhangming

(Department of Computer Science and Technology,Guangdong University of Finance,Guangzhou 510521,China)

Received:2010-02-26 Revised:2010-05-30 Online:2011-01-25 Published:2011-01-25

摘要/Abstract

摘要：

基于进化算法的自动聚类方法具有搜索目标函数全局最优和自动发现聚类数的优点，同时也存在时间代价过高的缺陷。本文提出一种基于KD树子样的自动聚类方法，该方法使用KD树对样本空间进行分割，并在各子空间中随机取样形成KD树子样，然后在子样中自动聚类，最后运用KMeans在整个样本集中优化子样中的聚类结果。本文方法能够有效避免随机子样分布有偏的缺陷，即使比例很小的子样也能获得较好的聚类效果。仿真结果表明，本文方法能够保证聚类效果没有明显下降的情况下，显著缩短进化算法自动聚类的时间。

关键词: KD树, 子样, 差分进化, 自动聚类

Abstract:

The evolution theory based automatic clustering method has advantages in finding the global optimum and the cluster number, but shows the lack of efficiency in machine time. An autoclustering method using the KDTree subsampling technique is proposed in this paper. The sample space is divided into subspaces using the KDTree method. In each subspace, the KDTree subsamples are produced by randomly sampling for later autoclustering. The KMeans method is used to optimize the cluster results of the subsamples. The method can effectively overcome the defect of biased distribution for random subsamples and give good cluster results even for small samples. The simulation results show that the method remarkably reduces the machine time for auto clustering without decreasing the clustering effect.

Key words: KDtree;subsample;differential evolution;automatic clustering

潘章明. 一种基于KD树子样的自动聚类方法[J]. J4, 2011, 33(1): 166-170.

PAN Zhangming. An Automatic Clustering Method Using SubSampling for the KDTree[J]. J4, 2011, 33(1): 166-170.

[1]	赵鑫博, 陆忠华. 面向深度行情因子挖掘的分布式训练关键技术研究[J]. 计算机工程与科学, 2024, 46(09): 1554-1565.
[2]	叶坤涛, 舒蕾蕾, 李文, 侯春菊. 基于差分进化策略的天牛须搜索算法及其应用[J]. 计算机工程与科学, 2023, 45(05): 920-930.
[3]	凌文通, 倪建军, 陈颜, 唐广翼. 基于改进鸽群优化算法的多无人机目标搜索[J]. 计算机工程与科学, 2022, 44(03): 531-535.
[4]	张明珠, 曹杰, 王斌. 基于精英集的多目标差分进化聚类算法[J]. 计算机工程与科学, 2021, 43(01): 170-179.
[5]	胡福年，董倩男. 多策略自适应变异的差分进化算法及其应用[J]. 计算机工程与科学, 2020, 42(06): 1076-1088.
[6]	覃远年，梁仲华. 基于混合粒子群算法的运动估计研究[J]. 计算机工程与科学, 2019, 41(04): 758-764.
[7]	马永杰，朱琳，田福泽. 基于精英种群策略的协同差分进化算法[J]. 计算机工程与科学, 2019, 41(02): 335-.
[8]	宋强1,刘亚萍2,刘珍兰1. 基于多代种群进化信息改进的差分进化算法研究[J]. 计算机工程与科学, 2018, 40(11): 2054-2059.
[9]	黄辉先，胡鹏飞. 基于共轭梯度法的反馈差分进化混合算法及其在弹簧设计中的应用[J]. 计算机工程与科学, 2018, 40(07): 1316-1322.
[10]	王林1，万小雨1,万建超2. 一种基于选择策略的差分混合蛙跳算法[J]. 计算机工程与科学, 2018, 40(01): 121-127.
[11]	朱林波，汪继文，邱剑锋，方柳平. 基于简化群优化算法和协方差矩阵学习的差分进化算法[J]. 计算机工程与科学, 2017, 39(11): 2122-2130.
[12]	徐曼舒，汪继文，邱剑锋，王心灵. 基于改进人工蜂群的模糊C-均值聚类算法[J]. J4, 2016, 38(06): 1238-1243.
[13]	王林1，彭璐1，夏德2，曾奕1. 自适应差分进化算法优化BP神经网络的时间序列预测[J]. J4, 2015, 37(12): 2270-2275.
[14]	张弛1，乐晓波1，周恺卿2，莫礼平3. 采用差分进化算法优化模糊Petri网参数[J]. J4, 2014, 36(06): 1095-1100.
[15]	拓守恒1,陶维天2. 一种求解高维多模态复杂问题的差分文化算法[J]. J4, 2013, 35(1): 142-148.

一种基于KD树子样的自动聚类方法

An Automatic Clustering Method Using SubSampling for the KDTree

PDF

可视化

摘要/Abstract

引用本文

使用本文

相关文章 15

编辑推荐

Metrics

本文评价