[1]Erdogmus H.Cloud computing:Does nirvana hide behind the nebula[J].IEEE Software,2009,26(2):46.
[2]Ghemawat S,Gobioff H,Leung S.The google file system[J].
SACM SIGOPS Operating Systems Review,2003,37(5):2943.
[3]Huang Wei,Liu Kun.Review on kernel algorithm of information pattern recognition[J].Journal of Modern Information,2014,34(3):168176.(in Chinese)
[4]He Qing, Li Ning, Luo Wenjuan. A survey of machine learning algorithms for big data[J].PR&AI,2014,27(4):328336.(in Chinese)
[5]Xu Rui,Wunsch D.Survey of clustering algorithms[J].IEEE Transactions on Neural Network,2005,16(3):654678.
[6]Pena J M,Lozano J A,Larranaga P.An empirical comparison of four initialization methods for the kmeans algorithm[J].Pattern Recognition Letter,1999,20(10):10271040.
[7]Arthur D, Vassilvitskii S. kmeans++:the advantages of careful seeding[C]∥Proc of SODA’07, 2007:10271035.
[8]Redmond S J,Heneghan C.A method for initializing the kmeans clustering algorithm using Kdtree[J].Pattern Recognition Letters,2007,28(8):965973.
[9]Chu C T,Kim S K,Lin Y A,et al.Mapreduce for machine learning on multicore[C]∥Proc of Advances in Neural Information Processing Systems,2006:281288.
[10]Zhao Weizhong, Ma Huifang, He Qing.Parallel kmeansclustering based on MapReduce[C]∥
Proc of the CloudCom’09,2009:674679.
[11]Miao Yuqing, Zhang Jinxing. New clustering algorithm based on Hadoop[J].Computer Science,2013,39(10):115118.(in Chinese)
[12]Zhang Shilei, Wu Zhuang. Clustering algorithm optimization research based on Hadoop[J].Computer Science,2014,41(4):269272.(in Chinese)
[13]Polo J,Carrera D, Becerra Y, et al.Performancedriven task coscheduling for MapReduce environments[C]∥Proc of Network Operations and Management Symposium (NOMS),2010:373380.
[14]Muller K,Mika S,Ratch G,et al.An introduction to kernerbased learning algorithms[J].IEEE Transactions on Neural Network,2001,12(2):181201.
[15]Kasselman P R.A fast attack on the MD4 Hash function[C]∥Proc of Symposium on Communications & Signal Processing,1997:147150.
[16]http://elki.dbs.ifi.lmu.dewikiDataSetGenerator.
附中文参考文献:.
[3]黄炜,刘坤.面向信息特征模式识别的核方法研究综述[J].现代情报,2014,34(3):168176.
[4]何清,李宁,罗文娟.大数据下的机器学习算法综述[J].模式识别与人工智能,2014,27(4):328336.
[11]缪裕青,张锦杏.一种基于Hadoop平台的新聚类算法[J].计算机科学,2014,41(4):269272.
[12]张石磊,武装.一种基于Hadoop云计算平台的聚类算法优化的研究[J].计算机科学,2012,39(10):115118.