[1] |
Jain A K,Murty M N,Flynn P J.Data clustering: A review[J].ACM Computing Surveys (CSUR),1999,31(3):264323.
|
[2] |
Ester M,Kriegel H P,Sander J,et al.A densitybased algorithm for discovering clusters in large spatial databases with noise[C]∥Proc of Kdd,1996:226231.
|
[3] |
Zhang T, Ramakrishnan R, Livny M. BIRCH: An efficient data clustering method for very large databases[C]∥Proc of ACM Sigmod Record,1996: 103114.
|
[4] |
Wang W,Yang J,Muntz R.STING: A statistical information grid approach to spatial data mining[C]∥Proc of VLDB,1997: 186195.
|
[5] |
Tsapanos N,Tefas A,Nikolaidis N,et al.Efficient MapReduce kernel kmeans for big data clustering[C]∥Proc of the 9th ACM Hellenic Conference on Artificial Intelligence,2016: 28.
|
[6] |
Patwary M M A,Palsetia D,Agrawal A,et al.A new scalable parallel DBSCAN algorithm using the disjointset data structure[C]∥Proc of
|
20 |
12 IEEE International Conference on High Performance Computing,Networking,Storage and Analysis (SC 2012),2012:111.
|
[7] |
Chierichetti F, Dalvi N, Kumar R. Correlation clustering in MapReduce[C]∥Proc of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2014:641650.
|
[8] |
Jin C,Chen Z,Hendrix W,et al.Incremental,distributed singlelinkage hierarchical clustering algorithm using MapReduce[C]∥Proc of the Symposium on High Performance Computing,2015:8392.
|
[9] |
MLlib | Spark [EB/OL].[201605].http:∥spark.apache.org/mllib/.
|
[10] |
Liu Zhiqiang,Gu Rong,Yuan Chunfeng,et al.Parallelization of classification algorithms based on SparkR[J].Journal of Frontiers of Computer Science and Technology,2015,9(11):12811294.(in Chinese)
|
[11] |
Meng X,Bradley J,Yavuz B,et al.MLlib:Machine learning in apache spark[J].The Journal of Machine Learning Research,2016,17(1):12351241.
|
[12] |
Garg A,Mangla A,Gupta N,et al.PBIRCH:A scalable parallel clustering algorithm for incremental data[C]∥Proc of 2006 10th International Database Engineering and Applications Symposium (IDEAS’06),2006:315316.
|
[13] |
Yu Xiaoshan, Wu Yangyang. Parallel text hierarchical clustering based on MapReduce[J].Journal of Computer Applications,2014,34(6):15951599.(in Chinese)
|
[14] |
Zhu Yinghui,Jiang Yuzhen.Research of BIRCH clustering algorithm optimization and parallelism[J].Computer Engineering and Design,2007,28(18):43454346.(in Chinese)
|
[15] |
Wei Xiang.Improved BIRCH clustering algorithm based on density[J].Computer Engineering and Applications,2013,49(10):201205.(in Chinese)
|
[16] |
Holden K,Andy K,Patrick W,et al.Learning spark:Lightingfast data analysis[M]. New York: O’Reilly Media,2015.
|
[17] |
Zaharia M,Chowdhury M,Franklin M J,et al.Spark:Cluster computing with working sets[C]∥Proc of HotCloud’10, 2010:10.
|
[18] |
Veenman C J,Reinders M J T,Backer E.A maximum variance cluster algorithm[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(9):12731280.
|
[19] |
Gionis A, Mannila H, Tsaparas P. Clustering aggregation[J].ACM Transactions on Knowledge Discovery from Data (TKDD),2007,1(1):130.
|
|
附中文参考文献:
|
[10] |
刘志强,顾荣,袁春风,等.基于 SparkR 的分类算法并行化研究[J].计算机科学与探索,2015,9(11):12811294.
|
[13] |
余晓山,吴扬扬.基于 MapReduce 的文本层次聚类并行化[J].计算机应用,2014,34(6):15951599.
|
[14] |
朱映辉,江玉珍.BIRCH 聚类算法优化及并行化研究[J].计算机工程与设计,2007,28(18):43454346.
|
[15] |
韦相. 基于密度的改进BIRCH聚类算法[J]. 计算机工程与应用,2013,49(10):201205.
|