[1] |
White T. Hadoop:The definitive guide[J]. Oreilly Media Inc Gravenstein Highway North,2010,215(11):14.
|
[2] |
Lakshman A,Malik P. Cassandra:A decentralized structured storage system[J]. Acm Sigops Operating Systems Review,2010,44(2):3540.
|
[3] |
Zaharia M,Chowdhury M,Franklin M J,et al. Spark:Cluster computing with working sets[C]∥Proc of the 2nd USENIX Conference on Hot Topics in Cloud Computing,2010:17651773.
|
[4] |
Seo S,Jang I,Woo K,et al. HPMR:Prefetching and preshuffling in shared MapReduce computation environment[C]∥Proc of the 2009 IEEE International Conference on Cluster Computing,2009:18.
|
[5] |
Jiang D,Ooi B C,Shi L,et al. The performance of MapReduce:An indepth study[J]. Proceedings of the VLDB Endowment,2010,3(12):472483.
|
[6] |
Dittrich J. Hadoop++:Making a yellow elephant run like a cheetah (without it even noticing)[J]. Proceedings of the VLDB Endowment,2010,3(12):518529.
|
[7] |
Shivnath B. Towards automatic optimization of MapReduce programs[C]∥Proc of the 1st ACM Symposium on Cloud Computing,2010:137142.
|
[8] |
Herodotou H,Lim H, Luo G, et al. Starfish:A selftuning system for big data analytics[C]∥Proc of the 5th Cidr Conf,2011:261272.
|
[9] |
Shi Juwei,Zhou Jia, Lu Jiaheng, et al. MRTuner:A toolkit to enable holistic optimization for MapReduce jobs[C]∥Proc of the VLDB Endowment, 2014,7(13):13191330.
|
[10] |
Aaron D,Andrew O.Optimizing shuffle performance in spark[R].CA:BerkeleyDepartment of Electrical Engineering and Computer Sciences,University of California,2013.
|
[11] |
Ravi N. Configuring and optimizing spark applications with easeNishkam ravi,Cloudera[EB/OL].[20150901].https://apachebigdata2015.sched.org/event/55afa6d65370a56bdbcb5eba5166f010#.VemuzvaqpEN.
|
[12] |
Lin J,Keogh E,Wei L,et al. Experiencing SAX:A novel symbolic representation of time series[J]. Data Mining & Knowledge Discovery,2007,15(2):107144.
|
[13] |
Xu W,Huang L,Fox A,et al. Detecting largescale system problems by mining console logs[C]∥Proc of the 27nd International Conference on Machine Learning,2010.DOI:10.1145/1629575.1629587.
|
[14] |
Shieh J,Keogh E. iSAX:indexing and mining terabyte sized time series[C]∥Proc of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining ACM,2008:623631.
|
[15] |
Zaharia M,Chowdhury M,Das T,et al. Resilient distributed datasets:A faulttolerant abstraction for inmemory cluster computing[C]∥Proc of the 9th USENIX Conference on Networked Systems Design and Implementation, 2011:141146.
|
[16] |
Dijkman R, Dumas M, GarciaBanuelos. Graph matching algorithms for business process model similarity search[C]∥Proc of the 7th International Conference on Business Process Management,2009:4863.
|
[17] |
Bottou L. Largescale machine learning with stochastic gradient descent[C]∥Proc of COMPSTAT’2010,2010:177186.
|