[1] |
Zaharia M.An architecture for fast and general data proces- sing on large clusters[M].Williston:Morgan & Claypool,2016.
|
[2] |
Luo Le,Liu Yi,Qian De-pei.Survey on in-memory computing technology[J].Journal of Software,2016,27(8):2147-2167.(in Chinese)
|
[3] |
Zaharia M,Chowdhury M,Das T,et al.Resilient distributed datasets:A fault-tolerant abstraction for in-memory cluster computing[C]∥Proc of the 9th USENIX Conference on Networked Systems Design and Implementation,2012:2.1-2.14.
|
[4] |
Meituan Comment Techical Group.Spark performance tuning guide [EB/OL].[2017-04-28].http://tech.meituan.com/Spark-tuning-basic.html.
|
[5] |
Apache Spark[EB/OL].[2017-03-15].http://Spark.apache.org.
|
[6] |
Dean J,Ghemawat S.MapReduce:Simplified data processing on large clusters[J].Communications of the ACM,2008,51(1):107-113.
|
[7] |
Apache Impala.Impala overview [EB/OL]. [2015-09-21].http://www.cloudera.com/content/www/en-us/products/apache-hadoop/impala.html.
|
[8] |
Ananthanarayanan G, Ghodsi A,Wang A,et al.PACMan:Coordinated memory caching for parallel jobs[C]∥Proc of the 9th USENIX Conference on Networked Systems Design and Implementation,2012:20.1-20.14.
|
[9] |
Babu S.Towards automatic optimization of MapReduce programs [C]∥Proc of the 1st ACM Symposium on Cloud Computing,2010:137-142.
|
[10] |
Sarma A D,Afrati F N,Salihoglu S,et al.Upper and lower bounds on the cost of a map-reduce computation [C]∥Proc of the 39th International Conference on Very Large Data Bases,2013:277-288.
|
[11] |
Li J G, Lin X L, Cui X L, et al. Improving the shuffle of Hadoop MapReduce[C]∥Proc of the 13th IEEE International Conference on Cloud Computing Technology and Science,2013:266-273.
|
[12] |
Zaharia M, Konwinski A,Joseph A D, et al.Improving MapReduce performance in heterogeneous environments[C]∥Proc of the 8th USENIX Conference on Operating Systems Design and Implementation,2008:29-42.
|
[13] |
Wang K,Khan M.Performance prediction for Apache Spark platform[C]∥Proc of the 2015 IEEE 17th International Conference on High Performance Computing and Communications,2015 IEEE 7th International Symposium on Cyberspace Satety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems,2015:166-173.
|
[14] |
Petridis P,Gounaris A,Torres J.Spark parameter tuning via trial-and-error[C]∥Proc of INNS Conference on Big Data,2016:226-237.
|
[15] |
Kwon Y,Balazinska M,Howe B,et al.Skew-resistant parallel processing of feature-extracting scientific user-defined functions[C]∥Proc of the 1st ACM Symposium on Cloud Computing,2010:75-86.
|
[16] |
Kwon Y,Balazinska M,Howe B,et al.SkewTune:Mitigat- ing skew in MapReduce applications [C]∥Proc of the 2012 ACM SIGMOD International Conference on Management of Data,2012:25-36.
|
[17] |
Ramakrishnan S R,Swart G,Urmanov A, et al.Balancing Reducer skew in MapReduce workloads using progressive sampling [C]∥Proc of the 3rd ACM Symposium on Cloud Computing,2012:1-14.
|
[18] |
Gufler B,Augsten N,Reiser A,et al.Handing data skew in MapReduce [C]∥Proc of the 1st International Conference on Cloud Computing and Services Science,2011:574-583.
|
[19] |
Gufler B, Augsten N, Reiser A, et al.Load balancing in MapReduce based on scalable cardinality estimates [C]∥Proc of the 28th IEEE International Conference on Data Engineering,2012:522-533.
|
[20] |
Kolb L,Thor A,Rahm E.Load balancing for MapReduce-based entity resolution [C]∥Proc of the 28th IEEE International Conference on Big Data Engineering,2012:618-629.
|
[21] |
Kolb L,Thor A,Rahm E,et al.Block-based load balancing for entity resolution with MapReduce [C]∥Proc of the 20th ACM International Conference on Information and Knowledge Management,2011:2397-2400.
|
[22] |
Racha S C.Load balancing Map-Reduce communications for efficient executions of applications in a cloud [D].Bangalore,India:Indian Institute of Science,2012:12-16.
|
[23] |
Ibrahim S,Jin H,Lu L,et al.Handling partitioning skew in MapReduce using LEEN [J].Peer-to-Peer Networking and Applications,2013,6(4):409-424.
|
[24] |
Xu L N, Butt A R,Lim S-H,et al.A heterogeneity-aware task scheduler for Spark [C]∥Proc of the 2018 IEEE International Conference on Cluster Computing,2018:1-12.
|
[25] |
Pan F F, Xiong J,Shen Y J,et al.H-Scheduler:Storage-aware task scheduling for heterogeneous-storage Spark clusters [C]∥Proc of the 2018 IEEE 24th International Confe- rence on Parallel and Distributed Systems,2018:9-17.
|
[26] |
Klimovic A,Litz H,Kozyrakis C.Selecta:Heterogeneous cloud storage configuration for data analytics [C]∥Proc of the 2018 USENIX Conference on Annual Technical Confe- rence,2018:759-773.
|
[27] |
Wang L, Zhan J,Luo C,et al.Bigdatabench:A big data benchmark suite from internet services[C]∥Proc of 2014 IEEE 20th International Symposium on High Performance Computer Architecture,2014:488-499.
|
|
附中文参考文献:
|
[2] |
罗乐,刘轶,钱德沛.内存计算技术研究综述 [J].软件学报,2016,27(8):2147-2167.
|