[1] |
Kamil S, Oliker L,Pinar A,et al.Communication requirements and interconnect optimization for high-end scientific applications[J].IEEE Transactions on Parallel and Distributed Systems,2010,21(2):188-202.
|
[2] |
Lavrijsen W,Iancu C,Pan X.Improving network throughput with global communication reordering[C]∥Proc of 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS),2018:266-275.
|
[3] |
Koop M J,Jones T,Panda D K.Reducing connection memory requirements of MPI for InfiniBand Clusters:A message coalescing approach[C]∥Proc of the 7th IEEE International Symposium on Cluster Computing and the Grid (CCGrid’07),2007:495-504.
|
[4] |
Lavrijsen W,Iancu C.Application level reordering of remote direct memory access operations[C]∥Proc of 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS),2017:988-997.
|
[5] |
Luo M,Panda D K,Ibrahim K Z,et al.Congestion avoidance on manycore high performance computing systems[C]∥
|
|
Proc of ACM International Conference on Supercomputing,2012:121-132.
|
[6] |
Jiang N,Dennison L,Dally W J.Network endpoint congestion control for fine-grained communication[C]∥Proc of the International Conference for High Performance Computing,Networking,Storage and Analysis,2015:1-12.
|
[7] |
Acun B,Gupta A,Jain N,et al.Parallel programming with migratable objects:Charm++ in practice[C]∥Proc of the International Conference for High Performance Computing,Networking,Storage and Analysis,2014:647-658.
|
[8] |
Wesolowski L,Venkataraman R,Gupta A,et al.Tram:Optimizing fine-grained communication with topological routing and aggregation of messages[C]∥Proc of 2014 43rd International Conference on Parallel Processing,2014:211-220.
|
[9] |
Wagle B,Kellar S,Serio A,et al.Methodology for adaptive active message coalescing in task based runtime systems[C]∥
|
|
Proc of 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),2018:1133-1140.
|
[10] |
Morari A,Tumeo A,Chavarría-Miranda D,et al.Scaling irregular applications through data aggregation and software multithreading[C]∥
|
|
Proc of 2014 IEEE 28th International Parallel and Distributed Processing Symposium,2014:1126-1135.
|
[11] |
Mo Ze-yao, Zhang Ai-qing, Liu Qing-kai, et al. Parallel algorithm and parallel programming:From specialty to generality as well as software reuse[J].Scientia Sinica Informations, 2016,46(10):1392-1410.(in Chinese)
|
|
附中文参考文献:
|
|
莫则尧,张爱清,刘青凯,等.并行算法与并行编程:从个性、共性到软件复用[J].中国科学:信息科学,2016,46(10):1392-1410.
|