• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2013, Vol. 35 ›› Issue (9): 20-26.

• 论文 • Previous Articles     Next Articles

Research of MPI nonblocking broadcast
algorithm and its performance analysis     

 YAN Xinkai,HAO Ziyu,WU Dong,XIE Xianghui   

  1. (State Key Laboratory of Mathematical Engineering and Advanced Computing,Wuxi 214125,China)
  • Received:2013-04-23 Revised:2013-07-20 Online:2013-09-25 Published:2013-09-25

Abstract:

The MPI3.0 standard contains nonblocking collectives. Nonblocking collectives captures the features of both nonblocking communication and collective communication. Compare with blocking collectives, nonblocking collectives have lower synchronization overhead and can realize the overlap between computing and communication in the program so as to improve the performance. The paper describes different algorithm implementations of broadcast, compares the lowlevel management between nonblocking and blocking broadcast, analyzes the experiment results and finally offers some optimizations.

Key words: MPI;nonblocking broadcast, synchronization overhead;overlap