• Official journal of the China Computer Federation
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2020, Vol. 42 ›› Issue (11): 1981-1987.

Reduction operation offloading optimization based on Tianhe interconnect MPI collective

WANG Hao,ZHANG Wei,XIE Min,DONG Yong   

  1. (School of Computer, National University of Defense Technology, Changsha 410073, China)
  • Received: 2020-06-12 Revised: 2020-07-15 Accepted: 2020-11-25 Online: 2020-11-25 Published: 2020-11-30

Abstract: MPI collective communication operations are widely used in parallel scientific applications and have an important impact on program scalability. The Tianhe interconnect network supports triggered communication operations, which can offload message transmission and processing work and improve communication performance between nodes. Using these triggered operations, Allreduce and Reduce algorithms are designed for different tree topologies to lower the latency of reduction communication between nodes. Tests on an actual system platform show that, compared with the point-to-point implementations of these two operations in MPICH, the trigger-based offload algorithms reduce running time by up to 59.6% at different node scales.
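
As a rough illustration of the tree-structured reduction the abstract refers to, the following is a minimal Python sketch that simulates Reduce and Allreduce over a k-ary tree in a single process. It is not the authors' algorithm and does not model Tianhe triggered operations or any MPI call. The heap-style rank layout (the parent of rank r is (r - 1) // k) and the names `children`, `tree_reduce`, `tree_allreduce`, and `tree_depth` are assumptions made for illustration only.

```python
from operator import add


def children(rank, size, arity):
    """Ranks of the children of `rank` in a k-ary tree rooted at rank 0 (assumed layout)."""
    first = arity * rank + 1
    return range(first, min(first + arity, size))


def tree_reduce(values, arity=2, op=add):
    """Simulate Reduce to rank 0; `op` is assumed associative and commutative."""
    size = len(values)
    partial = list(values)
    # Children always have higher rank numbers than their parent, so walking
    # ranks from high to low finishes every child before its parent reads it.
    for rank in range(size - 1, -1, -1):
        for child in children(rank, size, arity):
            partial[rank] = op(partial[rank], partial[child])
    return partial[0]


def tree_allreduce(values, arity=2, op=add):
    """Reduce to the root, then model a broadcast of the result to every rank."""
    total = tree_reduce(values, arity, op)
    return [total] * len(values)


def tree_depth(size, arity):
    """Number of parent-child hops from the highest rank up to the root."""
    depth, rank = 0, size - 1
    while rank > 0:
        rank = (rank - 1) // arity
        depth += 1
    return depth
```

In this sketch, `tree_depth(8, 2)` and `tree_depth(8, 4)` give the hop counts for a binary and a 4-ary tree over eight ranks. A wider tree has fewer levels, but each parent combines more child results, which is one reason the choice of tree topology can matter for reduction latency.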

Key words: collective communication, reduction, triggered operations, offloaded communication