• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2012, Vol. 34 ›› Issue (8): 184-190.

• 论文 • Previous Articles     Next Articles

Design of the RDMA Reliable Communication Protocol Based on Dynamic Connection

LIU Lu,ZHANG Lei,CAO Jijun,DAI Yi   

  1. (School of Computer Science,National University of Defense Technology,Changsha 410073,China)
  • Received:2012-04-28 Revised:2012-06-11 Online:2012-08-25 Published:2012-08-25

Abstract:

Upcoming 100 Petascale/Exascale Supercomputers will demand highly reliable,wellbalanced and highly scalable interconnection networks.Our RDMA transport model implements an endtoend reliable communication protocol by a small quantity of resources configuration and the dynamic connection strategy.Unlike the conventional implementations such as Infiniband,the proposed scheme has superior attributes in terms of a) being able to recover network failures by changing route automatically;b)being able to handle the packets coming out of order and use multiple paths between the source and destination nodes,providing message flow control,all of these measures can reduce the network hot spot and congestion;c)the reliability resources are implemented in hardware, not consuming the memory for connection,so it has good system scalability.The experimental results show that our optimized scheme does not increase the latency of the messages whose size is below 4k bytes.

Key words: reliable communication protocol;RDMA;network interface;Infiniband;dynamic connection