• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2020, Vol. 42 ›› Issue (10, High Performance Computing Special Issue): 1720-1729.

• High Performance Computer Architecture •

Implementation of scalable communication framework on TH-express interconnection

XIE Min, ZHANG Wei, ZHOU En-qiang, DONG Yong

  1. (School of Computer, National University of Defense Technology, Changsha 410073, China)

  • Received: 2020-06-10 Revised: 2020-07-12 Accepted: 2020-10-25 Online: 2020-10-25 Published: 2020-10-23
  • Supported by:
    National Key Research and Development Program of China (2018YFB0204301)

Abstract: An open source communication framework defines standardized communication programming interfaces between the parallel programming model and the interconnect interface. It provides high performance communication operations that are independent of the characteristics of the interconnection network, which improves the efficiency of developing programming models on new interconnection networks. The performance and scalability problems that arise when implementing such a framework on the TH-express (Tianhe) interconnection are solved through the design and implementation of multi-channel data transfer protocols. Test results show that the communication framework on the TH-express interconnection has very low software overhead and delivers communication performance close to the design targets of the interconnect hardware. This provides a good foundation for efficient support of various programming models and distributed computing frameworks on the TH-express interconnection.


Key words: high-speed interconnection network, communication framework, message passing interface (MPI), remote direct memory access (RDMA)
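
The abstract names multi-channel data transfer protocols but gives no detail of their design. As a rough illustration of the general idea only, the sketch below stripes one message across several channels in fixed-size chunks and reassembles it at the receiver regardless of arrival order. The function names `stripe_message` and `reassemble`, the round-robin assignment, and the chunk size are all assumptions made for this example; they are not taken from the paper and do not describe the TH-express implementation.

```python
from typing import List, Tuple

# (channel index, byte offset, byte count); a hypothetical segment descriptor.
Segment = Tuple[int, int, int]


def stripe_message(length: int, num_channels: int, chunk_size: int) -> List[Segment]:
    """Split a message of `length` bytes into chunks assigned round-robin to channels."""
    if length < 0 or num_channels <= 0 or chunk_size <= 0:
        raise ValueError("length must be >= 0; num_channels and chunk_size must be > 0")
    segments: List[Segment] = []
    offset = 0
    index = 0
    while offset < length:
        size = min(chunk_size, length - offset)  # last chunk may be short
        segments.append((index % num_channels, offset, size))
        offset += size
        index += 1
    return segments


def reassemble(payload: bytes, segments: List[Segment]) -> bytes:
    """Rebuild the message from segments delivered in arbitrary order.

    Each segment carries its own offset, so the receiver can place data
    directly at its final position, much as an RDMA write targets an address.
    """
    buffer = bytearray(sum(size for _, _, size in segments))
    # Reversed order stands in for out-of-order arrival across channels.
    for _, offset, size in reversed(segments):
        buffer[offset:offset + size] = payload[offset:offset + size]
    return bytes(buffer)


if __name__ == "__main__":
    message = b"0123456789"
    plan = stripe_message(len(message), num_channels=2, chunk_size=4)
    print(plan)
    print(reassemble(message, plan) == message)
```

Carrying the offset in every segment is the design choice that makes ordering across channels unnecessary in this sketch; a real protocol would also need completion tracking and flow control, which are omitted here.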