• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (02): 209-216.

• High Performance Computing • Previous Articles     Next Articles

Analysis and evaluation of congestion control in interconnection networks for high performance computing

SUN Yan,ZHANG Jian-min,LI Yuan,SUN Shun-yu#br#   

  1. (College of Computer Science and Technology,National University of Defense Technology,Changsha 410073,China)
  • Received:2023-09-06 Revised:2023-10-27 Accepted:2024-02-25 Online:2024-02-25 Published:2024-02-24

Abstract: With the development of high performance computing technology, the number of network nodes in high performance computing systems is continuously growing, and the requirements of high performance computing applications for network performance are becoming increasingly stringent. Therefore, congestion control for high performance interconnection networks faces great pressure and challenges. To address the characteristics of high performance computing interconnection networks, researching efficient and low-overhead congestion control methods is crucial to ensuring the performance and stability of high performance interconnection networks. This study focuses on the core issues of interconnection communication in high performance computing systems. It analyzes and compares the mainstream congestion control methods. Based on the structural characteristics and communication properties of high performance computing systems, it designs a data flow model and a flow file generation tool for large-scale simulation, and proposes a comprehensive evaluation index for congestion control. Using the proposed data flow model, different congestion control methods are simulated on a large-scale network, and their performance is analyzed and evaluated based on the proposed evaluation index. The analysis and evaluation techniques proposed in this study can provide more objective and accurate analysis and evaluation of congestion control methods for high performance interconnection networks.

Key words: high performance computing, congestion control, traffic control, RDMA network