• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2020, Vol. 42 ›› Issue (10高性能专刊): 1774-1783.

• 高性能计算机系统软件 • 上一篇    下一篇

百亿亿次级系统进程管理接口综述

张昆,张伟,卢凯,董勇,戴屹钦   

  1. (国防科技大学计算机学院,湖南 长沙 410073)

  • 收稿日期:2020-06-10 修回日期:2020-07-23 接受日期:2020-10-25 出版日期:2020-10-25 发布日期:2020-10-23
  • 基金资助:
    国家重点研发计划(2018YFB0204301);国家数值风洞项目(NNW2018-ZT6B13)

A survey on the process management interface for exascale computing systems

ZHANG Kun,ZHANG Wei,LU Kai,DONG Yong,DAI Yi-qin   

  1. (School of Computer,National University of Defense Technology,Changsha 410073,China)

  • Received:2020-06-10 Revised:2020-07-23 Accepted:2020-10-25 Online:2020-10-25 Published:2020-10-23

摘要: 高性能计算机不断发展,系统规模日益增加,系统内包含的计算结点数、处理器核数扩展到新的水平。在超大规模系统中,并行应用程序的启动时间成为限制系统运行效率、降低系统易用性的一个重要因素。在并行应用启动阶段,利用进程管理接口为进程部署通信通道,供进程后续通信使用。在百亿亿次级规模系统中,传统进程管理接口无法在启动时快速获得通信信息,导致启动时间过长,系统性能下降。首先介绍进程管理接口在并行程序启动过程中的作用,着重介绍面向百亿亿次级系统的进程管理接口PMIx,而后对比论述PMIx对于改进大规模并行程序启动的作用,分析PMIx在提升系统性能上做出的优化,以及未来发展方向。

关键词: 高性能计算, 进程管理接口, 进程通信

Abstract: With the continuous development of high-performance computing, the scale of the system increases constantly, and the number of nodes and processor cores in the system has expanded to a new level. Under the condition of hyperscale systems, the startup time of parallel applications becomes an important factor, which limits the system’s operating efficiency and reduces the ease of use. The process management interface is used to deploy a communication channel for the process during the parallel application startup phase for subsequent communication of the process. In exascale systems, the traditional process management interface cannot quickly obtain communication information at the startup phase, resulting in long startup time and the reduced system performance. We first introduce the role of the process management interface in the parallel program startup process, focus on the process management interface PMIx for exascale systems, compare and discuss the role of PMIx in improving the startup of large-scale parallel programs, analyze the optimization of PMIx in improving system performance, and discuss future development directions.



Key words: high-performance computing, process management interface, process communication