• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2022, Vol. 44 ›› Issue (09): 1542-1549.

• 高性能计算 • 上一篇    下一篇

一种面向大规模并发的Gatherv优化方法

孙浩男1,王飞2,魏迪2,尹万旺1,史俊达1   

  1. (1.国家并行计算机工程技术研究中心,北京 100080;2.清华大学计算机科学与技术系,北京 100084)
  • 收稿日期:2022-01-15 修回日期:2022-05-18 接受日期:2022-09-25 出版日期:2022-09-25 发布日期:2022-09-25
  • 基金资助:
    国家重点研发计划(2020YFB0204602)

A Gatherv optimization method for large scale concurrency

SUN Hao-nan1,WANG Fei2,WEI Di2,YIN Wan-wang1,SHI Jun-da1   

  1. (1.National Research Center of Parallel Computer Engineering & Technology,Beijing 100080;
    2.Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China)
  • Received:2022-01-15 Revised:2022-05-18 Accepted:2022-09-25 Online:2022-09-25 Published:2022-09-25

摘要: MPI不规则集合通信Gatherv为描述并行通信行为提供了极大的灵活性,但其不规则特性带来了较高的实现难度。现有方法存在通信热点突出、内存开销大和访存效率低等问题,难以满足当今大规模并行应用的性能需求。提出一种面向大规模并发的Gatherv优化方法,从优化等级、缓冲区管理等多个关键问题入手,将规则集合通信实现中常用的Binomial-Tree结构用于实现Gatherv,并提出消息链调度机制,进一步降低开销,提升优化效果。测试结果表明,该方法可以有效解决现有方法存在的性能问题,实现Gatherv集合通信性能在大规模并发条件下的高效可扩展。

关键词: MPI, 不规则集合通信, Gatherv, Binomial-Tree, 消息链调度

Abstract: As an irregular MPI (Message Passing Interface) collective communication, Gatherv provides great flexibility for the description of parallel communication behavior, but its irregularity brings high implementation difficulties. Existing methods have some problems, such as outstanding communication hotspots, high memory overhead, low memory access efficiency, etc., which are difficult to satisfy the performance requirements of todays large-scale parallel applications. A Gatherv optimization method for large scale concurrency is proposed. Starting from the optimization level, buffer management and other key issues, the binomial tree model commonly used in the implementation of regular collective communication is applied to the implementation of Gatherv. Besides, a message chain scheduling is proposed to further reduce the overhead and improve the optimization effect. Test data shows that the proposed method can effectively solve the performance problems of the existing methods, and achieve efficient scalability of Gatherv performance under the condition of large-scale concurrency.


Key words: message passing interface (MPI), irregular collectives, Gatherv, Binomial-Tree, message chain scheduling