• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2021, Vol. 43 ›› Issue (02): 228-234.

• 高性能计算 • 上一篇    下一篇

多GPU系统虚实地址转换架构研究

魏金晖,李晨,鲁建壮   

  1. (国防科技大学计算机学院,湖南 长沙 410073)
  • 收稿日期:2020-06-12 修回日期:2020-08-24 接受日期:2021-02-25 出版日期:2021-02-25 发布日期:2021-02-23

Research on virtual-physical address translation architectures of multi-GPU system

WEI Jin-hui,LI Chen,LU Jian-zhuang   

  1. (College of Computer Science and Technology,National University of Defense Technology,Changsha 410073,China)
  • Received:2020-06-12 Revised:2020-08-24 Accepted:2021-02-25 Online:2021-02-25 Published:2021-02-23

摘要: 近年来,随着大数据的发展,GPU应用的数据集规模急剧增加,这对GPU的处理能力提出了挑战。由于摩尔定律即将达到极限,提升单一GPU的性能变得越发困难,而多GPU系统通过提升GPU处理器级的并行性,成为应对该挑战的一种解决方案。GPU制造商对内存虚拟化的支持进一步简化了多GPU系统的编程,提升了资源利用率。内存虚拟化需要地址转换的支持,而地址转换的开销对系统性能具有重要影响。研究了多GPU系统中2种常见的地址转换架构,即分布式地址转换架构和集中式地址转换架构,通过模拟实验对2种架构进行了深度分析和比较,在此基础上提出了优化地址转换设计的建议。

关键词: 多GPU系统, 内存虚拟化, 地址转换架构

Abstract: In recent years, with the development of big data, the dataset size of GPU applications has increased significantly, which raises challenges for current GPUs. However, as Moore's Law reaches its limit, it is not easy to improve the performance of single GPU any further; Instead, multi-GPU systems have been shown to be an effective solution due to its GPU processor-level parallelism. The support for memory virtualization in multi-GPU systems further simplifies the programming and improves the resource utilization. Memory virtualization requires the support for address translation, and the overhead of address translation has important impact on system’s performance. This paper studies two common address translation architectures in multi-GPU systems, that is, distributed address translation architecture and centralized address translation architecture. Through simulation experiments, this paper ana- lyzes and compares the advantages and drawbacks of two address translation architectures in-depth. On this basis, this paper proposes optimization suggestions for address translation in multi-GPU systems.


Key words: multi-GPU system, memory virtualization, address translation architecture