• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2021, Vol. 43 ›› Issue (04): 681-688.

• 软件工程 • 上一篇    下一篇

ARM计算环境下堆芯程序的移植

明平洲1,李治刚1,刘婷1,芦韡2,刘东1,曾辉1,余红星2   

  1. (1.中国核动力研究设计院,四川 成都 610213;

    2.中国核动力研究设计院核反应堆系统设计技术重点实验室,四川 成都 610213)

  • 收稿日期:2020-04-28 修回日期:2020-06-09 接受日期:2021-04-25 出版日期:2021-04-25 发布日期:2021-04-21
  • 基金资助:
    中国核动力研究设计院探索基金

Porting of reactor core programs in ARM environment

MING Ping-zhou1,LI Zhi-gang1,LIU Ting1,LU Wei2,LIU Dong1,ZENG Hui1,YU Hong-xing2#br# #br# #br#   

  1. (1.Nuclear Power Institute of China,Chengdu 610213;

    2.Science and Technology on Reactor System Design Technology Laboratory,

    Nuclear Power Institute of China,Chengdu 610213,China)
  • Received:2020-04-28 Revised:2020-06-09 Accepted:2021-04-25 Online:2021-04-25 Published:2021-04-21

摘要: 为了论证国产芯片在堆芯数值计算领域的可行性,对多个堆芯程序在飞腾处理器的ARM通用计算环境中进行了移植,涉及堆芯燃料管理软件的扩散原型程序NACK-R、子通道分析程序CORTH、特征线输运程序OpenMOC和堆芯组件程序KYLIN2。移植过程在ARM计算环境中通过合理的程序代码修订,去除对商业函数库的依赖,且在移植过程中对KYLIN2的特征线循环扫描计算过程引入OpenMP多线程并行,论证单结点多个飞腾处理器核心的并行能力。参照对象Intel商用处理器的频率约为飞腾处理器频率的2倍,堆芯程序移植后的串行运行效率与在Intel计算环境中的串行运行效率差异保持在3~4倍,受限于所使用飞腾处理器型号的缓存大小,部分数据量较大例题的性能差异可能更大。KYLIN2完成多线程并行后计算效率接近在Intel处理器上的串行效率,证明单结点多个飞腾处理器核心能够替换部分堆芯数值计算既有的应用场景。移植结果也表明,混合不同处理器的异构设计,能够在计算资源紧张的情况下充分利用国产硬件,提升计算环境的整体利用效率。

关键词: 堆芯计算, 节块法, 子通道, 特征线方法, ARM64, 程序移植, OpenMP

Abstract: In order to demonstrate the feasibility of domestic chips in the field of reactor core calculation, some reactor core programs are ported in the ARM computing environment of Phytium processor, involving the diffusion prototype program NACK-R of the core fuel management software, and sub-channel analysis program CORTH, characteristic line transport program OpenMOC, and core assembly program KYLIN2. Through reasonable program code revision, the dependence on commercial function libraries are removed, and the OpenMP parallelism is introduced in the trace process of MOC in the ARM environment so that the parallel ability of multiple Phytium processor cores in one cluster node can be investigated. The frequency of the reference Intel commercial processor is about twice that of the Phytium processor, and the difference between the serial running efficiency of the ported programs are kept at 3~4 times. Due to the cache size of the Phytium processor, the performance difference of some large input data cases may be greater. After OpenMP parallelism, the running efficiency of KYLIN2 is close to the serial efficiency of Intel computing environment, which proves that the single node with multiple Phytium processors can replace some schemes of nuclear engineering calculation. The program porting results also show that the hybrid cluster system with different types of processors can fully utilize domestic hardware in the case of computing resource crisis and improve the overall utilization.





Key words: reactor core calculation, nodal method, sub-channel method, characteristic line transport method, ARM64, program porting, OpenMP