• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2014, Vol. 36 ›› Issue (05): 809-813.

• 论文 • 上一篇    下一篇

基于Intel Xeon Phi的激光等离子体粒子模拟研究

姚文科1,杜云飞2,吴强1,杨灿群1   

  1. (1.国防科学技术大学并行与分布处理国家重点实验室,湖南 长沙 410073;
    2.国防科学技术大学计算机学院,湖南 长沙 410073)
  • 收稿日期:2013-08-10 修回日期:2013-11-16 出版日期:2014-05-25 发布日期:2014-05-25
  • 基金资助:

    国家863计划资助项目(2012AA010903);国家自然科学基金资助项目(61170049)

Research of LARED-P on Intel Xeon Phi            

YAO Wenke1,DU Yunfei2,WU Qiang1,YANG Canqun1   

  1. (1.National Key Laboratory for Parallel and Distributed Processing,
    National University of Defense Technology,Changsha 410073;
    2.College of Computer,National University of Defense Technology,Changsha 410073,China)
  • Received:2013-08-10 Revised:2013-11-16 Online:2014-05-25 Published:2014-05-25

摘要:

激光等离子体粒子模拟广泛用于探索极端物质状态下的科学问题。将一种基于粒子云网格方法的三维等离子体粒子模拟程序LAREDP移植到Intel Xeon Phi协处理器上。在移植的过程中,综合运用了Native和Offload两种编程模式:首先运用Native模式对LAREDP程序中热点计算任务进行优化研究,通过采用SIMD扩展指令使该计算任务获得了4.61倍的加速;然后运用Offload模式将程序移植到CPU-Intel Xeon Phi异构系统上,并通过使用异步数据传输和双缓冲技术分别提升了程序性能9.8%和21.8%。

关键词: LAREDP, Intel Xeon Phi, Native模式, Offload模式, 512位SIMD扩展指令, 异步数据传输, 双缓冲

Abstract:

Plasma simulations have been widely used to exploit scientific problems under extremely situations. The paper ports a particleincell based plasma code, LAREDP, to Intel Xeon Phi coprocessor. In order to accomplish this, two modes are employed, i.e. the Native mode and the Offload mode. Firstly, the Native mode is employed to study on the hot computing tasks, which have been accelerated to 4.61 times faster by using SIMD extension instructions. Secondly, the Offload mode is employed to transplant the whole code onto a CPUIntel Xeon Phi heterogeneous system. We also adopt optimizations such as asynchronous data transferring and double buffer technique to improve the performance. And we obtain 9.8% and 21.8% improvement respectively.
    

Key words: LARED-P;Intel Xeon Phi;native mode;offload mode;512bit SIMD extension;asynchronous data transferring;double buffer