• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2012, Vol. 34 ›› Issue (8): 147-153.

• Articles •

GPU Parallel Optimization of the Oceanic General Circulation Model POP

GUO Song, DOU Yong, LEI Yuanwu

  1. (State Key Laboratory of High Performance Computing,
    National University of Defense Technology, Changsha 410073, China)
  • Received: 2012-04-28 Revised: 2012-06-11 Online: 2012-08-25 Published: 2012-08-25

Abstract:

POP (Parallel Ocean Program) is a global ocean general circulation model that is widely used in ocean research and climate prediction. As resolution increases, the demand for computing power grows geometrically, which limits the development of the POP ocean model. Based on an analysis of the equations and numerical characteristics of the POP ocean model, this paper ports the model to the GPU platform using the CUDA Fortran programming model. It adopts hybrid parallelism to achieve coalesced access to global memory, reduces the use of local memory, improves data reuse through registers, and enlarges the portion of code executing on the GPU to minimize communication between the CPU and the GPU. Experiments show that CUPOP running on one NVIDIA Tesla C2070 card achieves speedups of up to 8.47 times and 1.5 times over the serial program and a six-process MPI program, respectively, both running on an Intel Xeon X5675 CPU.

Key words: CUDA GPU; parallel ocean program model; GPUPOP; CUDA Fortran