• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2014, Vol. 36 ›› Issue (08): 1435-1440.

Previous Articles     Next Articles

Accelerating 3D GVF field computation on #br# Xeon Phi using stencil optimization         

QI Jin1,LI Kuan2,YANG Canqun1,DU Yunfei2   

  1. (1.National Laboratory of Parallel and Distributed Processing,National University of Defense Technology,Changsha 410073;(2.College of Computer Science,National University of Defense Technology,Changsha 410073,China)
  • Received:2013-08-12 Revised:2013-11-11 Online:2014-08-25 Published:2014-08-25

Abstract:

3D Gradient Vector Flow (GVF) field has wide applications in many image processing algorithms. The computation of GVF field typically needs several iterations and is rather time consuming. Therefore, it is important and meaningful to improve the computation speed of 3D GVF field. The data level parallelism and thread level parallelism are introduced to accelerate the GVF field computation procedure on Intel Xeon Phi many core integrated platform for the first time. Meanwhile, GVF field computation is a kind of stencil computation, whose computationmemory access ratio is low. A novel cache blocking strategy is proposed to fully utilize the L2 cache of Xeon Phi architecture,and to improve the computation speed of GVF field. The experimental results show that the proposed optimizations could effectively improve the speed of GVF filed computation. Especially, for a 5123 3D image, compared with the performance obtained by a 2.6G Hz 8 core 16threads Intel Xeon E52670 CPU, the speedup achieved on Xeon Phi is 2.77X.

Key words: 3D GVF field, Xeon Phi, stencil optimization, cache blocking