Accelerating 3D GVF field computation on #br#
Xeon Phi using stencil optimization

Computer Engineering & Science ›› 2014, Vol. 36 ›› Issue (08): 1435-1440.

Previous Articles Next Articles

Accelerating 3D GVF field computation on #br# Xeon Phi using stencil optimization

QI Jin1,LI Kuan2,YANG Canqun1,DU Yunfei2

(1.National Laboratory of Parallel and Distributed Processing,National University of Defense Technology,Changsha 410073;(2.College of Computer Science,National University of Defense Technology,Changsha 410073,China)

Received:2013-08-12 Revised:2013-11-11 Online:2014-08-25 Published:2014-08-25

Abstract

Abstract:

3D Gradient Vector Flow (GVF) field has wide applications in many image processing algorithms. The computation of GVF field typically needs several iterations and is rather time consuming. Therefore, it is important and meaningful to improve the computation speed of 3D GVF field. The data level parallelism and thread level parallelism are introduced to accelerate the GVF field computation procedure on Intel Xeon Phi many core integrated platform for the first time. Meanwhile, GVF field computation is a kind of stencil computation, whose computationmemory access ratio is low. A novel cache blocking strategy is proposed to fully utilize the L2 cache of Xeon Phi architecture，and to improve the computation speed of GVF field. The experimental results show that the proposed optimizations could effectively improve the speed of GVF filed computation. Especially, for a 5123 3D image, compared with the performance obtained by a 2.6G Hz 8 core 16threads Intel Xeon E52670 CPU, the speedup achieved on Xeon Phi is 2.77X.

Key words: 3D GVF field, Xeon Phi, stencil optimization, cache blocking

QI Jin, LI Kuan, YANG Canqun, DU Yunfei. Accelerating 3D GVF field computation on #br# Xeon Phi using stencil optimization [J]. Computer Engineering & Science, 2014, 36(08): 1435-1440.

[1]	Lin1,2,WANG Yi chao1,QIN Qiang1,LI Shuo3,WEN Min hua1,Satoshi Matsuoka2. Modeling and evaluating Intel IMCI vgather instruction using stencilsJames [J]. Computer Engineering & Science, 2016, 38(09): 1741-1747.
[2]	YANG Xiangsen1,JIN Jun2,WANG Peng1,MA Zhaogui1,KANG Yonggan1. Wave equation prestack depth migration based on Xeon Phi platform [J]. J4, 2015, 37(05): 907-913.
[3]	XIONG Min，WANG Yongxian. Parallel optimization of the seismic wave PKTM algorithm on CPU+MIC heterogeneous platform [J]. J4, 2015, 37(01): 14-22.
[4]	YAO Wenke1，DU Yunfei2,WU Qiang1,YANG Canqun1. Research of LARED-P on Intel Xeon Phi [J]. J4, 2014, 36(05): 809-813.

Accelerating 3D GVF field computation on #br# Xeon Phi using stencil optimization

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 4

Recommended Articles 0

Metrics

Comments