• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Accelerating CNN on mobile GPU

WANG Xiang-xin1,SHI Yang2,WEN Mei2   

  1. (1.Information Center of Armed Police Fire Center,Changsha 410205;
    2.College of Computer,National University of Defense Technology,Changsha 410073,China)
     
  • Received:2016-11-08 Revised:2017-02-15 Online:2018-01-25 Published:2018-01-25

Abstract:

Convolutional Neural Networks (CNNs) are playing an increasingly important role in areas such as image classification and speech recognition because of their excellent performance. Some researchers have already wanted to apply this deep learning process on mobile phones, but the performance of the porting program is unsatisfactory due to the huge amount of computation of CNN. In order to explore how to solve this problem, this paper uses a deep learning framework named MXNet to realize the forward process of CNN on mobile phones and focuses on the use of GPU that is another powerful computing device on the mobile phone. Based on the OpenCL common programming framework, we use matrix multiplication to compute the most time-consuming convolution in the forward process and move it to the GPU. Besides, serval improvements are made to achieve better performance. Finally, the experimental results show that we succeed in reducing the time of the forward process to half of the original time.
 

Key words: CNN, mobile phone, mobile GPU, fast algorithm, OpenCL