• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science


OpenCV parallel optimization on TI 6678 DSP

LI Jin¹, LUO Xin-jie², HU Xiao¹, CHEN Yue-yue¹

  1. College of Computer, National University of Defense Technology, Changsha 410073, China
  2. School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China
  • Received: 2017-11-03 Revised: 2018-02-10 Online: 2018-05-25 Published: 2018-05-25

Abstract:

Digital signal processors (DSPs) are widely used in industrial and military equipment. OpenCV is a popular open-source image processing library, yet few ports or optimizations of OpenCV exist for DSP platforms. In this paper, we port OpenCV to the TMS320C6678 DSP platform and build a low-level support library that preserves most of its functions. On this basis, we analyze in depth the computational characteristics and data flow of selected OpenCV library functions running on this platform, and propose an optimization method for them. The method combines DMA, cache operations, and the OpenMP parallel framework, all of which are supported by the TI 6678 architecture. Using this method, we optimize a class of OpenCV library functions and parallelize them across multiple cores of the TI 6678 chip. An optimized OpenCV library function running on a single core of the TI 6678 achieves a speedup of up to 3.6 times. Parallelizing this class of library functions on 8 cores then yields speedups of 2.55 to 7.06.
 

Key words: TMS320C6678, OpenCV, OpenMP, multi-core parallelism
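
As an illustration only, the following C sketch shows the general pattern the abstract describes: an image is split into row tiles, each tile is staged into a small local buffer, processed there, and written back, with the tiles distributed across cores by OpenMP. It is a minimal sketch under stated assumptions, not the authors' implementation. The function names (`tile_copy`, `threshold_tile`, `threshold_image`), the constants `TILE_ROWS` and `MAX_WIDTH`, and the choice of a binary threshold as the example kernel are all hypothetical. The `memcpy` call stands in for a DMA transfer, and the cache write-back and invalidate steps that real hardware would need around such transfers are only marked in comments.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define TILE_ROWS 8     /* hypothetical tile height, in rows */
#define MAX_WIDTH 1024  /* hypothetical upper bound on image width */

/* Stand-in for a DMA transfer between external memory and fast local
 * memory. Assumption: on the real chip this would be a DMA request,
 * bracketed by platform-specific cache write-back/invalidate calls. */
static void tile_copy(uint8_t *dst, const uint8_t *src, size_t bytes)
{
    memcpy(dst, src, bytes);
}

/* Example per-pixel kernel: binary threshold
 * (pixel > thresh ? maxval : 0), applied in place. */
static void threshold_tile(uint8_t *buf, size_t n,
                           uint8_t thresh, uint8_t maxval)
{
    for (size_t i = 0; i < n; ++i)
        buf[i] = (buf[i] > thresh) ? maxval : 0;
}

/* Threshold a single-channel 8-bit image stored row-major without padding.
 * Returns 0 on success, -1 if the dimensions are unsupported. */
int threshold_image(const uint8_t *src, uint8_t *dst,
                    int width, int height,
                    uint8_t thresh, uint8_t maxval)
{
    if (width <= 0 || height <= 0 || width > MAX_WIDTH)
        return -1;

    int num_tiles = (height + TILE_ROWS - 1) / TILE_ROWS;

    /* Each tile is independent, so tiles can be split across cores.
     * Without OpenMP enabled the pragma is ignored and the loop runs
     * serially with the same result. */
    #pragma omp parallel for schedule(static)
    for (int t = 0; t < num_tiles; ++t) {
        uint8_t local[TILE_ROWS * MAX_WIDTH]; /* per-thread scratch buffer */
        int row0 = t * TILE_ROWS;
        int rows = (row0 + TILE_ROWS <= height) ? TILE_ROWS : height - row0;
        assert(rows > 0);
        size_t bytes = (size_t)rows * (size_t)width;
        size_t offset = (size_t)row0 * (size_t)width;

        tile_copy(local, src + offset, bytes);   /* stage tile in */
        threshold_tile(local, bytes, thresh, maxval);
        tile_copy(dst + offset, local, bytes);   /* write tile out */
    }
    return 0;
}
```

Calling `threshold_image(src, dst, width, height, 100, 255)` on a host compiler, with or without `-fopenmp`, produces the same output as a plain per-pixel loop. The sketch shows only the tiling and work distribution; it makes no claim about the speedups reported above.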