• Journal of the China Computer Federation
• Chinese Science and Technology Core Journal
• Chinese Core Journal

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (11): 1909-1917.

• High Performance Computing •

Optimization of dot product algorithms on FT-M7002

GUO Pan-pan (1,2), CHEN Meng-xue (3), LIANG Zu-da (1,2), MA Xiao-chang (3), XU Bang-jian (4)

  1. School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450066, China
  2. National Supercomputing Center in Zhengzhou (Zhengzhou University), Zhengzhou 450001, China
  3. School of Electrical and Information Engineering, Hunan University, Changsha 410082, China
  4. School of Information Science and Engineering, Hunan University, Changsha 410082, China
  • Received: 2022-02-07  Revised: 2022-04-01  Accepted: 2022-11-25  Online: 2022-11-25  Published: 2022-11-25

Abstract: Several types of dot product algorithms are implemented and optimized on the domestic high-performance FT-M7002 DSP platform, filling out the mathematical library toolchain for this processor. The optimizations take full advantage of the FT-M7002 kernel architecture and include SIMD vector parallelization, DMA dual-channel transmission, and SVR transmission. Together they exploit the vector parallelism available in the program, speed up data transmission, and improve overall performance. Experimental results show that, on the FT platform, the average performance ratio of the optimized dot product algorithms to their unoptimized versions ranges from 12.4166 to 45.2338 across the different types. Compared with the corresponding dot product functions from TI's official dsplib library running on the TMS320C6678 processor, the average performance ratio of the FT platform to the TI platform ranges from 1.3716 to 4.5196. These results indicate that the FT-M7002 DSP platform has a clear computational performance advantage over the mainstream TI platform for dot product computation.

Key words: FT-M7002, digital signal processor (DSP), dot product algorithm, vector, DMA dual-channel transmission, SVR transmission
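
Illustrative sketch: The abstract names two general ideas, vector-parallel accumulation and overlapping data transfer with computation. The portable C sketch below illustrates both under stated assumptions and is not the authors' implementation. It uses no FT-M7002 intrinsics or DMA API: `LANES`, `TILE`, `dot_tile`, `fetch_tile`, and `dot_product` are hypothetical names and sizes, the "vector" is emulated with independent scalar accumulators, and the DMA transfer is stood in for by a synchronous `memcpy` into two alternating (ping-pong) buffers. Reading "dual-channel" as double buffering is itself an assumption.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define LANES 4   /* hypothetical vector width, not the FT-M7002's */
#define TILE  64  /* hypothetical on-chip tile size, in elements */

/* Dot product of one tile using LANES independent accumulators,
   the scalar analogue of a SIMD multiply-accumulate loop. */
static float dot_tile(const float *x, const float *y, size_t n)
{
    float acc[LANES] = {0.0f};
    size_t i = 0;
    for (; i + LANES <= n; i += LANES)
        for (size_t l = 0; l < LANES; ++l)
            acc[l] += x[i + l] * y[i + l];

    float sum = 0.0f;
    for (size_t l = 0; l < LANES; ++l)  /* horizontal reduction */
        sum += acc[l];
    for (; i < n; ++i)                  /* scalar tail */
        sum += x[i] * y[i];
    return sum;
}

/* Stand-in for a DMA transfer. A real DMA engine would run
   asynchronously; memcpy only mimics the data movement. */
static void fetch_tile(float *dst, const float *src, size_t n)
{
    memcpy(dst, src, n * sizeof *dst);
}

/* Tiled dot product with ping-pong buffers: the next tile is fetched
   into the idle buffer before the current tile is consumed. */
float dot_product(const float *x, const float *y, size_t n)
{
    assert(n == 0 || (x != NULL && y != NULL));

    float bx[2][TILE], by[2][TILE];
    float sum = 0.0f;
    size_t off = 0;
    size_t len = n < TILE ? n : TILE;
    int cur = 0;

    if (len > 0) {                      /* prime the first buffer */
        fetch_tile(bx[cur], x, len);
        fetch_tile(by[cur], y, len);
    }
    while (len > 0) {
        size_t next_off = off + len;
        size_t rest = n - next_off;
        size_t next_len = rest < TILE ? rest : TILE;
        if (next_len > 0) {             /* start "transfer" of next tile */
            fetch_tile(bx[cur ^ 1], x + next_off, next_len);
            fetch_tile(by[cur ^ 1], y + next_off, next_len);
        }
        sum += dot_tile(bx[cur], by[cur], len);
        off = next_off;
        len = next_len;
        cur ^= 1;                       /* swap buffers */
    }
    return sum;
}
```

On hardware with an asynchronous DMA engine, the fetch of the next tile would proceed while `dot_tile` runs on the current one, which is where the overlap pays off. In this sketch the two steps simply run back to back, so it demonstrates the control structure only and says nothing about the speedups reported above.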