• Journal of the China Computer Federation
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (04): 580-589.

• High Performance Computing •

Convolutional neural network inference and training vectorization method for multicore vector accelerators

CHEN Jie, LI Cheng, LIU Zhong

  1. (College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China)
  • Received: 2023-01-04 Revised: 2023-05-08 Accepted: 2024-04-25 Online: 2024-04-25 Published: 2024-04-17

Abstract: With the widespread application of deep learning, represented by convolutional neural networks (CNNs), the computational requirements of neural network models have grown rapidly, driving the development of deep learning accelerators and shifting the research focus to accelerating and optimizing neural network models based on the architectural characteristics of those accelerators. For VGG network inference and training on the independently designed multicore vector accelerator FT-M7004, vectorized mapping methods are proposed for the core operators, including convolution, pooling, and fully connected layers. Optimization strategies such as SIMD vectorization, double-buffered DMA transfers, and weight sharing are employed to fully exploit the architectural advantages of the vector accelerator and achieve high computational efficiency. Experimental results show that on the FT-M7004 platform, the average computational efficiency of convolution layer inference and training is 86.62% and 69.63%, respectively, while fully connected layer inference and training reach 93.17% and 81.98%, respectively. The inference efficiency of the VGG network model on FT-M7004 exceeds that on a GPU platform by more than 20%.
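The abstract's central idea, mapping convolution onto a vector unit, is commonly realized by unfolding input patches into a matrix so the convolution becomes one large matrix product that SIMD or GEMM hardware executes efficiently. The sketch below is a minimal, generic illustration of that im2col-plus-GEMM mapping in NumPy; it is not the paper's FT-M7004 implementation, and the function names and the stride-1, no-padding setting are assumptions for illustration only.

```python
# Generic sketch of the im2col + GEMM convolution mapping (illustrative only;
# not the FT-M7004 kernels from the paper). Stride 1, no padding assumed.
import numpy as np

def im2col(x, kh, kw):
    """Unfold every kh-by-kw patch of a 2-D input into one column."""
    h, w = x.shape
    oh, ow = h - kh + 1, w - kw + 1
    cols = np.empty((kh * kw, oh * ow))
    for i in range(oh):
        for j in range(ow):
            cols[:, i * ow + j] = x[i:i + kh, j:j + kw].ravel()
    return cols, oh, ow

def conv2d_gemm(x, k):
    """Express the convolution (cross-correlation) as a single matrix product."""
    kh, kw = k.shape
    cols, oh, ow = im2col(x, kh, kw)
    # Flattened kernel times patch matrix: one GEMM call a vector unit can run.
    return (k.ravel() @ cols).reshape(oh, ow)

x = np.arange(16.0).reshape(4, 4)   # toy 4x4 input
k = np.ones((2, 2))                 # toy 2x2 kernel
out = conv2d_gemm(x, k)             # 3x3 output; out[0, 0] = 0 + 1 + 4 + 5 = 10
```

On a vector accelerator, the patch matrix would be tiled through on-chip memory, which is where optimizations like double-buffered DMA transfers (prefetching the next tile while the current one is multiplied) come into play.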

Key words: multicore vector accelerator, convolutional neural network, inference algorithm, training algorithm