
Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (07): 1141-1148.

• High Performance Computing •

Running optimization of deep learning accelerators under different pruning strategies

YI Xiao, MA Sheng, XIAO Nong

  1. (College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China)
  • Received: 2021-12-08 Revised: 2022-02-25 Accepted: 2023-07-25 Online: 2023-07-25 Published: 2023-07-11

Abstract: Convolutional neural networks have achieved great success in image analysis. As deep learning develops, models are becoming increasingly complex and their computational cost is growing rapidly. Sparsification can effectively reduce this cost without sacrificing accuracy. This paper applies three pruning strategies to the ResNet18 and GoogLeNet models to reduce their computational cost. The results show that, without reducing accuracy, global unstructured pruning achieves sparsities of 94% and 90% on the two models respectively; with essentially no loss of accuracy, layer-level unstructured pruning achieves average sparsities of 83% and 56%, and layer-level structured pruning achieves average sparsities of 34% and 22%. Running the sparse models on the Eyeriss deep learning accelerator and comparing latency and power consumption against the unpruned baselines shows that, on ResNet18, global unstructured pruning reduces latency by 66.0% and power consumption by 60.7%, layer-level unstructured pruning reduces latency by 66.0% and power consumption by 80.6%, and layer-level structured pruning reduces latency by 65.6% and power consumption by 33.5%. On GoogLeNet, global unstructured pruning reduces latency by 74.5% and power consumption by 63.2%, layer-level unstructured pruning reduces latency by 73.6% and power consumption by 55.0%, and layer-level structured pruning reduces latency by 26.8% and power consumption by 5.8%. This paper therefore concludes that global unstructured pruning can greatly reduce latency and energy consumption without reducing accuracy, and that layer-level unstructured pruning can greatly reduce latency and energy consumption at the cost of a slight loss of accuracy.
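To make the three strategies concrete, the sketch below shows how they could be expressed with PyTorch's torch.nn.utils.prune utilities. This is an illustrative assumption, not the authors' implementation: the sparsity amounts are placeholders rather than the tuned values reported above, and only one strategy would be applied to a given model at a time (the other two are shown commented out to contrast their granularity).

    import torch
    import torch.nn.utils.prune as prune
    from torchvision.models import resnet18

    model = resnet18(weights=None)
    conv_layers = [(m, "weight") for m in model.modules()
                   if isinstance(m, torch.nn.Conv2d)]

    # 1) Global unstructured pruning: a single L1 threshold over all conv weights.
    prune.global_unstructured(conv_layers,
                              pruning_method=prune.L1Unstructured,
                              amount=0.9)  # placeholder sparsity, not the paper's value

    # 2) Layer-level unstructured pruning: each layer is pruned independently.
    # for module, name in conv_layers:
    #     prune.l1_unstructured(module, name, amount=0.5)

    # 3) Layer-level structured pruning: whole output channels are removed per layer.
    # for module, name in conv_layers:
    #     prune.ln_structured(module, name, amount=0.3, n=2, dim=0)

    # Bake the masks into the weights before mapping the sparse model onto the accelerator.
    for module, name in conv_layers:
        prune.remove(module, name)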

Key words: deep learning accelerator, convolutional neural network, pruning