• A journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science


Towards convolutional neural network acceleration
and compression via the K-means algorithm

CHEN Guilin,MA Sheng,GUO Yang,LI Yihuang,XU Rui   

  1. (School of Computer,National University of Defense Technology,Changsha 410073,China)
     
  • Received:2018-08-05 Revised:2018-10-16 Online:2019-05-25 Published:2019-05-25

Abstract:

In recent years, the field of machine learning has developed rapidly. As a typical representative, neural networks are widely used in many industrial fields, such as speech recognition and image recognition. As application environments become more complex, accuracy requirements rise and network scale grows. Large-scale neural networks are both computation-intensive and storage-intensive: the convolutional layers are computation-intensive, while the fully connected layers are storage-intensive. In the former, processing speed cannot keep up with memory access speed; in the latter, memory access speed cannot keep up with processing speed. Based on the confidence interval of the prediction accuracy of neural network training, we propose a neural network acceleration and compression method using the K-means algorithm. We reduce the amount of computation by compressing the input feature maps during convolution, and reduce storage by compressing the weights of the fully connected layers. The proposed method can reduce the computation of a single convolutional layer of AlexNet by up to 100 times. By adding appropriate K-means layers, the whole network achieves a speedup of 2.077 in processing time and is compressed to 8.7% of its original size.
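To illustrate the weight-compression side of this idea, the sketch below clusters the weights of a fully connected layer with K-means so that each weight is replaced by the index of its nearest centroid; storing a small codebook plus low-bit indices in place of full floats is what yields the storage reduction. This is a minimal illustration of weight clustering in general, not the paper's exact method; the function names, the layer size, and the choice of k are assumptions for the example.

```python
import numpy as np
from sklearn.cluster import KMeans

def kmeans_compress_weights(w, k):
    """Cluster all weights into k centroids; return the codebook and
    one small integer index per weight (the compressed representation)."""
    flat = w.reshape(-1, 1)                       # K-means over scalar weights
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(flat)
    codebook = km.cluster_centers_.ravel()        # k float centroids
    indices = km.labels_.astype(np.uint8)         # log2(k) bits needed per weight
    return codebook, indices.reshape(w.shape)

def decompress(codebook, indices):
    """Reconstruct an approximate weight matrix by codebook lookup."""
    return codebook[indices]

rng = np.random.default_rng(0)
w = rng.normal(size=(64, 64)).astype(np.float32)  # a toy FC weight matrix
codebook, idx = kmeans_compress_weights(w, k=16)
w_hat = decompress(codebook, idx)

# Storage: 16 floats + one 4-bit index per weight,
# versus one 32-bit float per weight originally.
```

With k = 16 each index needs only 4 bits, so the 64x64 layer shrinks from 4096 floats to 16 floats plus 4096 four-bit indices, roughly an 8x reduction, at the cost of quantization error in the reconstructed weights.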
 
 

Key words: neural network, confidence interval, acceleration, cluster compression