Design and simulation of a deep learning SoC architecture

Abstract

The explosive growth of information volume in the Internet era and the spread of deep learning have left traditional general-purpose computing unable to meet large-scale, high-concurrency computing requirements. Heterogeneous computing can unlock more computing power for deep learning, satisfy higher performance requirements, and serve a wider range of computing scenarios. We design and simulate a complete heterogeneous SoC architecture for deep learning. First, we analyze the computational features of commonly used deep learning algorithms such as GoogLeNet, VGG, and SSD, and group them into a small set of common deep learning operator classes, which are presented in charts and structure diagrams. We also generate a pseudo-instruction stream at the level of the smallest operators. Then, based on the extracted algorithm features, we design a hardware-accelerated AI IP core for deep learning and build a heterogeneous computing SoC architecture around it. Finally, experimental verification on a simulation modeling platform shows that the performance-to-power ratio of the SoC system is greater than 1.5 TOPS/W. Ten channels of 1080p, 30 fps video can be processed frame by frame with the GoogLeNet algorithm, and the end-to-end processing time of each frame is no more than 30 ms.
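As a rough sanity check on the reported video workload, the arithmetic behind "10 channels, 30 fps, at most 30 ms per frame" can be sketched as follows. This is an illustrative calculation only and is not taken from the paper: the function name `stream_budget` and its return keys are hypothetical, and the sketch assumes steady-state arrival of frames and treats 30 ms as the worst-case latency for every frame. It says nothing about how the SoC is actually pipelined.

```python
import math


def stream_budget(channels: int, fps: float, latency_ms: float) -> dict:
    """Back-of-the-envelope throughput figures for a multi-channel video load.

    Hypothetical helper for illustration; not part of the described SoC.
    """
    # Total frames arriving per second across all channels.
    aggregate_fps = channels * fps
    # Time between consecutive frames on a single channel.
    frame_interval_ms = 1000.0 / fps
    # Time per frame if frames were handled strictly one at a time.
    serial_budget_ms = 1000.0 / aggregate_fps
    # Little's law: average frames in flight = arrival rate * latency.
    in_flight = aggregate_fps * latency_ms / 1000.0
    return {
        "aggregate_fps": aggregate_fps,
        "frame_interval_ms": frame_interval_ms,
        "serial_budget_ms": serial_budget_ms,
        "min_frames_in_flight": math.ceil(in_flight),
    }


if __name__ == "__main__":
    # Figures from the abstract: 10 channels, 30 fps, 30 ms per frame.
    for key, value in stream_budget(10, 30, 30).items():
        print(f"{key}: {value}")
```

Under these assumptions the aggregate load is 300 frames per second. A 30 ms latency is shorter than the roughly 33.3 ms interval between frames on one channel, so each frame can finish before the next frame of the same channel arrives. A strictly serial design would have only about 3.3 ms per frame, so a 30 ms per-frame latency implies about nine frames being processed concurrently on average.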

Key words: heterogeneous computing, deep learning, acceleration unit, simulation modeling

CUI Haoran，LI Han，FENG Yujing，WU Meng，WANG Chao，TAO Guanliang，ZHANG Zhimin.

Design and simulation of a deep learning SoC architecture

[J]. Computer Engineering & Science.

