基于RISC-V的图卷积神经网络加速器设计

计算机工程与科学 ›› 2023, Vol. 45 ›› Issue (12): 2113-2120.

基于RISC-V的图卷积神经网络加速器设计

周理，赵祉乔，潘国腾，铁俊波，赵王

（国防科技大学计算机学院，湖南长沙 410073）

收稿日期:2022-11-18 修回日期:2023-05-06 接受日期:2023-12-25 出版日期:2023-12-25 发布日期:2023-12-14

RISC-V based design of graph convolutional neural network accelerator

ZHOU Li,ZHAO Zhi-qiao,PAN Guo-teng,TIE Jun-bo,ZHAO Wang

(College of Computer Science and Technology，National University of Defense Technology,Changsha 410073,China)

Received:2022-11-18 Revised:2023-05-06 Accepted:2023-12-25 Online:2023-12-25 Published:2023-12-14

摘要/Abstract

摘要： 图卷积神经网络GCN当前主要在PyTorch等深度学习框架上基于GPU实现加速。然而GCN的运算过程包含多层嵌套的矩阵乘法和数据访存操作，使用GPU虽然可以满足实时性需求，但是部署代价大、能效比低。为了提高GCN算法的计算性能并保持软件灵活性，提出一种基于RSIC-V SoC的定制GCN加速器，在蜂鸟E203的SoC平台中通过点积运算扩展指令和硬件加速器软硬件协同的方法实现了针对GCN的加速，通过神经网络参数分析确定了从浮点数到32位定点数的硬件量化方案。实验结果表明，在Cora数据集上运行GCN算法时，该加速器没有精度损失，速度最高提高了6.88倍。

关键词: RISC-V, 图卷积神经网络, 硬件加速器, 指令集

Abstract: Graph Convolutional Networks (GCN), an algorithm for processing non-Euclidean data, is currently mainly implemented on deep learning frameworks such as PyTorch for GPU acceleration. GCN's computation process involves nested matrix multiplication and data access operations, which can be satisfied by GPU in real-time but have high deployment costs and low energy efficiency. To improve the computational performance of GCN algorithm while maintaining software flexibility, this paper proposes a custom GCN accelerator based on RSIC-V SoC, which extends the dot product operation and hardware accelerator through hardware-software co-design in the hummingbird E203 SoC platform. The neural network parameter analysis determines the hardware quantization scheme from floating point to 32-bit fixed point. Experimental results show that the proposed accelerator has no accuracy loss and can achieve a maximum speedup of 6.88 times when running GCN algorithm on Cora dataset.

Key words: RISC-V, graph convolutional neural network, hardware accelerator, instruction set

周理, 赵祉乔, 潘国腾, 铁俊波, 赵王. 基于RISC-V的图卷积神经网络加速器设计[J]. 计算机工程与科学, 2023, 45(12): 2113-2120.

ZHOU Li, ZHAO Zhi-qiao, PAN Guo-teng, TIE Jun-bo, ZHAO Wang. RISC-V based design of graph convolutional neural network accelerator[J]. Computer Engineering & Science, 2023, 45(12): 2113-2120.

编辑推荐

Metrics

阅读次数

全文

620

HTML			PDF

最新录用	在线预览	正式出版	最新录用	在线预览	正式出版
0	0	0	0	0	620

来源	本网站	其他网站

次数	511	109
比例	82%	18%

摘要

349

最新录用	在线预览	正式出版

0	0	349

	来源	本网站

	次数	349
	比例	100%

[1]	王洁, 付丹阳, . 基于RISC-V的超标量处理器的ROB压缩方法[J]. 计算机工程与科学, 2024, 46(07): 1185-1192.
[2]	李飞, 郭绍忠, 周蓓, 宋广辉, 郝江伟, 许瑾晨. RISC-V基础数学库性能优化[J]. 计算机工程与科学, 2023, 45(09): 1532-1543.
[3]	刘俊奇, 涂文轩, 祝恩. 图卷积神经网络综述[J]. 计算机工程与科学, 2023, 45(08): 1472-1481.
[4]	牟明任, 贾海鹏, 张云泉, 邓明森, 曲国远, 魏大洲, 张广婷. 基于ARM架构的中值滤波算法优化[J]. 计算机工程与科学, 2022, 44(10): 1738-1746.
[5]	陆松, 蒋句平, 任会峰. 基于FPGA快速实现定制化RISC-V处理器[J]. 计算机工程与科学, 2022, 44(10): 1747-1752.
[6]	单辉, 丁成鑫, 赵中英, 周明成, 贾霄生, 李超, . 基于多图卷积神经网络和注意力机制的学术新星预测方法[J]. 计算机工程与科学, 2022, 44(05): 924-932.
[7]	萧嘉乐, 梁东宝, 陈弟虎, 粟涛. 基于FPGA的高效可伸缩的MobileNet加速器实现[J]. 计算机工程与科学, 2021, 43(04): 628-633.
[8]	夏〓飞，窦〓勇，雷国庆. 基于FPGA的非编码RNA基因检测算法加速器研究[J]. J4, 2011, 33(12): 153-158.