A containerization method for reinforcement learning based on RISC-V architecture

Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (2): 266-273.

XU Zi-chen, CUI Ao, WANG Yu-hao, LIU Tao

(School of Information Engineering, Nanchang University, Nanchang 330031, China)

Received: 2020-05-03 Revised: 2020-07-06 Online: 2021-02-25 Published: 2021-02-23

Funding:
National Natural Science Foundation of China (61702250); National Key R&D Program of China (2018YFB14043033); National Science and Technology Major Project "Core Electronic Devices, High-end Generic Chips and Basic Software" (2018ZX01035-101); Open Project of the State Key Laboratory of Computer Architecture, Chinese Academy of Sciences (CARCHB202017)

Abstract

Abstract: As the most popular open-source instruction set architecture in recent years, RISC-V is widely used in domain-specific microprocessors, especially for modular customization in machine learning. However, existing RISC-V applications require legacy software or models to be recompiled or optimized for the RISC-V instruction set, so how to rapidly deploy, run, and test machine learning frameworks on RISC-V architectures is a pressing technical challenge. Virtualization can solve the problem of deploying and running models across platforms. However, traditional virtualization techniques such as virtual machines place high performance demands on the native system, occupy many resources, and respond slowly, so they are often unsuitable for RISC-V application scenarios. This paper discusses the virtualization of reinforcement learning on resource-constrained RISC-V architectures. Firstly, containerization is adopted to reduce the virtualization cost of building upper-level software, remove redundant middleware, and customize namespaces to isolate specific processes, which effectively improves the resource utilization of learning tasks and enables fast execution of model training. Secondly, the features of the RISC-V instruction set are used to further optimize the upper-level neural network model and improve reinforcement learning efficiency. Finally, a system prototype of the overall optimization and containerization method is implemented, and its performance is evaluated on multiple benchmark suites. Compared with the traditional method of cross-compiling deep neural network models for the RISC-V architecture, containerization incurs only a relatively small additional performance cost while enabling the rapid deployment and operation of more, and more complex, deep learning software frameworks. Compared with the hypervisor virtual machine method, the RISC-V based model has a similar deployment time and substantially less performance loss. Preliminary experimental results show that containerization, together with the optimizations built on it, is an effective way to rapidly deploy software and learning models on the RISC-V architecture.

Key words: virtualization, neural network, RISC-V

XU Zi-chen, CUI Ao, WANG Yu-hao, LIU Tao. A containerization method for reinforcement learning based on RISC-V architecture[J]. Computer Engineering & Science, 2021, 43(2): 266-273.
