A parallel Monte Carlo tree search algorithm for multi-agent game

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (12): 2128-2133.

• High Performance Computing • Previous Articles Next Articles

A parallel Monte Carlo tree search algorithm for multi-agent game

GUAN Yan-xia1,LIU Xun-yun2,LIU Yun-tao1,Xie Min1,XU Xin-hai2

(1.College of Computer Science and Technology,National University of Defense Technology,Changsha 410073;
2.War Research Institute,Academy of Military Sciences,Beijing 100091,China)

Received:2021-04-02 Revised:2021-09-24 Accepted:2022-12-25 Online:2022-12-25 Published:2022-12-25

Abstract

Abstract: Monte Carlo tree search algorithm is a commonly used reinforcement learning algorithm, and the exponential growth of the dynamic space of the algorithm in the game process has become a factor that restricts the improvement of the algorithm learning efficiency. Based on the parallel approach to optimize the Monte Carlo tree search algorithm, a parallel Monte Carlo tree search algorithm based on the transfer of winning rate estimate is proposed. The improved parallel game search strategy framework consists of one main process and several sub-processes, in which the sub-processes are used for exploration, and the main process makes decisions according to the winning rate estimate data transmitted by the sub-processes. Combined with the multi-agent game platform Pommerman for experimental validation, the parallel Monte Carlo tree search algorithm can enhance the resource utilization rate, game-winning rate, and decision-making efficiency over the traditional Monte Carlo tree search algorithm.

Key words: multi-agent game, Pommerman, multi-process, parallel Monte Carlo tree search

GUAN Yan-xia, LIU Xun-yun, LIU Yun-tao, Xie Min, XU Xin-hai. A parallel Monte Carlo tree search algorithm for multi-agent game[J]. Computer Engineering & Science, 2022, 44(12): 2128-2133.

[1]	TANG Zhu, CHEN Baohai, WANG Jingyu, ZHU Qi. OpenOCD debugging optimization for isomorphic asymmetric multi-core architecture [J]. Computer Engineering & Science, 2025, 47(01): 45-55.
[2]	LI Hui, JU Peng-jin, JI Yong-xing. Error tracing and location technology in multi-processor cache coherence verification [J]. Computer Engineering & Science, 2022, 44(07): 1171-1180.
[3]	WU Jianguo，CHEN Haiyan，LIU Sheng，DENG Rangyu，CHEN Junjie. A survey of performance improvement methods for multi-core cache sparse directory [J]. Computer Engineering & Science, 2019, 41(03): 385-392.
[4]	. [J]. J4, 2008, 30(9): 25-28.

A parallel Monte Carlo tree search algorithm for multi-agent game

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 4

Recommended Articles

Metrics

Comments