• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (12): 2128-2133.

• High Performance Computing • Previous Articles     Next Articles

A parallel Monte Carlo tree search algorithm for multi-agent game

GUAN Yan-xia1,LIU Xun-yun2,LIU Yun-tao1,Xie Min1,XU Xin-hai2   

  1. (1.College of Computer Science and Technology,National University of Defense Technology,Changsha 410073;
    2.War Research Institute,Academy of Military Sciences,Beijing 100091,China)
  • Received:2021-04-02 Revised:2021-09-24 Accepted:2022-12-25 Online:2022-12-25 Published:2022-12-25

Abstract: Monte Carlo tree search algorithm is a commonly used reinforcement learning algorithm, and the exponential growth of the dynamic space of the algorithm in the game process has become a factor that restricts the improvement of the algorithm learning efficiency. Based on the parallel approach to optimize the Monte Carlo tree search algorithm, a parallel Monte Carlo tree search algorithm based on the transfer of winning rate estimate is proposed. The improved parallel game search strategy framework consists of one main process and several sub-processes, in which the sub-processes are used for exploration, and the main process makes decisions according to the winning rate estimate data transmitted by the sub-processes. Combined with the multi-agent game platform Pommerman for experimental validation, the parallel Monte Carlo tree search algorithm can enhance the resource utilization rate, game-winning rate, and decision-making efficiency over the traditional Monte Carlo tree search algorithm.

Key words: multi-agent game, Pommerman, multi-process, parallel Monte Carlo tree search