• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

A RL-based command and control algorithm for
SoS confrontation simulation at the tactical level

YAN Xuefei,LI Xinming,LIU Dong,LIU Desheng,LI Qiang   

  1. (Laboratory of Science and Technology on Complex Electronic System Simulation,Equipment Academy,Beijing 101416,China)
  • Received:2017-04-24 Revised:2017-06-28 Online:2018-08-25 Published:2018-08-25

Abstract:

Aiming at the problem that the traditional cognition techniques are not adaptive to the uncertainty and complexity in the Weapon SystemofSystems (WSoS) confrontation environment, a Command and Control (C2) algorithm based on Reinforcement Learning (RL) is proposed for the WSoS confrontation simulation at the tactical level. The UML architecture of WSoS that consists of a communication class, scouting class, attacking class, command class, supplying class and repairing class is designed and the battle simulation platform with the battle scenario is introduced. Then, based on the illustration and hypothesis for the command agent’s cognition problem, the parameter’s normalization, the discrete of the Q table based on GRBF neural network, the strip temporal difference mechanism and the learning process of the structure of the network are explained for the improved Qleaning cognition algorithm. Finally, the validation and effectiveness of the algorithm is proved through the battle simulation experiment of the airground unify confrontation SoS. Besides, through a lot of visualization recall analysis for the C2 algorithm, we found that the coordination of the firepower and the continuous tactical maneuver are important to the operational effectiveness and injure decrease.
 

Key words: weapon system-of-systems, battle simulation, reinforcement learning, GRBF neural network, cognition and decision