• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Optimal strategy planning of BDI agent based on
 Q-learning in uncertain environments

WAN Qian1,2,LIU Wei1,2,XU Longlong1,2,GUO Jingzhi1,2   

  1. (1.School of Computer Science and Engineering,Wuhan Institute of Technology,Wuhan 430073;
    2.Hubei Provincial Key Laboratory of Intelligent Robot,Wuhan 430073,China)

     
  • Received:2018-06-05 Revised:2018-08-13 Online:2019-01-25 Published:2019-01-25

Abstract:

The belief-desire-intention (BDI) model can solve the problem of reasoning and decision-making of agents in a particular environment, but lacks the ability of decision-making and learning in dynamic and uncertain environments. Reinforcement learning solves the decision-making problem of agent in unknown environments, but lacks the rule description and logical reasoning of the BDI model. Aiming at the strategic planning problem of the BDI in the unknown and dynamic environment, we propose an optimal strategy planning method based on Q-learning algorithm of reinforcement learning. And we make  improvement for the decision-making mechanism on the implementation model of the BDI—agent speak language (ASL). Finally, the simulation of the maze on the ASL simulation platform Jason proves the feasibility of this method, and the new agent model can fulfill tasks in uncertain environments.
 

Key words: BDI agent, reinforcement learning, Q-learning, ASL, Jason, planning