Optimal strategy planning of BDI agent based on
 Q-learning in uncertain environments

Computer Engineering & Science

Previous Articles Next Articles

Optimal strategy planning of BDI agent based on

Q-learning in uncertain environments

WAN Qian1,2,LIU Wei1,2,XU Longlong1,2,GUO Jingzhi1,2

（1.School of Computer Science and Engineering,Wuhan Institute of Technology,Wuhan 430073;

2.Hubei Provincial Key Laboratory of Intelligent Robot,Wuhan 430073,China）

Received:2018-06-05 Revised:2018-08-13 Online:2019-01-25 Published:2019-01-25

Abstract

Abstract:

The belief-desire-intention (BDI) model can solve the problem of reasoning and decision-making of agents in a particular environment, but lacks the ability of decision-making and learning in dynamic and uncertain environments. Reinforcement learning solves the decision-making problem of agent in unknown environments, but lacks the rule description and logical reasoning of the BDI model. Aiming at the strategic planning problem of the BDI in the unknown and dynamic environment, we propose an optimal strategy planning method based on Q-learning algorithm of reinforcement learning. And we make improvement for the decision-making mechanism on the implementation model of the BDI—agent speak language (ASL). Finally, the simulation of the maze on the ASL simulation platform Jason proves the feasibility of this method, and the new agent model can fulfill tasks in uncertain environments.

Key words: BDI agent, reinforcement learning, Q-learning, ASL, Jason, planning

WAN Qian1,2,LIU Wei1,2,XU Longlong1,2,GUO Jingzhi1,2.

Optimal strategy planning of BDI agent based on

Q-learning in uncertain environments

[J]. Computer Engineering & Science.

[1]	ZHANG Zheng, XIA Xiaoyun, CHEN Zefeng, XIANG Yi. A staged strategy incorporating reinforcement learning to solve the travelling thief problem [J]. Computer Engineering & Science, 2025, 47(01): 140-149.
[2]	YU Shirui, JIANG Chunmao. A cloud computing virtual machine scheduling strategy based on fuzzy reinforcement learning [J]. Computer Engineering & Science, 2025, 47(01): 56-65.
[3]	DUAN Cheng-long, YUAN Jie, CHANG Qian-kun, ZHANG Ning-ning. Inverse reinforcement learning algorithm based on D2GA [J]. Computer Engineering & Science, 2024, 46(11): 2053-2062.
[4]	GU Ying-cheng, WEI Liu, JIANG Ning, CHENG Huan-yu, LIU Kai, SONG Yu, LIU Mei-zhao, TANG Lei, CHEN Yu, ZHANG Sheng. Edge server assignment for distributed interactive applications in edge environments [J]. Computer Engineering & Science, 2024, 46(10): 1748-1756.
[5]	CAI Yu, GUAN Zheng, WANG Zeng-wen, WANG Xue, YANG Zhi-jun. Resource allocation algorithm for distinguished services in vehicular networks based on multi-agent deep reinforcement learning [J]. Computer Engineering & Science, 2024, 46(10): 1757-1764.
[6]	AN Yuan-yuan, MA Xiao-ning. Flight path planning based on improved genetic algorithm and multi-objective optimization model [J]. Computer Engineering & Science, 2024, 46(09): 1660-1666.
[7]	Lv Qian-ru, YANG Xiang-rui, CAI Zhi-ping. A computer wargame path planning method based on influence map [J]. Computer Engineering & Science, 2024, 46(06): 1041-1049.
[8]	ZHUANG Shu-xin, CHEN Yong-hong, HAO Yi-hang, WU Wei-wei, XU Xue-yong, WANG Wan-yuan. A population diversity-based robust policy generation method in adversarial game environments#br# [J]. Computer Engineering & Science, 2024, 46(06): 1081-1091.
[9]	SHEN Ke-yu, YOU Zhi-yu, LIU Yong-xin. A multi-scene adaptive A* algorithm based on fitting-first search [J]. Computer Engineering & Science, 2024, 46(01): 142-149.
[10]	LI Zhong-hua, YUAN Jie, GUO Zhen-yu. Robot path planning of goal-directed Bi-RRT based on information inspiration [J]. Computer Engineering & Science, 2023, 45(12): 2237-2245.
[11]	ZHANG Bei, MIN Hua-song, ZHANG Xin-ming. A differential mutation and territorial search equilibrium optimizer and its application in robot path planning [J]. Computer Engineering & Science, 2023, 45(11): 2078-2090.
[12]	ZENG Fan-feng, WANG Chun-zhen, LI Chen. An unsupervised video summarization algorithm based on deep and shallow feature fusion [J]. Computer Engineering & Science, 2023, 45(09): 1602-1610.
[13]	ZHANG Zhi-yuan, CHEN Hai-jin, ZHANG Yi-ming. An optimized A* algorithm based on local obstacle rate pre-acquisition and bidirectional parent node change [J]. Computer Engineering & Science, 2023, 45(09): 1661-1669.
[14]	YU Jia-bin, CHEN Zhi-hao, DENG Wei, XU Ji-ping, ZHAO Zhi-yao, WANG Xiao-yi. A traversal multi-target path planning algorithm for unmanned cruise ship [J]. Computer Engineering & Science, 2023, 45(05): 840-848.
[15]	WANG Yang, CHEN Zhi-bin. A dynamic graph transformer model for solving CVRP [J]. Computer Engineering & Science, 2023, 45(05): 859-868.

Optimal strategy planning of BDI agent based on

Q-learning in uncertain environments

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 15

Recommended Articles 0

Metrics

Comments