[1] |
Petrie C.Agentbased software engineering[M]∥
|
|
Agentoriented software engineering.Berlin:SpringerVerlag,2001:5975.
|
[2] |
Yan Yuejin, Li Zhoujun, Chen Yuexin.Multiagent system architecture[J].Computer Science,2001,28(5):7780.(in Chinese)
|
[3] |
Chen Mei,Hu Xiaohui.BDI Agent action planning mechanism based on reinforcement learning[J].Computer Engineering and Design,2011,32(3):10431046.(in Chinese)
|
[4] |
Yang Fangqiong.Multisensor information fusion for positioning and navigation for mobile robot[D].Changsha:Central South University,2010.(in Chinese)
|
[5] |
Ancona D,Mascardi V.CooBDI:Extending the BDI model with cooperativity[C]∥Proc of International Workshop on Declarative Agent Languages and Technologies,2003:109134.
|
[6] |
Mcgeary F,Decker K.Modeling a virtual food court using DECAF[C]∥Proc of the 2nd International Workshop on MultiAgentBased Simulation,2000:6881.
|
[7] |
Burgemeestre B C,Hulstijn J,Tan Y H.Towards an architecture for selfregulating agents:A case study in international trade[C]∥
|
|
Proc of the 5th International Conference on Coordination,Organizations,Institutions and Norms in Agent Systems V,2010:320333.
|
[8] |
Pokahr A,Braubach L,Lamersdorf W.Jadex:A BDI reasoning engine[M]∥MultiAgent Programming.New York:Springer US,2005:149174.
|
[9] |
Bordini R H,Hübner J F,Wooldridge M.Programming multiagent systems in AgentSpeak using Jason[M]∥
|
|
Chichester:Wiley Publishing,2008.
|
[10] |
Sutton R S,Barto A G.Reinforcement learning:An introduction,bradford book[J].IEEE Transactions on Neural Networks,2005,16(1):285286.
|
[11] |
Schwartz H M.Multiagent machine learning:A reinforcement approach[J].Journal of Cellence,2014,103(6):989998.
|
[12] |
Watkins C J C H,Dayan P.Technical note:Qlearning[J].Machine Learning,1992,8(34):279292.
|
[13] |
Xu Shuang,Jia Yunde.Intention tracking based reinforcement learning agent model[J].Journal of Beijing Institute of Technology,2004,24(8):679682.(in Chinese)
|
[14] |
Liu Xinyu,Hong Bingrong.A multiagent dynamic cooperation model based on BDI framework and its application[J].Journal of Computer Research and Development,2002,39(7):797801.(in Chinese)
|
[15] |
Rabinowitz N C,Perbet F,Song H F,et al.Machine theory of mind[EB/OL].[20180517].https://arxiv.org/abs/1802.07740.
|
[16] |
Feliu J L. Use of reinforcement learning (RL) for plan generation in beliefdesireintention (BDI) agent systems
|
[D] |
US:University of Rhode Island,2013.
|
[17] |
Broekens J,Hindriks K,Wiggers P.Reinforcement learning as heuristic for actionrule preferences[C]∥
|
|
Proc of the 8th International Conference on Programming MultiAgent Systems,2010:2540.
|
[18] |
Badica A,Badica C,Ivanovic M,et al.An approach of temporal difference learning using agentoriented programming[C]∥Proc of IEEE International Conference on Control Systems and Computer Science,2015:735742.
|
[19] |
Li G,Whiteson S,Knox W B,et al.Social interaction for efficient agent learning from human reward[J].Autonomous Agents and MultiAgent Systems,2018,32(1):125.
|
[20] |
Guo Yan.The research and development of agentbased modeling approach[EB/OL].[20100329].
|
|
http://www.paper.edu.cn/releasepaper/content/201003982.(in Chinese)
|
[21] |
Morreale V,Bonura S,Francaviglia G.Goaloriented development of BDI [C]∥Proc of the IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2006:7172.
|
[22] |
Jason: A javabased interpreter for an extended version of AgentSpeak[EB/OL].[20180517].http://jason.sourceforge.net/.
|
[23] |
Habib A,Khan M I,Jia U.Optimal route selection in complex multistage supply chain networks using SARSA(λ)[C]∥Proc of IEEE International Conference on Computer and Information Technology,2017:170175.
|
[24] |
Píbil R,Novák P,Brom C,et al.Notes on pragmatic agent
|
|
programming with Jason[C]∥Proc of the 9th International Workshop on Programming MultiAgent Systems,2011:5873.
|
[25] |
Reinforcement learning through asynchronous advantage actorcritic on a GPU[EB/OL].[20180517].http://cn.arxiv.org/abs/1611.06256.
|
[26] |
Hong Changhao.Research on multiagent rescue simulation system[D].Harbin:Harbin Engineering University,2011.(in Chinese)
|
|
附中文参考文献:
|
[2] |
颜跃进,李舟军,陈跃新.多Agent系统体系结构[J].计算机科学,2001,28(5):7780.
|
[3] |
陈梅,胡晓辉.基于加强学习的BDI Agent动作规划机制[J].计算机工程与设计,2011,32(3):10431046.
|
[4] |
杨放琼.基于信息融合的移动机器人定位导航及其深海采矿应用研究[D].长沙:中南大学,2010.
|
[13] |
续爽,贾云得.一种基于意图跟踪和强化学习的agent模型[J].北京理工大学学报,2004,24(8):679682.
|
[14] |
刘新宇,洪炳镕.基于BDI框架的多Agent动态协作模型与应用研究[J].计算机研究与发展,2002,39(7):797801.
|
[20] |
郭雁. 基于Agent的建模方法的研究与开发[EB/OL].[20100329].http://www.paper.edu.cn/releasepaper/content/201003982.
|
[26] |
洪长昊.多智能体救援仿真系统研究[D].哈尔滨:哈尔滨工程大学,2011.
|