Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (05): 836-844.
Previous Articles Next Articles
NIE Lei1,2,LIU Bo1,2,LI Peng1,2,HE Heng1,2
Received:
Revised:
Accepted:
Online:
Published:
Abstract: How to select an access network in heterogeneous vehicular network environment is crucial for the service experience of vehicular terminal users. The current Q-learning based network selection method uses the interaction between the agent and the environment to iteratively learn network selection strategies and further realize better network resource allocation. However, this kind of methods usually have the problems of inefficient iterations and slow convergence caused by oversized state space. Besides, overestimations caused by the updates of Q tables lead to unreasonable utilization of network resources. Aiming at above problems, a Multi-agent Q-learning based Selection Method (MQSM) is proposed for heterogeneous vehicular network with 5G communication. The above method adopts the multi-agent cooperative learning idea and gets the total return value of action selection by alternate update of double Q tables. Finally, it achieves a long-term effective optimal network selection decision set in heterogeneous vehicular network environment. Experiment results show that, compared with similar methods, MQSM has better performance in terms of total system handovers, average discount values and network resource utilization.
Key words: multi-agent, Q-learning, network selection, heterogeneous vehicular network, 5G communication
NIE Lei, LIU Bo, LI Peng, HE Heng, . A multi-agent Q-learning based selection method for heterogeneous vehicular network[J]. Computer Engineering & Science, 2021, 43(05): 836-844.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2021/V43/I05/836