DocumentCode :
1789175
Title :
Cooperative multi-agent traffic signal control system using fast gradient-descent function approximation for V2I networks
Author :
Weirong Liu ; Jing Liu ; Jun Peng ; Zhengfa Zhu
Author_Institution :
Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
fYear :
2014
fDate :
10-14 June 2014
Firstpage :
2562
Lastpage :
2567
Abstract :
The traffic signal control is the basic method to solve the urban congestions problem coming with the accelerating urbanization. For large city, it is challenge to improve the traffic signal control flexibility to adapt the real-time traffic change while to decrease the computation complexity. This paper proposes a cooperative Q-learning with function approximation(CQFA) algorithm for vehicle to infrastructure (V2I) networks. By gathering the local intersection traffic information from V2I networks and employing cooperative behaviors with neighboring intersections, the algorithm can achieve the optimal policy without any central supervising agents. To address the curse of dimensionality effectively, the Q-learning function is approximated by using a fast gradient-descent function approximation method to pick out the optimal Q-learning action. The Q-learning with Function Approximation algorithm combining the cooperative mechanism balances the urban traffic flow and uses approximating strategy to decrease the computation dimensionality. It can improve the traffic throughout, reduce the average waiting time and avoid congestions. Simulation results verify the effectiveness of the proposed algorithm.
Keywords :
computational complexity; cooperative systems; function approximation; gradient methods; learning (artificial intelligence); mobile radio; multi-agent systems; road traffic control; CQFA algorithm; V2I networks; average waiting time; central supervising agents; computation complexity; cooperative Q-learning with function approximation algorithm; cooperative behaviors; cooperative multiagent traffic signal control system; fast gradient-descent function approximation method; local intersection traffic information; neighboring intersections; optimal policy; real-time traffic change; urban congestion problem; urban traffic flow; vehicle to infrastructure networks; Approximation algorithms; Function approximation; Learning (artificial intelligence); Roads; Vehicles; Wireless communication;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (ICC), 2014 IEEE International Conference on
Conference_Location :
Sydney, NSW
Type :
conf
DOI :
10.1109/ICC.2014.6883709
Filename :
6883709
Link To Document :
بازگشت