• DocumentCode
    1789175
  • Title

    Cooperative multi-agent traffic signal control system using fast gradient-descent function approximation for V2I networks

  • Author

    Weirong Liu ; Jing Liu ; Jun Peng ; Zhengfa Zhu

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
  • fYear
    2014
  • fDate
    10-14 June 2014
  • Firstpage
    2562
  • Lastpage
    2567
  • Abstract
    The traffic signal control is the basic method to solve the urban congestions problem coming with the accelerating urbanization. For large city, it is challenge to improve the traffic signal control flexibility to adapt the real-time traffic change while to decrease the computation complexity. This paper proposes a cooperative Q-learning with function approximation(CQFA) algorithm for vehicle to infrastructure (V2I) networks. By gathering the local intersection traffic information from V2I networks and employing cooperative behaviors with neighboring intersections, the algorithm can achieve the optimal policy without any central supervising agents. To address the curse of dimensionality effectively, the Q-learning function is approximated by using a fast gradient-descent function approximation method to pick out the optimal Q-learning action. The Q-learning with Function Approximation algorithm combining the cooperative mechanism balances the urban traffic flow and uses approximating strategy to decrease the computation dimensionality. It can improve the traffic throughout, reduce the average waiting time and avoid congestions. Simulation results verify the effectiveness of the proposed algorithm.
  • Keywords
    computational complexity; cooperative systems; function approximation; gradient methods; learning (artificial intelligence); mobile radio; multi-agent systems; road traffic control; CQFA algorithm; V2I networks; average waiting time; central supervising agents; computation complexity; cooperative Q-learning with function approximation algorithm; cooperative behaviors; cooperative multiagent traffic signal control system; fast gradient-descent function approximation method; local intersection traffic information; neighboring intersections; optimal policy; real-time traffic change; urban congestion problem; urban traffic flow; vehicle to infrastructure networks; Approximation algorithms; Function approximation; Learning (artificial intelligence); Roads; Vehicles; Wireless communication;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (ICC), 2014 IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • Type

    conf

  • DOI
    10.1109/ICC.2014.6883709
  • Filename
    6883709