Title :
Using Least-Square Policy Iteration to Online Optimize the Parameters of ALV´s Speed Controller
Author :
Qi Zhu ; Jian Wang ; Bin Dai ; Xin Xu
Author_Institution :
Inst. of Unmanned Syst., Nat. Univ. of Defense Tech, Changsha, China
Abstract :
Due to the highly non-linear properties of the longitudinal dynamics of autonomous land vehicles (ALVs), it is difficult to tune the parameters of a speed controller for the autonomous driving of ALVs. Aiming at this problem, in this paper, a novel learning-Based speed controller is proposed, which is composed of a time-varying proportional-integral (PI) control structure and a learning-Based learning module. A near-optimal policy is obtained by least-square policy iteration (LSPI), which is an approximate policy iteration method. The learning-Based module uses the near-optimal policy to realize online tuning of the PI coefficients. The simulation results show that the proposed controller can optimize the control performance by combining different non-optimal coefficients of the PI structure.
Keywords :
PI control; learning (artificial intelligence); mobile robots; vehicles; velocity control; ALV speed controller; LSPI; PI control structure; approximate policy iteration method; autonomous driving; autonomous land vehicles; highly nonlinear properties; learning-based speed controller; least-square policy iteration; longitudinal dynamics; near-optimal policy; online tuning; parameter optimisation; time-varying proportional-integral; Acceleration; Mathematical model; Optimization; Pi control; Vectors; Vehicles; Velocity control; LSPI; approximate policy iteration; autonomous land vehicle; reinforcement learning; speed control;
Conference_Titel :
Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2013 5th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-0-7695-5011-4
DOI :
10.1109/IHMSC.2013.205