مرکز منطقه ای اطلاع رساني علوم و فناوري - Self-Learning Cruise Control Using Kernel-Based Least Squares Policy Iteration

DocumentCode :

1757948

Title :

Self-Learning Cruise Control Using Kernel-Based Least Squares Policy Iteration

Author :

Jian Wang ; Xin Xu ; Daxue Liu ; Zhenping Sun ; Qingyang Chen

Author_Institution :

Coll. of Mechatron. & Autom., Nat. Univ. of Defense Technol., Changsha, China

Volume :

Issue :

fYear :

2014

fDate :

41760

Firstpage :

1078

Lastpage :

1087

Abstract :

This paper presents a novel learning-based cruise controller for autonomous land vehicles (ALVs) with unknown dynamics and external disturbances. The learning controller consists of a time-varying proportional-integral (PI) module and an actor-critic learning control module with kernel machines. The learning objective for the cruise control is to make the vehicle´s longitudinal velocity follow a smoothed spline-based speed profile with the smallest possible errors. The parameters in the PI module are adaptively tuned based on the vehicle´s state and the action policy of the learning control module. Based on the state transition data of the vehicle controlled by various initial policies, the action policy of the learning control module is optimized by kernel-based least squares policy iteration (KLSPI) in an offline way. The effectiveness of the proposed controller was tested on an ALV platform during long-distance driving in urban traffic and autonomous driving on off-road terrain. The experimental results of the cruise control show that the learning control method can realize data-driven controller design and optimization based on KLSPI and that the controller´s performance is adaptive to different road conditions.

Keywords :

PI control; adaptive control; control system synthesis; dynamic programming; iterative methods; least squares approximations; mobile robots; off-road vehicles; road traffic; road vehicles; time-varying systems; unsupervised learning; velocity control; actor-critic learning control module; autonomous driving; autonomous land vehicles; kernel machines; kernel-based least squares policy iteration; learning control module action policy; long-distance driving; off-road terrain; self-learning cruise control; smoothed spline-based speed profile; time-varying proportional-integral module; urban traffic; vehicle control; vehicle longitudinal velocity; Acceleration; Function approximation; Kernel; Polynomials; Splines (mathematics); Tuning; Vehicles; Approximate dynamic programming (ADP); autonomous land vehicle (ALV); cruise control; kernel-based least squares policy iteration (KLSPI); reinforcement learning; speed control; speed control.;

fLanguage :

English

Journal_Title :

Control Systems Technology, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6536

Type :

jour

DOI :

10.1109/TCST.2013.2271276

Filename :

6584761

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1757948