DocumentCode :
4104
Title :
Kernel-Based Approximate Dynamic Programming for Real-Time Online Learning Control: An Experimental Study
Author :
Xin Xu ; Chuanqiang Lian ; Lei Zuo ; Haibo He
Author_Institution :
Inst. of Unmanned Syst., Nat. Univ. of Defense Technol., Changsha, China
Volume :
22
Issue :
1
fYear :
2014
fDate :
Jan. 2014
Firstpage :
146
Lastpage :
156
Abstract :
In the past decade, there has been considerable research interest in learning control methods based on reinforcement learning (RL) and approximate dynamic programming (ADP). As an important class of function approximation techniques, kernel methods have been recently applied to improve the generalization ability of RL and ADP methods but most previous works were only based on simulation. This paper focuses on experimental studies of real-time online learning control for nonlinear systems using kernel-based ADP methods. Specifically, the kernel-based dual heuristic programming (KDHP) method is applied and tested on real-time control systems. Two kernel-based online learning control schemes are presented for uncertain nonlinear systems by using simulation data and online sampling data, respectively. Learning control experiments were performed on a single-link inverted pendulum system as well as a double-link inverted pendulum system. From the experimental results, it is shown that both online learning control schemes, either using simulation data or using real sampling data, are effective for approximating near-optimal control policies of nonlinear dynamical systems with model uncertainties. In addition, it is demonstrated that KDHP can achieve better performance than conventional DHP, which uses multilayer perceptron neural networks.
Keywords :
Markov processes; dynamic programming; function approximation; generalisation (artificial intelligence); learning (artificial intelligence); learning systems; nonlinear control systems; optimal control; pendulums; sampling methods; uncertain systems; KDHP method; RL; double-link inverted pendulum system; function approximation techniques; generalization ability; kernel-based ADP methods; kernel-based approximate dynamic programming; kernel-based dual heuristic programming; learning control methods; model uncertainties; multilayer perceptron neural networks; near-optimal control policies; realtime online learning control; reinforcement learning; sampling data; single-link inverted pendulum system; uncertain nonlinear systems; Approximate dynamic programming (ADP); Markov decision processes (MDPs); learning control; online learning; reinforcement learning (RL);
fLanguage :
English
Journal_Title :
Control Systems Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6536
Type :
jour
DOI :
10.1109/TCST.2013.2246866
Filename :
6492103
Link To Document :
بازگشت