مرکز منطقه ای اطلاع رساني علوم و فناوري - Kernel-Based Approximate Dynamic Programming for Real-Time Online Learning Control: An Experimental Study

DocumentCode :

4104

Title :

Kernel-Based Approximate Dynamic Programming for Real-Time Online Learning Control: An Experimental Study

Author :

Xin Xu ; Chuanqiang Lian ; Lei Zuo ; Haibo He

Author_Institution :

Inst. of Unmanned Syst., Nat. Univ. of Defense Technol., Changsha, China

Volume :

Issue :

fYear :

2014

fDate :

Jan. 2014

Firstpage :

146

Lastpage :

156

Abstract :

In the past decade, there has been considerable research interest in learning control methods based on reinforcement learning (RL) and approximate dynamic programming (ADP). As an important class of function approximation techniques, kernel methods have been recently applied to improve the generalization ability of RL and ADP methods but most previous works were only based on simulation. This paper focuses on experimental studies of real-time online learning control for nonlinear systems using kernel-based ADP methods. Specifically, the kernel-based dual heuristic programming (KDHP) method is applied and tested on real-time control systems. Two kernel-based online learning control schemes are presented for uncertain nonlinear systems by using simulation data and online sampling data, respectively. Learning control experiments were performed on a single-link inverted pendulum system as well as a double-link inverted pendulum system. From the experimental results, it is shown that both online learning control schemes, either using simulation data or using real sampling data, are effective for approximating near-optimal control policies of nonlinear dynamical systems with model uncertainties. In addition, it is demonstrated that KDHP can achieve better performance than conventional DHP, which uses multilayer perceptron neural networks.

Keywords :

Markov processes; dynamic programming; function approximation; generalisation (artificial intelligence); learning (artificial intelligence); learning systems; nonlinear control systems; optimal control; pendulums; sampling methods; uncertain systems; KDHP method; RL; double-link inverted pendulum system; function approximation techniques; generalization ability; kernel-based ADP methods; kernel-based approximate dynamic programming; kernel-based dual heuristic programming; learning control methods; model uncertainties; multilayer perceptron neural networks; near-optimal control policies; realtime online learning control; reinforcement learning; sampling data; single-link inverted pendulum system; uncertain nonlinear systems; Approximate dynamic programming (ADP); Markov decision processes (MDPs); learning control; online learning; reinforcement learning (RL);

fLanguage :

English

Journal_Title :

Control Systems Technology, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6536

Type :

jour

DOI :

10.1109/TCST.2013.2246866

Filename :

6492103

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=4104