Title :
Associative reinforcement learning for discrete-time optimal control
Author :
Howell, M.N. ; Gordon, T.J.
Author_Institution :
Dept. of Aeronaut. & Autom. Eng., Loughborough Univ. of Technol., UK
Abstract :
This paper investigates the application of associative reinforcement learning techniques to the optimal control of linear discrete-time dynamic systems. Associative reinforcement learning involves the trial and error interaction with a dynamic system to determine the control actions that optimally achieve some desired performance index. The methodology can be applied either online or off-line and in a model based or model free manner. Associative reinforcement learning techniques are applied to the optimal regulator (LQR) control of discrete-time linear systems. Adaptive critic designs are implemented and the convergence speed compared for the different approaches. These methods can determine the optimal state and state/action value function and the optimal policy without requiring system models
Keywords :
linear systems; LQR control; adaptive critic designs; associative reinforcement learning; convergence speed; discrete-time optimal control; dynamic system; linear discrete-time dynamic systems; optimal control; optimal regulator control; performance index; state/action value function; trial-and-error interaction;
Conference_Titel :
Learning Systems for Control (Ref. No. 2000/069), IEE Seminar
Conference_Location :
Birmingham
DOI :
10.1049/ic:20000342