DocumentCode :
345638
Title :
Neuro-dynamic programming based on self-organized patterns
Author :
Si, Jennie ; Wang, Yu-tsung
Author_Institution :
Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ, USA
fYear :
1999
fDate :
1999
Firstpage :
120
Lastpage :
125
Abstract :
This paper introduces a real-time learning control mechanism, as a robust and efficient scheme of neuro-dynamic programming. The objective of the learning controller is to optimize a certain performance measure by learning to create appropriate control actions through interacting with the environment. The controller is set out to learn to perform better over time starting with no prior knowledge about the system. The system under consideration does not render a complete system model describing its behaviors. Instead, real-time sampled measurements are available to the designer. The state measurements are first analyzed by similarity and organized by proximity. Control actions are then generated in relevance to the state patterns. A critic network serves the purpose of `monitoring´ the performance of the controller to achieve a given optimality. We provide detailed implementation, and performance evaluations of this learning controller in a cart-pole balancing problem
Keywords :
dynamic programming; learning systems; neurocontrollers; real-time systems; self-adjusting systems; cart-pole balancing; learning control; neurocontrol; neurodynamic programming; performance evaluations; real-time systems; self-organising systems; Artificial neural networks; Computational efficiency; Computational modeling; Control systems; Dynamic programming; Functional programming; Neurofeedback; Operations research; Optimal control; Robust control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control/Intelligent Systems and Semiotics, 1999. Proceedings of the 1999 IEEE International Symposium on
Conference_Location :
Cambridge, MA
ISSN :
2158-9860
Print_ISBN :
0-7803-5665-9
Type :
conf
DOI :
10.1109/ISIC.1999.796641
Filename :
796641
Link To Document :
بازگشت