DocumentCode
328301
Title
A new approach of adaptive reinforcement learning control
Author
Yang, Boo-Ho ; Asada, Haruhiko
Author_Institution
Dept. of Mech. Eng., MIT, Cambridge, MA, USA
Volume
1
fYear
1993
fDate
25-29 Oct. 1993
Firstpage
627
Abstract
A new learning algorithm for connectionist networks that solves a class of optimal control problems is presented. The algorithm, called adaptive reinforcement learning algorithm, employs a second network to model immediate reinforcement provided from the task environment and adaptively identify it through experience. Output perturbation and correlation techniques are used to translate mere critic signals into useful learning signals for the connectionist controller. Compared with the direct approaches of reinforcement learning, this algorithm shows faster and guaranteed improvement in the control performance. Robustness against inaccuracy of the model is also discussed.
Keywords
adaptive control; correlation methods; intelligent control; learning (artificial intelligence); neural nets; optimal control; adaptive reinforcement learning control; connectionist networks; critic signals; neural nets; optimal control; output correlation; output perturbation; Adaptive control; Control systems; Force measurement; Force sensors; Learning; Mechanical engineering; Mechanical systems; Optimal control; Programmable control; Robotic assembly;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 1993. IJCNN '93-Nagoya. Proceedings of 1993 International Joint Conference on
Print_ISBN
0-7803-1421-2
Type
conf
DOI
10.1109/IJCNN.1993.713993
Filename
713993
Link To Document