DocumentCode
2314589
Title
Online learning control by association and reinforcement
Author
Si, Jennie ; Wang, Yu-tsung
Author_Institution
Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ, USA
Volume
3
fYear
2000
fDate
2000
Firstpage
221
Abstract
This paper focuses on a systematic treatment for developing a generic online learning control system based on the fundamental principle of reinforcement learning or more specifically neuro-dynamic programming. This real time learning system improves its performance over time in two aspects: it learns from its own mistakes through the reinforcement signal from the external environment and try to reinforce its action to improve future performance; and system´s state associated with the positive reinforcement is memorized through a network learning process where in the future, similar states will be more positively associated with a control action leading to a positive reinforcement. Two successful candidates of online learning control designs are introduced. Real time learning algorithms can be derived for individual components in the learning system. Some analytical insights are provided to give some guidelines on the entire online learning control system
Keywords
dynamic programming; learning (artificial intelligence); neurocontrollers; real-time systems; self-organising feature maps; adaptive critic network; learning by association; neural nets; neural-dynamic programming; online learning control; real time system; reinforcement learning; self organising map; Control design; Control systems; Dynamic programming; Guidelines; Learning systems; Optimal control; Real time systems; System performance; System testing; Velocity measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
Conference_Location
Como
ISSN
1098-7576
Print_ISBN
0-7695-0619-4
Type
conf
DOI
10.1109/IJCNN.2000.861307
Filename
861307
Link To Document