DocumentCode
2224464
Title
Q-learning automaton
Author
Qian, Fei ; Hirata, Hironori
Author_Institution
Eng. Fac., Hiroshima Kokusai Gakuin Univ., Japan
fYear
2003
fDate
13-16 Oct. 2003
Firstpage
432
Lastpage
437
Abstract
Reinforcement learning is the problem faced by a controller that must learn behavior through trial and error interactions with a dynamic environment. The controller´s goal is to maximize reward over time, by producing an effective mapping of states of actions called policy. To construct the model of such systems, we present a generalized learning automaton approach with Q-learning behaviors. Compared to Q-learning, the computational experiments of the pursuit problems show that the proposed reinforcement scheme obtains better results in terms of convergence speed and memory size.
Keywords
learning automata; multi-agent systems; stochastic automata; Q-learning automaton; Q-learning behaviors; adaptive evolutionary mechanism; computational experiments; control units; convergence speed; dynamic environment; environment information; generalized learning automaton approach; memory size; multiagent coordinative problem solving; multiagent reinforcement system; optimization problems; random environment state; reinforcement learning; reinforcement scheme; self-consistency space; Automatic control; Context modeling; Control systems; Convergence; Error correction; Helium; Intelligent robots; Learning automata; State-space methods; Stochastic processes;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Agent Technology, 2003. IAT 2003. IEEE/WIC International Conference on
Print_ISBN
0-7695-1931-8
Type
conf
DOI
10.1109/IAT.2003.1241115
Filename
1241115
Link To Document