DocumentCode :
2224464
Title :
Q-learning automaton
Author :
Qian, Fei ; Hirata, Hironori
Author_Institution :
Eng. Fac., Hiroshima Kokusai Gakuin Univ., Japan
fYear :
2003
fDate :
13-16 Oct. 2003
Firstpage :
432
Lastpage :
437
Abstract :
Reinforcement learning is the problem faced by a controller that must learn behavior through trial and error interactions with a dynamic environment. The controller´s goal is to maximize reward over time, by producing an effective mapping of states of actions called policy. To construct the model of such systems, we present a generalized learning automaton approach with Q-learning behaviors. Compared to Q-learning, the computational experiments of the pursuit problems show that the proposed reinforcement scheme obtains better results in terms of convergence speed and memory size.
Keywords :
learning automata; multi-agent systems; stochastic automata; Q-learning automaton; Q-learning behaviors; adaptive evolutionary mechanism; computational experiments; control units; convergence speed; dynamic environment; environment information; generalized learning automaton approach; memory size; multiagent coordinative problem solving; multiagent reinforcement system; optimization problems; random environment state; reinforcement learning; reinforcement scheme; self-consistency space; Automatic control; Context modeling; Control systems; Convergence; Error correction; Helium; Intelligent robots; Learning automata; State-space methods; Stochastic processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Agent Technology, 2003. IAT 2003. IEEE/WIC International Conference on
Print_ISBN :
0-7695-1931-8
Type :
conf
DOI :
10.1109/IAT.2003.1241115
Filename :
1241115
Link To Document :
بازگشت