Q-learning automaton

Author

Qian, Fei ; Hirata, Hironori

Author_Institution

Eng. Fac., Hiroshima Kokusai Gakuin Univ., Japan

fYear

2003

fDate

13-16 Oct. 2003

Firstpage

432

Lastpage

437

Abstract

Reinforcement learning is the problem faced by a controller that must learn behavior through trial and error interactions with a dynamic environment. The controller´s goal is to maximize reward over time, by producing an effective mapping of states of actions called policy. To construct the model of such systems, we present a generalized learning automaton approach with Q-learning behaviors. Compared to Q-learning, the computational experiments of the pursuit problems show that the proposed reinforcement scheme obtains better results in terms of convergence speed and memory size.

Keywords

learning automata; multi-agent systems; stochastic automata; Q-learning automaton; Q-learning behaviors; adaptive evolutionary mechanism; computational experiments; control units; convergence speed; dynamic environment; environment information; generalized learning automaton approach; memory size; multiagent coordinative problem solving; multiagent reinforcement system; optimization problems; random environment state; reinforcement learning; reinforcement scheme; self-consistency space; Automatic control; Context modeling; Control systems; Convergence; Error correction; Helium; Intelligent robots; Learning automata; State-space methods; Stochastic processes;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Agent Technology, 2003. IAT 2003. IEEE/WIC International Conference on

Print_ISBN

0-7695-1931-8

Type

conf

DOI

10.1109/IAT.2003.1241115

Filename

1241115