• DocumentCode
    2224464
  • Title

    Q-learning automaton

  • Author

    Qian, Fei ; Hirata, Hironori

  • Author_Institution
    Eng. Fac., Hiroshima Kokusai Gakuin Univ., Japan
  • fYear
    2003
  • fDate
    13-16 Oct. 2003
  • Firstpage
    432
  • Lastpage
    437
  • Abstract
    Reinforcement learning is the problem faced by a controller that must learn behavior through trial and error interactions with a dynamic environment. The controller´s goal is to maximize reward over time, by producing an effective mapping of states of actions called policy. To construct the model of such systems, we present a generalized learning automaton approach with Q-learning behaviors. Compared to Q-learning, the computational experiments of the pursuit problems show that the proposed reinforcement scheme obtains better results in terms of convergence speed and memory size.
  • Keywords
    learning automata; multi-agent systems; stochastic automata; Q-learning automaton; Q-learning behaviors; adaptive evolutionary mechanism; computational experiments; control units; convergence speed; dynamic environment; environment information; generalized learning automaton approach; memory size; multiagent coordinative problem solving; multiagent reinforcement system; optimization problems; random environment state; reinforcement learning; reinforcement scheme; self-consistency space; Automatic control; Context modeling; Control systems; Convergence; Error correction; Helium; Intelligent robots; Learning automata; State-space methods; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Agent Technology, 2003. IAT 2003. IEEE/WIC International Conference on
  • Print_ISBN
    0-7695-1931-8
  • Type

    conf

  • DOI
    10.1109/IAT.2003.1241115
  • Filename
    1241115