• DocumentCode
    466107
  • Title

    Complex-Valued Reinforcement Learning

  • Author

    Hamagami, Tomoki ; Shibuya, Takashi ; Shimada, Shingo

  • Author_Institution
    Yokohama Nat. Univ., Yokohama
  • Volume
    5
  • fYear
    2006
  • fDate
    8-11 Oct. 2006
  • Firstpage
    4175
  • Lastpage
    4179
  • Abstract
    A new reinforcement learning algorithm with complex-valued functions is proposed. The algorithm is inspired by complex-valued neural networks introducing complex numbers representing phase and amplitude into a conventional neural network. The strong advantage of using complex values in reinforcement learning is that the state-action function in a time series can be easily extended. In particular, considering the coherence of each complex value, the proposed learning algorithm can represent the context of agent behavior. This extension allows compensating for the perceptual aliasing problem and provides for the intelligent behavior of mobile robots in the real world. The complex-valued functions are applied to the conventional reinforcement learning algorithms: Q-learning and profit sharing. These algorithms are evaluated by simple maze problems and a bar-carrying task involving perceptual aliasings. Simulation experiments show that the new algorithm can efficiently solve perceptual aliasing.
  • Keywords
    learning (artificial intelligence); neural nets; time series; Q-learning; complex-valued functions; complex-valued neural networks; complex-valued reinforcement learning; mobile robots; perceptual aliasing problem; profit sharing; reinforcement learning algorithm; state-action function; time series; Control systems; Cybernetics; Intelligent agent; Intelligent robots; Learning; Mobile robots; Multiagent systems; Neural networks; Resonance light scattering; Robot control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    1-4244-0099-6
  • Electronic_ISBN
    1-4244-0100-3
  • Type

    conf

  • DOI
    10.1109/ICSMC.2006.384789
  • Filename
    4274554