• DocumentCode
    1592869
  • Title

    About an initial value of Q-value in Profit Sharing

  • Author

    Uemura, Wataru ; Ueno, Atsushi ; Tatsumi, Shoji

  • Author_Institution
    Dept. of Electron. & Informatics, Ryukoku Univ., Ohtsu
  • fYear
    2006
  • Firstpage
    2463
  • Lastpage
    2466
  • Abstract
    A profit sharing method that is one of the reinforcement learning methods distributes the reward to Q-values of rules. A Q-value of a profit sharing method that is used at the action selection has the received value of its rule. In this paper, we discuss an initial value of Q-value and propose the setting method for the initial value of Q-value. If the initial value is too large than the distribution value, the action selection becomes always random selection. If the initial value is too small, the action selection outputs only one action that learned at a beginner. For resolving these problems, we must set the non-problem value at each state. So we propose the Q-value setting method for the initial value of Q-value at each state. The experiment shows that this method is better than the conventional method
  • Keywords
    learning (artificial intelligence); Q-value setting method; initial value setting method; profit sharing method; random selection; reinforcement learning methods; Informatics; Learning systems; Robots; Exploration and Exploitation; Profit Sharing; Reinforecement Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SICE-ICASE, 2006. International Joint Conference
  • Conference_Location
    Busan
  • Print_ISBN
    89-950038-4-7
  • Electronic_ISBN
    89-950038-5-5
  • Type

    conf

  • DOI
    10.1109/SICE.2006.315143
  • Filename
    4108055