• DocumentCode
    2226315
  • Title

    Handling different level of unstable reward environment through an estimation of reward distribution in XCS

  • Author

    Tatsumi, Takato ; Komine, Takahiro ; Sato, Hiroyuki ; Takadama, Keiki

  • Author_Institution
    The University of Electro-Communications 1-5-1, Chofugaoka, Chofu-shi Tokyo, Japan
  • fYear
    2015
  • fDate
    25-28 May 2015
  • Firstpage
    2973
  • Lastpage
    2980
  • Abstract
    XCS is an accuracy-based learning classifier system (LCS) which is powered by a reinforcement algorithm. We expect it will have when the reward for a state / action pair is unstable, because it is not possible to correctly estimate the evaluation. This paper focuses on learning in a different level of an unstable reward environment and proposes XCS-URE (XCS for Unstable Reward Environment) by improving XCS for such an environment. For this purpose, XCS-URE estimates the reward distribution of the classifier (i.e., if-then rule) by using the standard deviation of the acquired reward, and adjusts the accuracy of the classifier depending on the reward distribution. In order to investigate the effectiveness of XCS-URE, this paper applies XCS and XCS-URE into the multiple unstable reward environments which have a different level of the unstable rewards added by Gaussian noise. The experiments on the modified multiplexer problems have the following implications: (1) in the environment same Gaussian noise is added, XCS cannot performs properly due to the low accuracy of the classifier in the noisy environments, while XCS-URE can perform properly by acquiring the appropriate classifiers even in such an environment; (2) in the same environment, XCS-URE can reduce the population size without decreasing the correct rate as compared to XCS; and (3) even in the environment different Gaussian noises depending on the situation are added, XCS-URE can reduce the population size without decreasing the correct rate by adjusting the accuracy of the classifier depending on the reward distribution.
  • Keywords
    Accuracy; Integrated optics; Accuracy criterion; Generalization; Learning Classifier System; Standard deviation; XCS;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2015 IEEE Congress on
  • Conference_Location
    Sendai, Japan
  • Type

    conf

  • DOI
    10.1109/CEC.2015.7257259
  • Filename
    7257259