DocumentCode
2226315
Title
Handling different level of unstable reward environment through an estimation of reward distribution in XCS
Author
Tatsumi, Takato ; Komine, Takahiro ; Sato, Hiroyuki ; Takadama, Keiki
Author_Institution
The University of Electro-Communications 1-5-1, Chofugaoka, Chofu-shi Tokyo, Japan
fYear
2015
fDate
25-28 May 2015
Firstpage
2973
Lastpage
2980
Abstract
XCS is an accuracy-based learning classifier system (LCS) which is powered by a reinforcement algorithm. We expect it will have when the reward for a state / action pair is unstable, because it is not possible to correctly estimate the evaluation. This paper focuses on learning in a different level of an unstable reward environment and proposes XCS-URE (XCS for Unstable Reward Environment) by improving XCS for such an environment. For this purpose, XCS-URE estimates the reward distribution of the classifier (i.e., if-then rule) by using the standard deviation of the acquired reward, and adjusts the accuracy of the classifier depending on the reward distribution. In order to investigate the effectiveness of XCS-URE, this paper applies XCS and XCS-URE into the multiple unstable reward environments which have a different level of the unstable rewards added by Gaussian noise. The experiments on the modified multiplexer problems have the following implications: (1) in the environment same Gaussian noise is added, XCS cannot performs properly due to the low accuracy of the classifier in the noisy environments, while XCS-URE can perform properly by acquiring the appropriate classifiers even in such an environment; (2) in the same environment, XCS-URE can reduce the population size without decreasing the correct rate as compared to XCS; and (3) even in the environment different Gaussian noises depending on the situation are added, XCS-URE can reduce the population size without decreasing the correct rate by adjusting the accuracy of the classifier depending on the reward distribution.
Keywords
Accuracy; Integrated optics; Accuracy criterion; Generalization; Learning Classifier System; Standard deviation; XCS;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation (CEC), 2015 IEEE Congress on
Conference_Location
Sendai, Japan
Type
conf
DOI
10.1109/CEC.2015.7257259
Filename
7257259
Link To Document