DocumentCode :
2498772
Title :
Structure search of probabilistic models and data correction for EDA-RL
Author :
Handa, Hisashi
Author_Institution :
Grad. Sch. of Natural Sci. & Technol., Okayama Univ., Okayama, Japan
fYear :
2011
fDate :
11-15 April 2011
Firstpage :
332
Lastpage :
337
Abstract :
We have proposed a novel Estimation of Distribution Algorithm for solving reinforcement learning problems: EDA-RL. The EDA-RL can perform well if the complexity of the structure of the probabilistic model is adapted to the difficulty of given problems. Therefore, this paper proposes a structure search method of the probabilistic model in the EDA-RL as in conventional EDA taking account multivariate dependencies. Moreover, a data correction method by eliminating loops of state transitions is also proposed. Computational simulations on maze problems, which have several perceptual aliasing states, show the effectiveness of the proposed method.
Keywords :
learning (artificial intelligence); probability; search problems; EDA-RL; data correction method; estimation of distribution algorithm; probabilistic model; reinforcement learning problems; structure search method; Adaptation models; Computational modeling; Estimation; Learning; Markov processes; Mathematical model; Probabilistic logic;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9887-1
Type :
conf
DOI :
10.1109/ADPRL.2011.5967388
Filename :
5967388
Link To Document :
بازگشت