Title :
Structure search of probabilistic models and data correction for EDA-RL
Author_Institution :
Grad. Sch. of Natural Sci. & Technol., Okayama Univ., Okayama, Japan
Abstract :
We have proposed a novel Estimation of Distribution Algorithm for solving reinforcement learning problems: EDA-RL. The EDA-RL can perform well if the complexity of the structure of the probabilistic model is adapted to the difficulty of given problems. Therefore, this paper proposes a structure search method of the probabilistic model in the EDA-RL as in conventional EDA taking account multivariate dependencies. Moreover, a data correction method by eliminating loops of state transitions is also proposed. Computational simulations on maze problems, which have several perceptual aliasing states, show the effectiveness of the proposed method.
Keywords :
learning (artificial intelligence); probability; search problems; EDA-RL; data correction method; estimation of distribution algorithm; probabilistic model; reinforcement learning problems; structure search method; Adaptation models; Computational modeling; Estimation; Learning; Markov processes; Mathematical model; Probabilistic logic;
Conference_Titel :
Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9887-1
DOI :
10.1109/ADPRL.2011.5967388