• DocumentCode
    2843058
  • Title

    Solving Multi-objective Reinforcement Learning Problems by EDA-RL - Acquisition of Various Strategies

  • Author

    Handa, Hisashi

  • Author_Institution
    Grad. Sch. of Natural Sci. & Technol., Okayama Univ., Okayama, Japan
  • fYear
    2009
  • fDate
    Nov. 30 2009-Dec. 2 2009
  • Firstpage
    426
  • Lastpage
    431
  • Abstract
    EDA-RL, estimation of distribution algorithms for reinforcement learning problems, have been proposed by us recently. The EDA-RL can improve policies by EDA scheme: First, select better episodes. Secondly, estimate probabilistic models, i.e., policies, and finally, interact with the environment for generating new episodes. In this paper, the EDA-RL is extended for multi-objective reinforcement learning problems, where reward is given by several criteria. By incorporating the notions in evolutionary multi-objective optimization, the proposed method is enable to acquire various strategies by a single run.
  • Keywords
    evolutionary computation; learning (artificial intelligence); optimisation; probability; distribution algorithms; evolutionary multi-objective optimization; multiobjective reinforcement learning problems; probabilistic models; Design optimization; Electronic design automation and methodology; Evolutionary computation; Intelligent systems; Learning; Markov random fields; Optimization methods; Probability distribution; Safety; State estimation; Estimation of Distribution Algorithms; Evolutionary Multi-Objective Optimisation; Reinforcement Learning Problems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2009. ISDA '09. Ninth International Conference on
  • Conference_Location
    Pisa
  • Print_ISBN
    978-1-4244-4735-0
  • Electronic_ISBN
    978-0-7695-3872-3
  • Type

    conf

  • DOI
    10.1109/ISDA.2009.92
  • Filename
    5364903