DocumentCode :
2843058
Title :
Solving Multi-objective Reinforcement Learning Problems by EDA-RL - Acquisition of Various Strategies
Author :
Handa, Hisashi
Author_Institution :
Grad. Sch. of Natural Sci. & Technol., Okayama Univ., Okayama, Japan
fYear :
2009
fDate :
Nov. 30 2009-Dec. 2 2009
Firstpage :
426
Lastpage :
431
Abstract :
EDA-RL, estimation of distribution algorithms for reinforcement learning problems, have been proposed by us recently. The EDA-RL can improve policies by EDA scheme: First, select better episodes. Secondly, estimate probabilistic models, i.e., policies, and finally, interact with the environment for generating new episodes. In this paper, the EDA-RL is extended for multi-objective reinforcement learning problems, where reward is given by several criteria. By incorporating the notions in evolutionary multi-objective optimization, the proposed method is enable to acquire various strategies by a single run.
Keywords :
evolutionary computation; learning (artificial intelligence); optimisation; probability; distribution algorithms; evolutionary multi-objective optimization; multiobjective reinforcement learning problems; probabilistic models; Design optimization; Electronic design automation and methodology; Evolutionary computation; Intelligent systems; Learning; Markov random fields; Optimization methods; Probability distribution; Safety; State estimation; Estimation of Distribution Algorithms; Evolutionary Multi-Objective Optimisation; Reinforcement Learning Problems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications, 2009. ISDA '09. Ninth International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-1-4244-4735-0
Electronic_ISBN :
978-0-7695-3872-3
Type :
conf
DOI :
10.1109/ISDA.2009.92
Filename :
5364903
Link To Document :
بازگشت