DocumentCode
1675461
Title
Multi-objective reinforcement learning algorithm for MOSDMP in unknown environment
Author
Zhao, Yun ; Chen, Qingwei ; Hu, Weili
Author_Institution
Dept. of Autom., Univ. of Sci. & Technol., Nanjing, China
fYear
2010
Firstpage
3190
Lastpage
3194
Abstract
In this paper, a new multi-objective reinforcement learning algorithm for multi-objective sequential decision making problems in unknown environment is proposed. The salient characters of the algorithm are: (1) decision maker´s objective preference is introduced to guide learning direction; (2) a new measure of comparing action decisions under several objectives based on the fuzzy inference system is defined; (3) fast learning speed can be achieved. Simulation results demonstrate that the proposed algorithm has a good learning performance.
Keywords
decision making; fuzzy reasoning; learning (artificial intelligence); MOSDMP; fuzzy inference system; learning speed; multiobjective reinforcement learning algorithm; multiobjective sequential decision making problem; Algorithm design and analysis; Delta modulation; Inference algorithms; Learning; Markov processes; Optimization; Silicon; Action decision; Fuzzy inference system; Markov decision processes (MDP); Reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Control and Automation (WCICA), 2010 8th World Congress on
Conference_Location
Jinan
Print_ISBN
978-1-4244-6712-9
Type
conf
DOI
10.1109/WCICA.2010.5553980
Filename
5553980
Link To Document