Multi-objective reinforcement learning algorithm for MOSDMP in unknown environment

Author

Zhao, Yun ; Chen, Qingwei ; Hu, Weili

Author_Institution

Dept. of Autom., Univ. of Sci. & Technol., Nanjing, China

fYear

2010

Firstpage

3190

Lastpage

3194

Abstract

In this paper, a new multi-objective reinforcement learning algorithm for multi-objective sequential decision making problems in unknown environment is proposed. The salient characters of the algorithm are: (1) decision maker´s objective preference is introduced to guide learning direction; (2) a new measure of comparing action decisions under several objectives based on the fuzzy inference system is defined; (3) fast learning speed can be achieved. Simulation results demonstrate that the proposed algorithm has a good learning performance.

Keywords

decision making; fuzzy reasoning; learning (artificial intelligence); MOSDMP; fuzzy inference system; learning speed; multiobjective reinforcement learning algorithm; multiobjective sequential decision making problem; Algorithm design and analysis; Delta modulation; Inference algorithms; Learning; Markov processes; Optimization; Silicon; Action decision; Fuzzy inference system; Markov decision processes (MDP); Reinforcement learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Control and Automation (WCICA), 2010 8th World Congress on

Conference_Location

Jinan

Print_ISBN

978-1-4244-6712-9

Type

conf

DOI

10.1109/WCICA.2010.5553980

Filename

5553980

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=1675461