DocumentCode :
2248040
Title :
Hierarchical policy iteration for large-scale POMDP systems
Author :
Jiang, Xiaofeng ; Ji, Zhe ; Xi, Hongsheng
Author_Institution :
Department of Automation, University of Science and Technology of China, Hefei 230027, P.R. China
fYear :
2015
fDate :
28-30 July 2015
Firstpage :
2401
Lastpage :
2406
Abstract :
In this paper, the complex partially observable Markov decision process (POMDP) systems with discrete state and action spaces are studied from a large-scale system point of view. By introducing the hierarchical control methods, the complex high-dimensional POMDP system can be decomposed into some low-dimensional subsystems without the related constraints. The optimization problem of each subsystem can be solved independently with the simulation-based policy iteration algorithm on the basis of sensitivity analysis. The computational overhead for optimizing the entire system can be significantly reduced. This algorithm does not need any overly strict assumption and can be applied to most of the practical problems. One numerical example is provided to illustrate the applicability of the algorithm.
Keywords :
Algorithm design and analysis; Approximation algorithms; Large-scale systems; Markov processes; Mobile communication; Optimization; Tin; Partially observable Markov decision process; complex system; hierarchical policy; sensitivity analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (CCC), 2015 34th Chinese
Conference_Location :
Hangzhou, China
Type :
conf
DOI :
10.1109/ChiCC.2015.7260009
Filename :
7260009
Link To Document :
بازگشت