مرکز منطقه ای اطلاع رساني علوم و فناوري - Hierarchical policy iteration for large-scale POMDP systems

DocumentCode :

2248040

Title :

Hierarchical policy iteration for large-scale POMDP systems

Author :

Jiang, Xiaofeng ; Ji, Zhe ; Xi, Hongsheng

Author_Institution :

Department of Automation, University of Science and Technology of China, Hefei 230027, P.R. China

fYear :

2015

fDate :

28-30 July 2015

Firstpage :

2401

Lastpage :

2406

Abstract :

In this paper, the complex partially observable Markov decision process (POMDP) systems with discrete state and action spaces are studied from a large-scale system point of view. By introducing the hierarchical control methods, the complex high-dimensional POMDP system can be decomposed into some low-dimensional subsystems without the related constraints. The optimization problem of each subsystem can be solved independently with the simulation-based policy iteration algorithm on the basis of sensitivity analysis. The computational overhead for optimizing the entire system can be significantly reduced. This algorithm does not need any overly strict assumption and can be applied to most of the practical problems. One numerical example is provided to illustrate the applicability of the algorithm.

Keywords :

Algorithm design and analysis; Approximation algorithms; Large-scale systems; Markov processes; Mobile communication; Optimization; Tin; Partially observable Markov decision process; complex system; hierarchical policy; sensitivity analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Control Conference (CCC), 2015 34th Chinese

Conference_Location :

Hangzhou, China

Type :

conf

DOI :

10.1109/ChiCC.2015.7260009

Filename :

7260009

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2248040