Title :
Observation-based performance sensitivity analysis for POMDPs
Author :
Ji, Zhe ; Jiang, Xiaofeng ; Xi, Hongsheng
Author_Institution :
Department of Auto, School of Information Science and Technology, University of Science and Technology of China, 230027, Hefei, China
Abstract :
In this paper, the performance sensitivity analysis for Markov decision processes (MDPs) are generalized to study the partially observable Markov decision processes (POMDPs). The performance derivative formula and the performance difference formula based on observation are derived in this paper. The derivation does not need any overly strict assumptions. In order to find the optimal policy based on observation, an observation-based policy iteration algorithm is designed. An example is presented to show the applicability of the algorithm finally.
Keywords :
Algorithm design and analysis; Approximation algorithms; Markov processes; Mathematical model; Optimization; Sensitivity analysis; Steady-state; POMDPs; Performance derivative formula; Performance difference formula; Performance sensitivity analysis;
Conference_Titel :
Control Conference (CCC), 2015 34th Chinese
Conference_Location :
Hangzhou, China
DOI :
10.1109/ChiCC.2015.7259887