مرکز منطقه ای اطلاع رساني علوم و فناوري - The <formula formulatype="inline"> <img src="/images/tex/388.gif" alt="n"> </formula>th-Order Bias Optimality for Multichain Markov Decision Processes

DocumentCode :

1109397

Title :

The $n$ th-Order Bias Optimality for Multichain Markov Decision Processes

Author :

Cao, Xi-Ren ; Zhang, Junyu

Author_Institution :

Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong

Volume :

Issue :

fYear :

2008

fDate :

3/1/2008 12:00:00 AM

Firstpage :

496

Lastpage :

508

Abstract :

In this paper, we propose a new approach to the theory of finite multichain Markov decision processes (MDPs) with different performance optimization criteria. We first propose the concept of nth-order bias; then, using the average reward and bias difference formulas derived in this paper, we develop an optimization theory for finite MDPs that covers a complete spectrum from average optimality, bias optimality, to all high-order bias optimality, in a unified way. The approach is simple, direct, natural, and intuitive; it depends neither on Laurent series expansion nor on discounted MDPs. We also propose one-phase policy iteration algorithms for bias and high-order bias optimal policies, which are more efficient than the two-phase algorithms in the literature. Furthermore, we derive high-order bias optimality equations. This research is a part of our effort in developing sensitivity-based learning and optimization theory.

Keywords :

Markov processes; decision theory; iterative methods; optimisation; series (mathematics); Laurent series expansion; average reward-bias difference formula; finite multichain Markov decision process; nth-order bias optimality; one-phase policy iteration algorithm; optimization criteria; Computer vision; Convergence; Equations; Linear programming; Optimization; Sun; User-generated content; $n$ th-bias optimality; $n$ th potentials; Average optimality; Markov decision processes (MDPs); bias optimality; discrete-event systems; policy iteration;

fLanguage :

English

Journal_Title :

Automatic Control, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9286

Type :

jour

DOI :

10.1109/TAC.2007.915168

Filename :

4475714

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1109397