Influence and variance of a Markov chain: application to adaptive discretization in optimal control

Author

Munos, Remi ; Moore, Andrew

Author_Institution

Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume

2

fYear

1999

fDate

6/21/1905 12:00:00 AM

Firstpage

1464

Abstract

This paper addresses the difficult problem of deciding where to refine the resolution of adaptive discretizations for solving continuous time-and-space deterministic optimal control problems. We introduce two measures, influence and variance of a Markov chain. Influence measures the extent to which changes of some state affect the value function at other states. Variance measures the heterogeneity of the future cumulated active rewards (whose mean is the value function). We combine these two measures to derive a nonlocal efficient splitting criterion that takes into account the impact of a state on other states when deciding whether to split. We illustrate this method on the non-linear, two dimensional “Car on the Hill” and the 4d “space-shuttle” and “airplane-meeting” control problems

Keywords

Markov processes; discrete time systems; linear systems; optimal control; 4D space-shuttle problem; Markov chain; adaptive discretization; airplane-meeting control problem; continuous time-and-space deterministic optimal control problems; heterogeneity; influence; nonlinear 2D Car on the Hill problem; nonlocal efficient splitting criterion; optimal control; variance; Adaptive control; Optimal control; Programmable control; State-space methods; Web pages;

fLanguage

English

Publisher

ieee

Conference_Titel

Decision and Control, 1999. Proceedings of the 38th IEEE Conference on

Conference_Location

Phoenix, AZ

ISSN

0191-2216

Print_ISBN

0-7803-5250-5

Type

conf

DOI

10.1109/CDC.1999.830188

Filename

830188