مرکز منطقه ای اطلاع رساني علوم و فناوري - Feature-based methods for large scale dynamic programming

DocumentCode :

294458

Title :

Feature-based methods for large scale dynamic programming

Author :

Tsitsiklis, John N. ; Van Roy, Benjamin

Author_Institution :

Lab. for Inf. & Decision Syst., MIT, Cambridge, MA, USA

Volume :

fYear :

1995

fDate :

13-15 Dec 1995

Firstpage :

565

Abstract :

Summary form only given. We develop a methodological framework and present a few different ways in which dynamic programming and compact representations can be combined to solve large scale stochastic control problems. In particular, we develop algorithms that employ two types of feature-based compact representations; that is, representations that involve feature extraction and a relatively simple approximation architecture. We prove the convergence of these algorithms and provide bounds on the approximation error. As an example, one of these algorithms is used to generate a strategy for the game of Tetris. Furthermore, we provide a counter-example illustrating the difficulties of integrating compact representations with dynamic programming, which exemplifies the shortcomings of certain simple approaches

Keywords :

Markov processes; approximation theory; convergence of numerical methods; decision theory; dynamic programming; iterative methods; stochastic systems; Markov decision problem; Tetris game; approximation architecture; approximation error; compact representations; convergence; cost-to-go function; feature extraction; feature-based methods; iteration algorithm; large scale dynamic programming; stochastic control; Approximation algorithms; Artificial intelligence; Control systems; Cost function; Dynamic programming; Large-scale systems; Nonlinear control systems; State-space methods; Stochastic processes; Stochastic systems;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 1995., Proceedings of the 34th IEEE Conference on

Conference_Location :

New Orleans, LA

ISSN :

0191-2216

Print_ISBN :

0-7803-2685-7

Type :

conf

DOI :

10.1109/CDC.1995.478954

Filename :

478954

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=294458