DocumentCode
294956
Title
Efficient estimation and control for Markov processes
Author
Burnetas, Apostolos N. ; Katehakis, Michael N.
Author_Institution
Dept. of Oper. Res., Case Western Reserve Univ., Cleveland, OH, USA
Volume
2
fYear
1995
fDate
13-15 Dec 1995
Firstpage
1402
Abstract
Considers the problem of sequential control for a finite state and action Markovian decision process with incomplete information regarding the transition probabilities P∈P˜. Under suitable irreducibility assumptions for P˜, the authors construct adaptive policies that maximize the rate of convergence of realized rewards to that of the optimal (non adaptive) policy under complete information. These adaptive policies are specified via an easily computable index function, of states, controls and statistics, so that one takes a control with the largest index value in the current state in every period
Keywords
Markov processes; convergence; decision theory; discrete time systems; probability; queueing theory; action Markovian decision process; adaptive policies; finite state; incomplete information; irreducibility assumptions; rate of convergence; sequential control; transition probabilities; Adaptive control; History; Markov processes; Operations research; Optimal control; Process control; Programmable control; Random variables; State estimation; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
Conference_Location
New Orleans, LA
ISSN
0191-2216
Print_ISBN
0-7803-2685-7
Type
conf
DOI
10.1109/CDC.1995.480297
Filename
480297
Link To Document