Title :
Efficient estimation and control for Markov processes
Author :
Burnetas, Apostolos N. ; Katehakis, Michael N.
Author_Institution :
Dept. of Oper. Res., Case Western Reserve Univ., Cleveland, OH, USA
Abstract :
Considers the problem of sequential control for a finite state and action Markovian decision process with incomplete information regarding the transition probabilities P∈P˜. Under suitable irreducibility assumptions for P˜, the authors construct adaptive policies that maximize the rate of convergence of realized rewards to that of the optimal (non adaptive) policy under complete information. These adaptive policies are specified via an easily computable index function, of states, controls and statistics, so that one takes a control with the largest index value in the current state in every period
Keywords :
Markov processes; convergence; decision theory; discrete time systems; probability; queueing theory; action Markovian decision process; adaptive policies; finite state; incomplete information; irreducibility assumptions; rate of convergence; sequential control; transition probabilities; Adaptive control; History; Markov processes; Operations research; Optimal control; Process control; Programmable control; Random variables; State estimation; Statistics;
Conference_Titel :
Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
Conference_Location :
New Orleans, LA
Print_ISBN :
0-7803-2685-7
DOI :
10.1109/CDC.1995.480297