• DocumentCode
    294956
  • Title

    Efficient estimation and control for Markov processes

  • Author

    Burnetas, Apostolos N. ; Katehakis, Michael N.

  • Author_Institution
    Dept. of Oper. Res., Case Western Reserve Univ., Cleveland, OH, USA
  • Volume
    2
  • fYear
    1995
  • fDate
    13-15 Dec 1995
  • Firstpage
    1402
  • Abstract
    Considers the problem of sequential control for a finite state and action Markovian decision process with incomplete information regarding the transition probabilities P∈P˜. Under suitable irreducibility assumptions for P˜, the authors construct adaptive policies that maximize the rate of convergence of realized rewards to that of the optimal (non adaptive) policy under complete information. These adaptive policies are specified via an easily computable index function, of states, controls and statistics, so that one takes a control with the largest index value in the current state in every period
  • Keywords
    Markov processes; convergence; decision theory; discrete time systems; probability; queueing theory; action Markovian decision process; adaptive policies; finite state; incomplete information; irreducibility assumptions; rate of convergence; sequential control; transition probabilities; Adaptive control; History; Markov processes; Operations research; Optimal control; Process control; Programmable control; Random variables; State estimation; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
  • Conference_Location
    New Orleans, LA
  • ISSN
    0191-2216
  • Print_ISBN
    0-7803-2685-7
  • Type

    conf

  • DOI
    10.1109/CDC.1995.480297
  • Filename
    480297