• DocumentCode
    3030759
  • Title

    Adaptive control of Markov chains, I: Finite parameter set

  • Author

    Borkar, Vivek ; Varaiya, P.

  • Author_Institution
    University of California, Berkeley, California
  • Volume
    2
  • fYear
    1979
  • fDate
    12-14 Dec. 1979
  • Firstpage
    744
  • Lastpage
    749
  • Abstract
    Consider a controlled Markov chain whose transition probabilities depend upon an unknovn parameter ?? taking values in finite set A. To each a is associated a prespecified stationary control law ??(??). The adaptive control lay selects at each time t the control action indicated by ??(??t) where ??t is the maximum likelihood estimate of ??. It is shown that ??t converges to a parameter ??* such that the ´closed loop transition probabilities corresponding to ??* and ??(??*) are the same as those corresponding to ??0 and ??(??*) where ??0 is the true parameter. The situation vhen ??0 does not belong to the model set A is briefly discussed.
  • Keywords
    Adaptive control; Laboratories; Programmable control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control including the Symposium on Adaptive Processes, 1979 18th IEEE Conference on
  • Conference_Location
    Fort Lauderdale, FL, USA
  • Type

    conf

  • DOI
    10.1109/CDC.1979.270287
  • Filename
    4046515