DocumentCode
3030759
Title
Adaptive control of Markov chains, I: Finite parameter set
Author
Borkar, Vivek ; Varaiya, P.
Author_Institution
University of California, Berkeley, California
Volume
2
fYear
1979
fDate
12-14 Dec. 1979
Firstpage
744
Lastpage
749
Abstract
Consider a controlled Markov chain whose transition probabilities depend upon an unknovn parameter ?? taking values in finite set A. To each a is associated a prespecified stationary control law ??(??). The adaptive control lay selects at each time t the control action indicated by ??(??t) where ??t is the maximum likelihood estimate of ??. It is shown that ??t converges to a parameter ??* such that the ´closed loop transition probabilities corresponding to ??* and ??(??*) are the same as those corresponding to ??0 and ??(??*) where ??0 is the true parameter. The situation vhen ??0 does not belong to the model set A is briefly discussed.
Keywords
Adaptive control; Laboratories; Programmable control;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control including the Symposium on Adaptive Processes, 1979 18th IEEE Conference on
Conference_Location
Fort Lauderdale, FL, USA
Type
conf
DOI
10.1109/CDC.1979.270287
Filename
4046515
Link To Document