DocumentCode
830949
Title
Adaptive control of Markov chains, I: Finite parameter set
Author
Borkar, V. ; Varaiya, Pravin
Author_Institution
University of California, Berkeley, CA, USA
Volume
24
Issue
6
fYear
1979
fDate
12/1/1979 12:00:00 AM
Firstpage
953
Lastpage
957
Abstract
Consider a controlled Markov chain whose transition probabilities depend upon an unknown parameter α taking values in finite set
. To each α is associated a prespecified stationary control law
. The adaptive control law selects at each time
the control action indicated by
where αt is the maximum likelihood estimate of α. It is shown that αt converges to a parameter α*such that the "closed-loop" transition probabilities corresponding to α*and
are the same as those corresponding to α0and
where α0is the true parameter. The situation when α0does not belong to the model set
is briefly discussed.
. To each α is associated a prespecified stationary control law
. The adaptive control law selects at each time
the control action indicated by
where α
are the same as those corresponding to α0and
where α0is the true parameter. The situation when α0does not belong to the model set
is briefly discussed.Keywords
Adaptive control; Markov processes; Adaptive control; Adaptive systems; Conferences; Control theory; Cost function; Learning systems; Maximum likelihood estimation; Pattern recognition; Random variables; Stochastic systems;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/TAC.1979.1102191
Filename
1102191
Link To Document