مرکز منطقه ای اطلاع رساني علوم و فناوري - Adaptive control of Markov chains, I: Finite parameter set

DocumentCode :

830949

Title :

Adaptive control of Markov chains, I: Finite parameter set

Author :

Borkar, V. ; Varaiya, Pravin

Author_Institution :

University of California, Berkeley, CA, USA

Volume :

Issue :

fYear :

1979

fDate :

12/1/1979 12:00:00 AM

Firstpage :

953

Lastpage :

957

Abstract :

Consider a controlled Markov chain whose transition probabilities depend upon an unknown parameter α taking values in finite set $A$ . To each α is associated a prespecified stationary control law $\\phi(\\alpha )$ . The adaptive control law selects at each time $t$ the control action indicated by $\\phi(\\alpha _{t})$ where αtis the maximum likelihood estimate of α. It is shown that αtconverges to a parameter α^*such that the "closed-loop" transition probabilities corresponding to α^*and $\\phi(\\alpha ^{\\ast })$ are the same as those corresponding to α⁰and $\\phi(\\alpha )$ where α⁰is the true parameter. The situation when α⁰does not belong to the model set $A$ is briefly discussed.

Keywords :

Adaptive control; Markov processes; Adaptive control; Adaptive systems; Conferences; Control theory; Cost function; Learning systems; Maximum likelihood estimation; Pattern recognition; Random variables; Stochastic systems;

fLanguage :

English

Journal_Title :

Automatic Control, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9286

Type :

jour

DOI :

10.1109/TAC.1979.1102191

Filename :

1102191

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=830949