DocumentCode :
2972090
Title :
Adaptive Markov and semi-Markov decision processes: a weak contrast function approach
Author :
Milito, Rodolfo A. ; Cruz, J.B., Jr.
Author_Institution :
AT&T Bell Lab., Holmdel, NJ, USA
fYear :
1988
fDate :
7-9 Dec 1988
Firstpage :
646
Abstract :
The authors consider the control of dynamic systems modeled as a family of either Markov or semi-Markov processes, parameterized by an unknown parameter α which takes values in a given finite set A . The true parameter α0, which represents the real system, belongs to A. The problem is to devise a control strategy that despite the initial ignorance of α0 can still achieve the minimum long-run average cost. The authors first introduce a class of adaptive controllers for Markov chains that meet the challenge, under fairly relaxed conditions. These algorithms are based on the concept of weak contrast functions, an extension of P.L. Mandi´s (1974) contrast functions. A class of adaptive controllers for semi-Markov processes is formulated based on the weak contrast function concept. The authors show that these controllers achieve the minimum long-run average cost and illustrate their application to the adaptive multilayer control of Markov chains
Keywords :
Markov processes; adaptive control; decision theory; time-varying systems; Markov chains; adaptive Markov decision processes; adaptive controllers; dynamic systems; multilayer control; semi-Markov decision processes; time varying systems; weak contrast function; Adaptive control; Communication system traffic control; Control system synthesis; Control systems; Costs; Loss measurement; Programmable control; Random variables; Telecommunication computing; Telecommunication control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 1988., Proceedings of the 27th IEEE Conference on
Conference_Location :
Austin, TX
Type :
conf
DOI :
10.1109/CDC.1988.194390
Filename :
194390
Link To Document :
بازگشت