DocumentCode :
831796
Title :
Theory for automatic learning under partially observed Markov-dependent noise
Author :
Yakowtiz, S. ; Jayawardena, Thusitha ; Li, Shu
Author_Institution :
Dept. of Syst. & Ind. Eng., Arizona Univ., Tucson, AZ, USA
Volume :
37
Issue :
9
fYear :
1992
fDate :
9/1/1992 12:00:00 AM
Firstpage :
1316
Lastpage :
1324
Abstract :
A vigorous branch of automatic learning is directed at the task of locating a global minimum of an unknown multimodal function f(θ) on the basis of noisy observations L(θ(i))=f(θ(i))+W (θ(i)) taken at sequentially-chosen control points {θ(i)}. In all preceding convergence deviations known to the authors, the noise is postulated to depend on the past only through control selection. Here they allow the observation noise sequence to be stochastically dependent, in particular, a function of an unknown underlying Markov decision process, the observations being the stagewise losses. In a sense, in order to be made precise, the algorithm offered is shown to attain asymptotically optimal performance, and rates are assured. A motivating example from queueing theory is offered, and connections with classical problems of Markov control theory and other disciplines are mentioned
Keywords :
Markov processes; decision theory; learning (artificial intelligence); noise; Markov control theory; automatic learning; global minimum; noisy observations; partially observed Markov-dependent noise; queueing theory; stagewise losses; unknown multimodal function; unknown underlying Markov decision process; Automatic control; Control theory; Convergence; Kernel; Minimization methods; Queueing analysis; Random variables; State-space methods; Stochastic processes; Stochastic resonance;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/9.159569
Filename :
159569
Link To Document :
بازگشت