Title :
Theory for automatic learning under partially observed Markov-dependent noise
Author :
Yakowtiz, S. ; Jayawardena, Thusitha ; Li, Shu
Author_Institution :
Dept. of Syst. & Ind. Eng., Arizona Univ., Tucson, AZ, USA
fDate :
9/1/1992 12:00:00 AM
Abstract :
A vigorous branch of automatic learning is directed at the task of locating a global minimum of an unknown multimodal function f(θ) on the basis of noisy observations L(θ(i))=f(θ(i))+W (θ(i)) taken at sequentially-chosen control points {θ(i)}. In all preceding convergence deviations known to the authors, the noise is postulated to depend on the past only through control selection. Here they allow the observation noise sequence to be stochastically dependent, in particular, a function of an unknown underlying Markov decision process, the observations being the stagewise losses. In a sense, in order to be made precise, the algorithm offered is shown to attain asymptotically optimal performance, and rates are assured. A motivating example from queueing theory is offered, and connections with classical problems of Markov control theory and other disciplines are mentioned
Keywords :
Markov processes; decision theory; learning (artificial intelligence); noise; Markov control theory; automatic learning; global minimum; noisy observations; partially observed Markov-dependent noise; queueing theory; stagewise losses; unknown multimodal function; unknown underlying Markov decision process; Automatic control; Control theory; Convergence; Kernel; Minimization methods; Queueing analysis; Random variables; State-space methods; Stochastic processes; Stochastic resonance;
Journal_Title :
Automatic Control, IEEE Transactions on