DocumentCode :
1434389
Title :
Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation
Author :
Sadegh, Payman ; Spall, James C.
Author_Institution :
Inst. of Math. Modeling, Tech. Univ., Lyngby, Denmark
Volume :
43
Issue :
10
fYear :
1998
fDate :
10/1/1998 12:00:00 AM
Firstpage :
1480
Lastpage :
1484
Abstract :
The simultaneous perturbation stochastic approximation (SPSA) algorithm has attracted considerable attention for challenging optimization problems where it is difficult or impossible to obtain a direct gradient of the objective (say, loss) function. The approach is based on a highly efficient simultaneous perturbation approximation to the gradient based on loss function measurements. SPSA is based on picking a simultaneous perturbation (random) vector in a Monte Carlo fashion as part of generating the approximation to the gradient. This paper derives the optimal distribution for the Monte Carlo process. The objective is to minimize the mean square error of the estimate. The authors also consider maximization of the likelihood that the estimate be confined within a bounded symmetric region of the true parameter. The optimal distribution for the components of the simultaneous perturbation vector is found to be a symmetric Bernoulli in both cases. The authors end the paper with a numerical study related to the area of experiment design
Keywords :
Monte Carlo methods; approximation theory; design of experiments; optimisation; probability; Monte Carlo process; loss function measurements; mean square error; optimal distribution; optimal random perturbations; simultaneous perturbation gradient approximation; stochastic approximation; symmetric Bernoulli distribution; Automatic control; Constraint optimization; Control systems; Discrete time systems; Linear matrix inequalities; Mathematics; Optimal control; Riccati equations; State feedback; Stochastic processes;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/9.720513
Filename :
720513
Link To Document :
بازگشت