Title :
Matrix conditioning and adaptive simultaneous perturbation stochastic approximation method
Author_Institution :
Appl. Phys. Lab., Johns Hopkins Univ., Laurel, MD, USA
Abstract :
The paper proposes a modification to the simultaneous perturbation stochastic approximation (SPSA) methods based on comparisons made between the first order and the second order SPSA (1SPSA and 2SPSA) algorithms from the perspective of loss function Hessian. At finite iterations, the convergence rate depends on matrix conditioning of the loss function Hessian. It is shown that 2SPSA converges more slowly for a loss function with an M-conditioned Hessian than the one with a well-conditioned Hessian. On the other hand, the convergence rate of 1SPSA is less sensitive to the matrix conditioning of loss function Hessians. The modified 2SPSA (M2SPSA) eliminates the error amplification caused by the inversion of an ill-conditioned Hessian at finite iterations which leads to significant improvements in its convergence rate in problems with an ill-conditioned Hessian matrix. Asymptotically, the efficiency analysis shows that M2SPSA is also superior to 2SPSA in terms of its convergence rate coefficients. It is shown that for the same asymptotic convergence rate, the ratio of the mean square errors for M2SPSA to 2SPSA is always less than one except for a perfectly conditioned Hessian
Keywords :
Hessian matrices; iterative methods; matrix inversion; perturbation techniques; stochastic processes; 1SPSA; 2SPSA; M2SPSA; SPSA methods; adaptive simultaneous perturbation stochastic approximation method; asymptotic convergence rate; convergence rate; convergence rate coefficients; efficiency analysis; error amplification; finite iterations; first order algorithms; ill-conditioned Hessian; loss function Hessian; matrix conditioning; mean square errors; modified 2SPSA; perfectly conditioned Hessian; second order algorithms; well-conditioned Hessian; Approximation algorithms; Approximation methods; Convergence; Laboratories; Loss measurement; Mean square error methods; Optimization methods; Parameter estimation; Physics; Stochastic processes;
Conference_Titel :
American Control Conference, 2001. Proceedings of the 2001
Conference_Location :
Arlington, VA
Print_ISBN :
0-7803-6495-3
DOI :
10.1109/ACC.2001.945918