Title :
Optimal Rates for Zero-Order Convex Optimization: The Power of Two Function Evaluations
Author :
Duchi, John C. ; Jordan, Michael I. ; Wainwright, Martin J. ; Wibisono, Andre
Author_Institution :
Dept. of Stat. & Electr. Eng., Stanford Univ., Stanford, CA, USA
Abstract :
We consider derivative-free algorithms for stochastic and nonstochastic convex optimization problems that use only function values rather than gradients. Focusing on nonasymptotic bounds on convergence rates, we show that if pairs of function values are available, algorithms for d-dimensional optimization that use gradient estimates based on random perturbations suffer a factor of at most √d in convergence rate over traditional stochastic gradient methods. We establish such results for both smooth and nonsmooth cases, sharpening previous analyses that suggested a worse dimension dependence, and extend our results to the case of multiple (m ≥ 2) evaluations. We complement our algorithmic development with information-theoretic lower bounds on the minimax convergence rate of such problems, establishing the sharpness of our achievable results up to constant (sometimes logarithmic) factors.
Keywords :
convergence; convex programming; estimation theory; perturbation techniques; stochastic programming; algorithmic development; d-dimensional optimization; derivative-free algorithm; function evaluation; function value; gradient estimate; information-theoretic lower bound; minimax convergence rate; nonasymptotic bound; nonstochastic convex optimization problem; optimal rate; random perturbation; stochastic gradient method; worse dimension dependence; zero-order convex optimization; Convergence; Convex functions; Mirrors; Optimization; Smoothing methods; Standards; Vectors; Optimization; lower bounds; on-line learning; statistical learning; zero-order optimization;
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2015.2409256