Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization

Samuel Horvath; Lihua Lei; Peter Richtarik; Michael I. Jordan

doi:10.1137/21M1394308

Back

Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization

Journal article

Open access

Peer reviewed

Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization

Samuel Horvath, Lihua Lei, Peter Richtarik and Michael I. Jordan

SIAM journal on mathematics of data science, Vol.4(2), pp.634-648

06/2022

DOI: https://doi.org/10.1137/21M1394308

Abstract

Mathematics

Mathematics, Applied

Physical Sciences

Science & Technology

Adaptivity is an important yet under-studied property in modern optimization theory. The gap be-tween the state-of-the-art theory and the current practice is striking in that algorithms with desirable theoretical guarantees typically involve drastically different settings of hyperparameters, such as step size schemes and batch sizes, in different regimes. Despite the appealing theoretical results, such divisive strategies provide little, if any, insight to practitioners to select algorithms that work broadly without tweaking the hyperparameters. In this work, blending the "geometrization" technique intro-duced by [L. Lei and M. I. Jordan, Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017, pp. 148-156] and the SARAH algorithm of [L. M. Nguyen, J. Liu, K. Scheinberg, and M. Takac, Proceedings of the 34th International Conference on Machine Learn-ing, 2017, pp. 2613-2621], we propose the geometrized SARAH algorithm for nonconvex finite-sum and stochastic optimization. Our algorithm is proved to achieve adaptivity to both the magnitude of the target accuracy and the Polyak-Lojasiewicz (PL) constant, if present. In addition, it achieves the best-available convergence rate for non-PL objectives simultaneously while outperforming existing algorithms for PL objectives.

Files and links (1)

url

https://doi.org/10.1137/21M1394308View

Published (Version of record) Open

Metrics

1 Record Views

See more details

Details

Title: Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization
Creators - without role: Samuel Horvath - King Abdullah University of Science and Technology
Lihua Lei - Stanford University
Peter Richtarik - King Abdullah University of Science and Technology
Michael I. Jordan - University of California, Berkeley
Publication Details: SIAM journal on mathematics of data science, Vol.4(2), pp.634-648
Publisher: Siam Publications
Number of pages: 15
Identifiers: 9941589708331
Academic Unit: King Abdullah University of Science & Technology
Language: English
Resource Type: Journal article