Robustly stable accelerated momentum methods with a near-optimal L2 gain and H∞ performance

Mert Gürbüzbalaban

doi:10.48550/arXiv.2309.11481

Back

Robustly stable accelerated momentum methods with a near-optimal L2 gain and H∞ performance

Technical documentation

Open access

Robustly stable accelerated momentum methods with a near-optimal L2 gain and H∞ performance

Mert Gürbüzbalaban

Autumn 2023

DOI: https://doi.org/10.48550/arXiv.2309.11481

Abstract

We consider the problem of minimizing a strongly convex smooth function where the gradients are subject to additive worst-case deterministic errors that are square-summable. We study the trade-offs between the convergence rate and robust-ness to gradient errors when designing the parameters of a first-order algorithm. We focus on a general class of momentum methods (GMM) with constant stepsize and two momentum parameters which can recover gradient descent (GD), Nesterov's accelerated gradient (NAG), the heavy-ball (HB) and the triple momentum methods (TMM) as special cases. We measure the robustness of an algorithm in terms of the cumulative suboptimality over the iterations normalized by the squared 2 norm of the gradient errors. This quantity can be interpreted as the (squared) 2 gain of a dynamical system that represents the GMM iterations where the input is the gradient error sequence and the output is a weighted distance to the optimum. For quadratic objectives, we compute the 2 gain explicitly leveraging its representation as the H_∞ norm of the GMM system in the frequency domain and construct gradient errors that lead to worst-case performance explicitly. We also study the stability of GMM with respect to multiplicative errors by characterizing the structured real and stability radius of the GMM system through their connections to the H_∞ norm. This allows us to compare GD, HB, NAG methods in terms of robustness, and argue that HB is not as robust as NAG despite being the fastest in terms of the rate. We then develop the robustly stable heavy ball method that can be faster than NAG while being at the best robustness level possible. We also propose the robustly stable gradient descent that is the fastest version of GD with constant stepsize while being at the best robustness level. Finally, we extend our framework to general strongly convex smooth objectives, providing non-asymptotic rate results for inexact GMM methods and bounds on the 2 gain where we can choose the GMM parameters to systematically trade off the rate to robustness in a computationally tractable framework.

Files and links (3)

pdf

arxiv_main_v12_hinf2.11 MBDownload View

Version of Record (VoR) Open Access

url

https://doi.org/10.48550/arXiv.2309.11481View

Version of Record (VoR) arXiv

url

Report an accessibility issueView

Please complete a content remediation request to report an accessibility issue with a library electronic resource, website, or service.

Metrics

Details

Title: Subtitle: Robustly stable accelerated momentum methods with a near-optimal L2 gain and H∞ performance
Creators: Mert Gürbüzbalaban (Author) - Rutgers University, Management Science and Information Systems (RBS)
Date published: 2023
Number of pages: 47
Grants: N00014-21-1-2244, Office of Naval Research (United States, Arlington) - ONR
DMS-2053485, U.S. National Science Foundation (United States, Alexandria) - NSF
Academic Unit: Management Science and Information Systems (RBS)
Language: English
Resource Type: Technical documentation
Comment: Dedicated to Professor Michael Overton on his seventieth birthday.
Identifiers: 991031891489204646