Composing Optimized Stepsize Schedules for Gradient Descent

Published Online:

References

  • [1] Altschuler JM, Parrilo PA (2025) Acceleration by stepsize hedging: Silver stepsize schedule for smooth convex optimization. Math. Programming 213:1105–1118.CrossrefGoogle Scholar
  • [2] Altschuler JM, Parrilo PA (2025) Acceleration by stepsize hedging: Multi-step descent and the silver stepsize schedule. J. ACM 72(2):1–38.CrossrefGoogle Scholar
  • [3] Das Gupta S, Van Parys BPG, Ryu EK (2024) Branch-and-bound performance estimation programming: A unified methodology for constructing optimal optimization methods. Math. Program. 204(1–2):567–639. CrossrefGoogle Scholar
  • [4] Drori Y, Teboulle M (2014) Performance of first-order methods for smooth convex minimization: A novel approach. Math. Programming 145:451–482.CrossrefGoogle Scholar
  • [5] Grimmer B, Shu K, Wang AL (2023) Accelerated gradient descent via long steps. Preprint, submitted September 27, https://arxiv.org/abs/2309.09961.Google Scholar
  • [6] Grimmer B, Shu K, Wang AL (2024) A strengthened conjecture on the minimax optimal constant stepsize for gradient descent. Preprint, submitted July 16, https://arxiv.org/abs/2407.11739.Google Scholar
  • [7] Grimmer B, Shu K, Wang AL (2025) Accelerated objective gap and gradient norm convergence for gradient descent via long steps. INFORMS J. Optim. 7(2):156–169.LinkGoogle Scholar
  • [8] Kim J (2025) A proof of exact convergence rate of gradient descent. Part II. Performance criterion (f(xn)−f*)/‖x0−x*‖2. Preprint, submitted March 26, https://arxiv.org/abs/2412.04427.Google Scholar
  • [9] Kim D, Fessler JA (2016) Optimized first-order methods for smooth convex minimization. Math. Programming 159(1):81–107.CrossrefGoogle Scholar
  • [10] Kim D, Fessler JA (2021) Optimizing the efficiency of first-order methods for decreasing the gradient of smooth convex functions. J. Optim. Theory Appl. 188(1):192–219.CrossrefGoogle Scholar
  • [11] Kim J, Ozdaglar A, Park C, Ryu E (2023) Time-reversed dissipation induces duality between minimizing gradient norm and function value. Adv. Neural Inform. Processing Systems 36:23389–23440.Google Scholar
  • [12] Luner A, Grimmer B (2024) On averaging and extrapolation for gradient descent. Preprint, submitted February 19, https://arxiv.org/abs/2402.12493.Google Scholar
  • [13] Malitsky Y, Mishchenko K (2020) Adaptive gradient descent without descent. Proc. 37th Internat. Conf. Machine Learn. ICML 2020, vol. 119 (PMLR, New York), 6702–6712.Google Scholar
  • [14] Nesterov Y (1983) A method for solving the convex programming problem with convergence rate o(1/k2). Proc. USSR Acad. Sci. 269:543–547.Google Scholar
  • [15] Nesterov Y (2014) Introductory Lectures on Convex Optimization: A Basic Course, 1st ed. (Springer Publishing Company, New York).Google Scholar
  • [16] Rotaru T, Glineur F, Patrinos P (2024) Exact worst-case convergence rates of gradient descent: A complete analysis for all constant stepsizes over nonconvex and convex functions. Preprint, submitted June 25, https://arxiv.org/abs/2406.17506.Google Scholar
  • [17] Taylor A, Hendrickx J, Glineur F (2017) Exact worst-case performance of first-order methods for composite convex optimization. SIAM J. Optim. 27(3):1283–1313.CrossrefGoogle Scholar
  • [18] Taylor A, Hendrickx J, Glineur F (2017) Smooth strongly convex interpolation and exact worst-case performance of first-order methods. Math. Programming 161:307–345.CrossrefGoogle Scholar
  • [19] Teboulle M, Vaisbourd Y (2023) An elementary approach to tight worst case complexity analysis of gradient based methods. Math. Programming 201(1–2):63–96.CrossrefGoogle Scholar
  • [20] Wang B, Ma S, Yang J, Zhou D (2024) Relaxed proximal point algorithm: Tight complexity bounds and acceleration without momentum. Preprint, submitted October 11, https://arxiv.org/abs/2410.08890.Google Scholar
  • [21] Zhang Z, Jiang R (2024) Accelerated gradient descent by concatenation of stepsize schedules. Preprint, submitted October 16, https://arxiv.org/abs/2410.12395.Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.