AdaBB: Adaptive Barzilai-Borwein Method for Convex Optimization
References
- [1] (2023) Acceleration by stepsize hedging I: Multi-step descent and the silver stepsize schedule. Preprint, submitted September 14, https://arxiv.org/abs/2309.07879.Google Scholar
- [2] (2023) Acceleration by stepsize hedging II: Silver stepsize schedule for smooth convex optimization. Preprint, submitted September 28, https://arxiv.org/abs/2309.16530.Google Scholar
- [3] (1966) Minimization of functions having Lipschitz continuous first partial derivatives. Pacific J. Math. 16:1–3.Crossref, Google Scholar
- [4] (1988) Two-point step size gradient methods. IMA J. Numer. Anal. 8(1):141–148.Crossref, Google Scholar
- [5] (2009) A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2(1):183–202.Crossref, Google Scholar
- [6] (2019) Stabilized Barzilai-Borwein method. J. Comput. Math. 37(6):916–936.Crossref, Google Scholar
- [7] (2011) LIBSVM: A library for support vector machines. ACM Trans. Intelligent Systems Tech. 2(3):27.Crossref, Google Scholar
- [8] (2002) R-linear convergence of the Barzilai and Borwein gradient method. IMA J. Numer. Anal. 22(1):1–10.Crossref, Google Scholar
- [9] (2024) Branch-and-bound performance estimation programming: A unified methodology for constructing optimal optimization methods. Math. Program. 204:567–639.Crossref, Google Scholar
- [10] (2018) On the steplength selection in gradient methods for unconstrained optimization. Appl. Math. Comput. 318:176–195.Google Scholar
- [11] (2018) On the properties of convex functions over open sets. Preprint, submitted December 30, https://arxiv.org/abs/1812.02419.Google Scholar
- [12] (2014) Performance of first-order methods for smooth convex minimization: A novel approach. Math. Program. 145(1–2):451–482.Crossref, Google Scholar
- [13] (2011) Adaptive subgradient methods for online learning and stochastic optimization. J. Machine Learn. Res. 12:2121–2159.Google Scholar
- [14] (2019) Generalized uniformly optimal methods for nonlinear programming. J. Sci. Comput. 79(3):1854–1881.Crossref, Google Scholar
- [15] (2023) Provably faster gradient descent via long steps. Preprint, submitted July 12, https://arxiv.org/abs/2307.06324.Google Scholar
- [16] (2023) Accelerated gradient descent via long steps. Preprint, submitted September 27, https://arxiv.org/abs/2309.09961.Google Scholar
- [17] (1986) A nonmonotone line search technique for Newton’s method. SIAM J. Numer. Anal. 23(4):707–716.Crossref, Google Scholar
- [18] (2023) Optimal and parameter-free gradient minimization methods for convex and nonconvex optimization. Preprint, submitted October 18, https://arxiv.org/abs/2310.12139.Google Scholar
- [19] (2023) On the convergence of adaptive first order methods: Proximal gradient and alternating minimization algorithms. Preprint, submitted November 30, https://arxiv.org/abs/2311.18431.Google Scholar
- [20] (2023) Adaptive proximal algorithms for convex optimization under local Lipschitz continuity of the gradient. Preprint, submitted January 11, https://arxiv.org/abs/2301.04431.Google Scholar
- [21] (2023) A simple uniformly optimal method without line search for convex optimization. Preprint, submitted October 16, https://arxiv.org/abs/2310.10082.Google Scholar
- [22] (2021) An average curvature accelerated composite gradient method for nonconvex smooth composite optimization problems. SIAM J. Optim. 31(1):217–243.Crossref, Google Scholar
- [23] (2023) Average curvature FISTA for nonconvex smooth composite optimization problems. Comput. Optim. Appl. 86(1):275–302.Crossref, Google Scholar
- [24] (2021) A FISTA-type accelerated gradient algorithm for solving smooth nonconvex composite optimization problems. Comput. Optim. Appl. 79(3):649–679.Crossref, Google Scholar
- [25] (2020) Adaptive gradient descent without descent. Proc. 37th Internat. Conf. Machine Learn. (ACM, New York).Google Scholar
- [26] (2024) Adaptive proximal gradient method for convex optimization. 38th Conf. Neural Inform. Processing Systems (Curran Associates, Inc., Red Hook, NY).Google Scholar
- [27] (2010) Adaptive bound optimization for online convex optimization. Proc. 23rd Annu. Conf. Learn. Theory 2010.Google Scholar
- [28] (2004) Introductory Lectures on Convex Optimization: A Basic Course, Applied Optimization, vol. 87 (Kluwer Academic Publishers, Boston).Crossref, Google Scholar
- [29] (2013) Gradient methods for minimizing composite functions. Math. Program. 140(1):125–161.Crossref, Google Scholar
- [30] (2006) Cubic regularization of Newton method and its global performance. Math. Program. 108(1):177–205.Crossref, Google Scholar
- [31] (2013) Estimation of spectral bounds in gradient algorithms. Acta Appl. Math. 127:117–136.Crossref, Google Scholar
- [32] (1993) On the Barzilai and Borwein choice of steplength for the gradient method. IMA J. Numer. Anal. 13(3):321–326.Crossref, Google Scholar
- [33] (1997) The Barzilai and Borwein gradient method for the large scale unconstrained minimization problem. SIAM J. Optim. 7(1):26–33.Crossref, Google Scholar
- [34] (2016) Barzilai-Borwein step size for stochastic gradient descent. 30th Conf. Neural Inform. Processing Systems (Curran Associates, Inc., Red Hook, NY).Google Scholar
- [35] (2017) Exact worst-case performance of first-order methods for composite convex optimization. SIAM J. Optim. 27(3):1283–1313.Crossref, Google Scholar
- [36] (2017) Smooth strongly convex interpolation and exact worst-case performance of first-order methods. Math. Program. 161(1–2):307–345.Crossref, Google Scholar
- [37] (2023) An elementary approach to tight worst case complexity analysis of gradient based methods. Math. Program. 201(1–2):63–96.Crossref, Google Scholar

