Coordinate Descent Without Coordinates: Tangent Subspace Descent on Riemannian Manifolds
Published Online:14 Apr 2022https://doi.org/10.1287/moor.2022.1253
References
- [1] (2007) Optimization Algorithms on Matrix Manifolds (Princeton University Press, Princeton, NJ).Google Scholar
- [2] (2013) On the convergence of block coordinate descent type methods. SIAM J. Optim. 23(4):2037–2060.Crossref, Google Scholar
- [3] (1991) On the spectra of sums of orthogonal projections with applications to parallel computing. BIT 31(1):76–88.Crossref, Google Scholar
- [4] (2020) Proximal gradient method for nonsmooth optimization over the Stiefel manifold. SIAM J. Optim. 30(1):210–239.Google Scholar
- [5] (1995) The angle between subspaces of a Hilbert space. Approximation Theory, Wavelets and Applications (Springer, Berlin), 107–130.Crossref, Google Scholar
- [6] (1998) The geometry of algorithms with orthogonality constraints. SIAM J. Matrix Anal. Appl. 20(2):303–353.Crossref, Google Scholar
- [7] (1998) Subgradient algorithm on Riemannian manifolds. J. Optim. Theory Appl. 97(1):93–104.Crossref, Google Scholar
- [8] (2019) Gradient method for optimization on Riemannian manifolds with lower bounded curvature. SIAM J. Optim. 29(4):2517–2541.Crossref, Google Scholar
- [9] (2015) Convergence analysis of prediction markets via randomized subspace descent. Cortes C, Lawrence ND, Lee DD, Sugiyama M, Garnett R, eds. Advances in Neural Information Processing Systems (Curran Associates), 3034–3042.Google Scholar
- [10] (2018) A new first-order algorithmic framework for optimization problems with orthogonality constraints. SIAM J. Optim. 28(1):302–332.Crossref, Google Scholar
- [11] (2013) Matrix Computations, 4th ed. (Johns Hopkins University Press, Baltimore, MD).Crossref, Google Scholar
- [12] (2022) Riemannian proximal gradient methods. Math. Programming Forthcoming.Crossref, Google Scholar
- [13] (1954) Normal multivariate analysis and the orthogonal group. Ann. Math. Statist. 25(1):40–75.Crossref, Google Scholar
- [14] (2007) Optimization on the orthogonal group for independent component analysis. Davies ME, James CJ, Abdallah SA, Plumbley MD, eds. Independent Component Analysis and Signal Separation (Springer, Berlin), 57–64.Crossref, Google Scholar
- [15] (2010) Generalized power method for sparse principal component analysis. J. Machine Learn. Res. 11(15):517–553.Google Scholar
- [16] (2003) ICA using spacings estimates of entropy. J. Machine Learn. Res. 4:1271–1295.Google Scholar
- [17] (2012) Introduction to Smooth Manifolds, vol. 218. Graduate Texts in Mathematics, 2nd ed. (Springer-Verlag New York).Crossref, Google Scholar
- [18] (2018) Introduction to Riemannian Manifolds, vol. 176. Graduate Texts in Mathematics, 2nd ed. (Springer International Publishing).Crossref, Google Scholar
- [19] (2010) Randomized methods for linear constraints: Convergence rates and conditioning. Math. Oper. Res. 35(3):641–654.Link, Google Scholar
- [20] (2008) Alternating projections on manifolds. Math. Oper. Res. 33(1):216–234.Link, Google Scholar
- [21] (2020) Simple algorithms for optimization on Riemannian manifolds with constraints. Appl. Math. Optim. 82:949–981.Crossref, Google Scholar
- [22] (2017) Accelerated first-order methods for geodesically convex optimization on Riemannian manifolds. Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R, eds. Advances in Neural Information Processing Systems (Curran Associates), 4868–4877.Crossref, Google Scholar
- [23] (2012) Efficiency of coordinate descent methods on huge-scale optimization problems. SIAM J. Optim. 22(2):341–362.Crossref, Google Scholar
- [24] (2014) Iteration complexity of randomized block-coordinate descent methods for minimizing a composite function. Math. Programming 144:1–38.Crossref, Google Scholar
- [25] (2019) Riemannian stochastic variance reduced gradient algorithm with retraction and vector transport. SIAM J. Optim. 29(2):1444–1472.Crossref, Google Scholar
- [26] (2014) Coordinate-descent for learning orthogonal matrices through Givens rotations. Xing EP, Jebara T, eds. Proc. 31st Internat. Conf. on Machine Learn., vol. 32 (PMLR), 548–556.Google Scholar
- [27] (2018) Geodesic convex optimization: Differentiation on manifolds, geodesics, and convexity. Preprint, submitted June 17, https://arxiv.org/abs/1806.06373.Google Scholar
- [28] (2015) Alternating proximal gradient method for sparse nonnegative Tucker decomposition. Math. Programming Comput. 7(1):39–70.Crossref, Google Scholar
- [29] (2007) A trust region direct constrained minimization algorithm for the Kohn-Sham equation. SIAM J. Sci. Comput. 29(5):1854–1875.Crossref, Google Scholar
- [30] (2018) An estimate sequence for geodesically convex optimization. Bubeck S, Perchet V, Rigollet P, eds. Proc. 31st Conf. on Learn. Theory, vol. 75 (PMLR), 1703–1723.Google Scholar
- [31] (2019) Primal-dual optimization algorithms over Riemannian manifolds: An iteration complexity analysis. Math. Programming 184:445–490.Crossref, Google Scholar
- [32] (2016) Riemannian SVRG: Fast stochastic optimization on Riemannian manifolds. Lee DD, Sugiyama M, Luxburg UV, Guyon I, Garnett R, eds. Advances in Neural Information Processing Systems (Curran Associates), 4592–4600.Google Scholar
- [33] (2019) Non-smooth optimization over Stiefel manifolds with applications to dimensionality reduction and graph clustering. Proc. 28th Internat. Joint Conf. on Artificial Intelligence (AAAI Press), 1319–1326.Google Scholar

