A Stochastic Sequential Quadratic Optimization Algorithm for Nonlinear-Equality-Constrained Optimization with Rank-Deficient Jacobians
References
- [1] (2021) Sequential quadratic optimization for nonlinear equality constrained stochastic optimization. SIAM J. Optim. 31(2):1352–1379.Crossref, Google Scholar
- [2] (2018) Optimization methods for large-scale machine learning. SIAM Rev. 60(2):223–311.Crossref, Google Scholar
- [3] (2011) LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3):1–27.Crossref, Google Scholar
- [4] (2018) Constraint-aware deep neural network compression. Proceedings of the European Conference on Computer Vision (ECVC), 400–415.Google Scholar
- [5] (2022) Scientific machine learning through physics–informed neural networks: Where we are and what’s next. J. Sci. Comput. 92(88).Crossref, Google Scholar
- [6] (2009) A matrix-free algorithm for equality constrained optimization problems with rank deficient Jacobians. SIAM J. Optim. 20(3):1224–1249.Crossref, Google Scholar
- [7] (2023) Worst-case complexity of an SQP method for nonlinear equality constrained stochastic optimization. Math. Program., ePub ahead of print June 7, https://doi.org/10.1007/s10107-023-01981-1.Crossref, Google Scholar
- [8] (2023) Sequential quadratic optimization for stochastic optimization with deterministic nonlinear inequality and equality constraints. Preprint, submitted February 28, https://arxiv.org/abs/2302.14790.Google Scholar
- [9] (2023) A stochastic-gradient-based interior-point algorithm for solving smooth bound-constrained optimization problems. Preprint, submitted April 28, https://arxiv.org/abs/2304.14907.Google Scholar
- [10] (2020) Stochastic subgradient method converges on tame functions. Found. Comput. Math. 20:119–154.Crossref, Google Scholar
- [11] (2022) Fully stochastic trust-region sequential quadratic programming for equality-constrained optimization problems. Preprint, submitted November 29, https://arxiv.org/abs/2211.15943.Google Scholar
- [13] (2015) CUTEst: A constrained and unconstrained testing environment with safe threads for mathematical optimization. Comput. Optim. Appl. 60:545–557.Crossref, Google Scholar
- [12] (1999) Solving the trust-region subproblem using the Lanczos method. SIAM J. Optim. 9(2):504–525.Crossref, Google Scholar
- [14] (1977) A globally convergent method for nonlinear programming. J. Optim. Theory Appl. 22(3):297–309.Crossref, Google Scholar
- [15] (1979) Exact penalty functions in nonlinear programming. Math. Program. 17:251–269.Crossref, Google Scholar
- [16] (2016) Variance-reduced and projection-free stochastic optimization. Proceedings of International Conference on Machine Learning (ICML), 1263–1271.Google Scholar
- [17] (2021) Physics-informed machine learning. Nat. Rev. Phys. 3:422–440.Crossref, Google Scholar
- [18] (2018) Geometry aware constrained optimization techniques for deep learning. Proceedings of Computer Vision and Pattern Recognition (CVPR), 4460–4469.Google Scholar
- [19] (2019) Stochastic Frank-Wolfe for composite convex minimization. Proceedings of Neural Information Processing Systems (NeurIPS), 14269–14279.Google Scholar
- [20] (2020) Generalized stochastic Frank-Wolfe algorithm with stochastic “substitute” gradient for structured convex optimization. Math. Program. 187:317–349.Google Scholar
- [21] (2021) Physics-informed neural networks with hard constraints for inverse design SIAM J. Sci. Comput. 43(6):B1105–B1132.Google Scholar
- [22] (2023) An adaptive stochastic sequential quadratic programming with differentiable exact augmented lagrangians. Math. Program. 199:721–791.Crossref, Google Scholar
- [23] (2019) A primal-dual formulation for deep learning with constraints. Proceedings of Neural Information Processing Systems (NeurIPS), 12157–12168.Google Scholar
- [24] (2023) Learning differentiable solvers for systems with hard constraints. Preprint, submitted July 18, https://arxiv.org/abs/2207.08675.Google Scholar
- [25] (2006) Numerical optimization. Springer Series in Operations Research and Financial Engineering (Springer-Verlag, New York).Google Scholar
- [26] (1989) Trust region algorithms for optimization with nonlinear equality and inequality constraints. PhD thesis, University of Colorado, Boulder, CO.Google Scholar
- [27] (1978) A fast algorithm for nonlinearly constrained optimization calculations. Numerical Analysis, Lecture Notes in Mathematics, vol. 630 (Springer, Berlin), 144–157.Google Scholar
- [28] (2019) Explicitly imposing constraints in deep networks via conditional gradients gives improved generalization and faster convergence. Proc. Conf. AAAI Artif. Intell. 33:4772–4779.Google Scholar
- [29] (2016) Stochastic Frank-Wolfe methods for nonconvex optimization. 2016 54th Annual Allerton Conference, 1244–1251 (IEEE, Piscataway, NJ).Google Scholar
- [30] (1951) A stochastic approximation method. Ann. Math. Statist. 22(3):400–407.Crossref, Google Scholar
- [31] (1971) A convergence theorem for nonnegative almost supermartingales and some applications. Rustagi JS, ed., Optimizing Methods in Statistics (Academic Press, New York).Google Scholar
- [32] (1983) The conjugate gradient method and trust regions in large scale optimization. SIAM J. Numer. Anal. 20(3):626–637.Crossref, Google Scholar
- [33] (2003) Elementary Probability (Cambridge University Press, Cambridge).Crossref, Google Scholar
- [34] (2000) Failure of global convergence for a class of interior point methods for nonlinear programming. Math. Program. 88(3):565–574.Crossref, Google Scholar
- [35] (2020) One sample stochastic Frank-Wolfe. AISTATS 2020:4012–4023.Google Scholar

