On the Uniform Convergence of Subdifferentials in Stochastic Optimization and Learning
References
- [1] (2013) Basic Topology (Springer, New York).Google Scholar
- [2] (1975) A strong law of large numbers for random compact sets. Ann. Probab. 3(5):879–882.Crossref, Google Scholar
- [3] (1977) Convergence de fonctions convexes, des sous-différentiels et semi-groupes associés. CR Acad. Sci. Paris 284:539–542.Google Scholar
- [4] (1993) On the convergence of subdifferentials of convex functions. Arch. Math. 60(4):389–400.Crossref, Google Scholar
- [5] (2009) Set-Valued Analysis (Springer, New York).Crossref, Google Scholar
- [6] (1965) Integrals of set-valued functions. J. Math. Anal. Appl. 12(1):1–12.Crossref, Google Scholar
- [7] (2019) Subgradient descent learns orthogonal dictionaries. Seventh Internat. Conf. Learn. Representation (ICLR, Appleton, WI).Google Scholar
- [8] (1986) Probability and Measure, 2nd ed. (Wiley, New York).Google Scholar
- [9] (2010) Characterizations of Lojasiewicz inequalities: Subgradient flows, talweg, convexity. Trans. Amer. Math. Soc. 362(6): 3319–3363.Google Scholar
- [10] (2010) Convex functions: Constructions. Characterizations and Counterexamples, vol. 109 (Cambridge University Press, Cambridge, UK).Google Scholar
- [11] (1985) Descent methods for composite nondifferentiable optimization problems. Math. Programming 33:260–279.Crossref, Google Scholar
- [12] (2015) Phase retrieval via Wirtinger flow: Theory and algorithms. IEEE Trans. Inform. Theory 61(4):1985–2007.Crossref, Google Scholar
- [13] (2021) Composite optimization for robust rank one bilinear sensing. Inform. Inference 10(2):333–396.Crossref, Google Scholar
- [14] (2021) Low-rank matrix recovery with composite optimization: Good conditioning and rapid convergence. Foundations Comput. Math. 21(6):1505–1593.Crossref, Google Scholar
- [15] (2019) Stochastic model-based minimization of weakly convex functions. SIAM J. Optim. 29(1):207–239.Crossref, Google Scholar
- [16] (2020) Subgradient methods under weak convexity and tame geometry. SIAG/OPT Views News 28:1–10.Google Scholar
- [17] (2022) Graphical convergence of subgradients in nonconvex optimization and learning. Math. Oper. Res. 47(1):209–231.Link, Google Scholar
- [18] (2020) The nonsmooth landscape of phase retrieval. IMA J. Numerical Anal. 40(4):2652–2695.Crossref, Google Scholar
- [19] (2019) The nonsmooth landscape of blind deconvolution. Workshop Optim. Machine Learn.Google Scholar
- [20] (2021) Rank overspecified robust matrix recovery: Subgradient method and exact recovery. Preprint, submitted September 23, https://arxiv.org/abs/2109.11154.Google Scholar
- [21] (2018) Stochastic methods for composite and weakly convex optimization problems. SIAM J. Optim. 28(4):3229–3259.Crossref, Google Scholar
- [22] (2019) Solving (most) of a set of quadratic equalities: Composite optimization for robust phase retrieval. Inform. Inference 8(3):471–529.Crossref, Google Scholar
- [23] (1967) The sizes of compact subsets of Hilbert space and continuity of Gaussian processes. J. Functional Anal. 1(3):290–330.Crossref, Google Scholar
- [24] (2014) Phase retrieval: Stability and recovery guarantees. Appl. Comput. Harmonic Anal. 36(3):473–494.Crossref, Google Scholar
- [25] (2023) Survey descent: A multipoint generalization of gradient descent for nonsmooth optimization. SIAM J. Optim. 33(1):36–62.Crossref, Google Scholar
- [26] (1998) Bootstrap methods for median regression models. Econometrica 66(6):1327–1351.Crossref, Google Scholar
- [27] (2005) Quantile Regression, vol. 38 (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- [28] (1996) An interior point algorithm for nonlinear quantile regression. J. Econometrics 71(1–2):265–283.Crossref, Google Scholar
- [29] (2022) Adaptive data fusion for multi-task non-smooth optimization. Preprint, submitted October 22, https://arxiv.org/abs/2210.12334.Google Scholar
- [30] (2020) Nonconvex robust low-rank matrix recovery. SIAM J. Optim. 30(1):660–686.Crossref, Google Scholar
- [31] (2022) Solving nonsmooth and nonconvex compound stochastic programs with applications to risk measure minimization. Math. Oper. Res. 47(4):3051–3083.Link, Google Scholar
- [32] (2021) Sign-rip: A robust restricted isometry property for low-rank matrix recovery. Adv. Neural Inform. Processing Systems Workshop Optim. Machine Learn.Google Scholar
- [33] (2013) Lectures on Discrete Geometry, vol. 212 (Springer, New York).Google Scholar
- [34] (1964) On the Betti numbers of real varieties. Proc. Amer. Math. Soc. 15(2):275–280.Crossref, Google Scholar
- [35] (2005) Theory of Random Sets, vol. 19 (Springer, New York).Google Scholar
- [36] (1965) Proximité et dualité dans un espace hilbertien. Bull. Soc. Math. France 93:273–299.Crossref, Google Scholar
- [37] (1992) Amenable functions in optimization. Nonsmooth Optimization: Methods and Applications (Erice), 338–353.Google Scholar
- [38] (1996) Prox-regular functions in variational analysis. Trans. Amer. Math. Soc. 348(5):1805–1838.Crossref, Google Scholar
- [39] (2011) Convergence of stationary points of sample average two-stage stochastic programs: A generalized equation approach. Math. Oper. Res. 36(3):568–592.Link, Google Scholar
- [40] (1970) Convex Analysis (Princeton University Press, Princeton, NJ).Crossref, Google Scholar
- [41] (1981) Favorable classes of Lipschitz continuous functions in subgradient optimization.Google Scholar
- [42] (2000) Optimization of conditional value-at-risk. J Risk 2(3):21–42.Crossref, Google Scholar
- [43] (1998) Variational Analysis (Springer, New York).Crossref, Google Scholar
- [44] (1972) On the density of families of sets. J. Combin. Theory Ser. A 13(1):145–147.Crossref, Google Scholar
- [45] (2007) Uniform laws of large numbers for set-valued mappings and subdifferentials of random functions. J. Math. Anal. Appl. 325(2):1390–1399.Crossref, Google Scholar
- [46] (2021) Lectures on Stochastic Programming: Modeling and Theory (SIAM, Philadelphia).Crossref, Google Scholar
- [47] (1972) A combinatorial problem; stability and order for models and theories in infinitary languages. Pacific J. Math. 41(1):247–261.Crossref, Google Scholar
- [48] (2018) A geometric analysis of phase retrieval. Foundations Comput. Math. 18:1131–1198.Crossref, Google Scholar
- [49] (1965) On the homology of real algebraic varieties. Differential and Combinatorial Topology.Crossref, Google Scholar
- [50] (2000) Asymptotic Statistics, vol. 3 (Cambridge University Press, Cambridge, UK).Google Scholar
- [51] (1996) Weak Convergence and Empirical Processes: With Applications to Statistics (Springer, New York).Crossref, Google Scholar
- [52] (2013) The Nature of Statistical Learning Theory (Springer, New York).Google Scholar
- [53] (1971) On the uniform convergence of relative frequencies of events to their probabilities. Theory Probab. Appl. 16(2):264–280.Crossref, Google Scholar
- [54] (2018) High-Dimensional Probability: An Introduction with Applications in Data Science, vol. 47 (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- [55] (2019) High-Dimensional Statistics: A Non-Asymptotic Viewpoint, vol. 48 (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- [56] (2017) Solving systems of random quadratic equations via truncated amplitude flow. IEEE Trans. Inform. Theory 64(2):773–794.Crossref, Google Scholar
- [57] (1968) Lower bounds for approximation by nonlinear manifolds. Trans. Amer. Math. Soc. 133(1):167–178.Crossref, Google Scholar
- [58] (2010) Uniform exponential convergence of sample average random functions under general sampling with applications in stochastic programming. J. Math. Anal. Appl. 368(2):692–710.Crossref, Google Scholar
- [59] (2009) Smooth sample average approximation of stationary points in nonsmooth stochastic optimization and applications. Math. Programming 119:371–401.Crossref, Google Scholar

