Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives
References
- (2009) A strong conic quadratic reformulation for machine-job assignment with controllable processing times. Oper. Res. Lett. 37(3):187–191.Crossref, Google Scholar
- (2021) Sparse PCA: A new scalable estimator based on integer programming. Preprint, submitted September 23, https://arxiv.org/abs/2109.11142.Google Scholar
- (2013) Mixed-integer nonlinear optimization. Acta Numerica 22:1–131. Crossref, Google Scholar
- (2020) Sparse high-dimensional regression: Exact scalable algorithms and phase transitions. Ann. Statist. 48(1):300–323.Crossref, Google Scholar
- (2016) Best subset selection via a modern optimization lens. Ann. Statist. 44(2):813–852.Crossref, Google Scholar
- (2020) Certifiably optimal sparse inverse covariance estimation. Math. Programming 184(1):491–530.Crossref, Google Scholar
- (1975) Statistical analysis of non-lattice data. J. Roy. Statist. Soc. Ser. D Statistician 24(3):179–195.Google Scholar
- (2011) A constrained ell-1 minimization approach to sparse precision matrix estimation. J. Amer. Statist. Assoc. 106(494):594–607.Crossref, Google Scholar
- (2024) Subgradient regularized multivariate convex regression at scale. SIAM J. Optim. 34(3):2350–2377.Crossref, Google Scholar
- (1972) Covariance selection. Biometrics 28(1):157–175.Crossref, Google Scholar
- (2022) Using ℓ1-relaxation and integer programming to obtain dual bounds for sparse PCA. Oper. Res. 70(3):1914–1932.Link, Google Scholar
- (2015) Regularization vs. relaxation: A conic optimization perspective of statistical variable selection. Preprint, submitted October 20, https://arxiv.org/abs/1510.06083.Google Scholar
- (2021) Scalable inference of sparsely-changing Gaussian Markov random fields. Adv. Neural Inform. Processing Systems 34:6529–6541.Google Scholar
- (2006) Perspective cuts for a class of convex 0–1 mixed integer programs. Math. Programming 106(2):225–236.Crossref, Google Scholar
- (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9(3):432–441.Crossref, Google Scholar
- (2010a) Applications of the lasso and grouped lasso to the estimation of sparse graphical models. Technical report, Stanford University, Stanford, CA.Google Scholar
- (2010b) Regularization paths for generalized linear models via coordinate descent. J. Statist. Software 33(1):1.Crossref, Google Scholar
- (2010) Perspective reformulations of mixed integer nonlinear programs with indicator variables. Math. Programming 124(1):183–205.Crossref, Google Scholar
- (2015) Statistical Learning with Sparsity: The Lasso and Generalizations, Monographs on Statistics and Applied Probability, vol. 143 (CRC Press, Boca Raton, FL).Google Scholar
- (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction, vol. 2 (Springer, Berlin).Crossref, Google Scholar
- (2020) Fast best subset selection: Coordinate descent and local combinatorial optimization algorithms. Oper. Res. 68(5):1517–1537.Link, Google Scholar
- (2022) Sparse regression at scale: Branch-and-bound rooted in first-order optimization. Math. Programming 196(1–2):347–388.Crossref, Google Scholar
- (2009) Estimation of sparse binary pairwise Markov networks using pseudo-likelihoods. J. Machine Learn. Res. 10(4):883–906. Google Scholar
- (2015) A Convex pseudolikelihood framework for high dimensional partial correlation estimation with convergence guarantees. J. Royal Statist. Soc. Series B: Statist. Methodology 77(4):803–825. Crossref, Google Scholar
- (1952) Portfolio selection. J. Finance 7(1):77–91.Google Scholar
- (2012) The graphical lasso: New insights and alternatives. Electronic J. Statist. 6:2125–2149. Crossref, Google Scholar
- (2023) Subset selection with shrinkage: Sparse linear modeling when the SNR is low. Oper. Res. 71(1):129–147.Link, Google Scholar
- (2006) High-dimensional graphs and variable selection with the Lasso. Ann. Statist. 34(3):1436–1462.Crossref, Google Scholar
- (2020) Information theoretic optimal learning of Gaussian graphical models. Abernethy J, Agarwal S, eds. Proc. Conf. Learn. Theory (PMLR, New York), 2888–2909.Google Scholar
- (2007) A robust hybrid of lasso and ridge regression. Contemporary Math. 443(7):59–72.Crossref, Google Scholar
- (2009) Partial correlation estimation by joint sparse regression models. J. Amer. Statist. Assoc. 104(486):735–746.Crossref, Google Scholar
- (2008) Model selection in gaussian graphical models: High-dimensional consistency of l1-regularized MLE. Koller D, Schuurmans D, Bengio Y, Bottou L, eds Adv. Neural Inform. Processing Systems (Curran Associates Inc., Red Hook, NY), 1329–1336.Google Scholar
- (2018) Interactive supercomputing on 40,000 cores for machine learning and data analysis. Proc. IEEE High Performance Extreme Comput. Conf. (IEEE, Piscataway, NJ), 1–6.Google Scholar
- (2008) Sparse permutation invariant covariance estimation. Electronic J. Statist. 2:494–515.Crossref, Google Scholar
- (2001) Convergence of a block coordinate descent method for nondifferentiable minimization. J. Optim. Theory Appl. 109(3):475–494.Crossref, Google Scholar
- (2019) High-Dimensional Statistics: A Non-Asymptotic Viewpoint, vol. 48 (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (2010) Information-theoretic bounds on model selection for Gaussian Markov random fields. Proc. IEEE Internat. Sympos. Inform. Theory (IEEE, Piscataway, NJ), 1373–1377.Google Scholar
- (1999) Integer and Combinatorial Optimization, vol. 55 (John Wiley & Sons, New York).Google Scholar
- (2020) Scalable algorithms for the sparse ridge regression. SIAM J. Optim. 30(4):3359–3386.Crossref, Google Scholar

