Simultaneous Dimension Reduction and Variable Selection for Multinomial Logistic Regression

Canhong Wen
Canhong Wen
[email protected]
https://orcid.org/0000-0003-0220-9986
International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, China;
Search for more papers by this author
,
Zhenduo Li
Zhenduo Li
[email protected]
International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, China;
Search for more papers by this author
,
Ruipeng Dong
Corresponding Author
Ruipeng Dong
[email protected]
https://orcid.org/0000-0002-5073-4470
International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, China;
Search for more papers by this author
,
Yijin Ni
Yijin Ni
[email protected]
Industrial and System Engineering, Georgia Institute of Technology, 30318 Atlanta, Georgia;
Search for more papers by this author
,
Wenliang Pan
Corresponding Author
Wenliang Pan
[email protected]
https://orcid.org/0000-0002-9821-6461
Key Laboratory of Systems and Control, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
Search for more papers by this author

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, China;

Search for more papers by this author

Zhenduo Li

[email protected]

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, China;

Search for more papers by this author

Ruipeng Dong

Corresponding Author

Ruipeng Dong

[email protected]

https://orcid.org/0000-0002-5073-4470

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, China;

Search for more papers by this author

Yijin Ni

[email protected]

Industrial and System Engineering, Georgia Institute of Technology, 30318 Atlanta, Georgia;

Search for more papers by this author

Wenliang Pan

Corresponding Author

Wenliang Pan

[email protected]

https://orcid.org/0000-0002-9821-6461

Key Laboratory of Systems and Control, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China

Search for more papers by this author

Published Online:9 May 2023https://doi.org/10.1287/ijoc.2022.0132

References

Bayaga A (2010) Multinomial logistic regression: Usage and application in risk analysis. J. Appl. Quant. Methods 5(2):288–297.Google Scholar
Bertsimas D, Mundru N (2020) Sparse convex regression. INFORMS J. Comput. 33(1):262–279.Link, Google Scholar
Bertsimas D, King A, Mazumder R (2016) Best subset selection via a modern optimization lens. Annals Statist. 44(2):813–852.Crossref, Google Scholar
Bickel PJ, Ritov Y, Tsybakov AB (2009) Simultaneous analysis of lasso and dantzig selector. Ann. Statist. 37(4):1705–1732.Crossref, Google Scholar
Bunea F, She Y, Wegkamp MH (2012) Joint variable and rank selection for parsimonious estimation of high-dimensional matrices. Ann. Statist. 40(5):2359–2388.Crossref, Google Scholar
Cawley GC, Talbot NL, Girolami M (2006) Sparse multinomial logistic regression via Bayesian l1 regularisation. Proc. 19th Internat. Conf. Neural Inform. Processing Systems, 209–216.Google Scholar
Chen K (2016) Model diagnostics in reduced-rank estimation. Statist. Interface 9(4):469.Crossref, Google Scholar
Chen L, Huang JZ (2012) Sparse reduced-rank regression for simultaneous dimension reduction and variable selection. J. Amer. Statist. Assoc. 107(500):1533–1545.Crossref, Google Scholar
Chen K, Dong R, Xu W, Zheng Z (2022) Fast stagewise sparse factor regression. J. Machine Learn. Res. 23(271):1–45.Google Scholar
Chen K, Hoffman EA, Seetharaman I, Jiao F, Lin CL, Chan KS (2016) Linking lung airway structure to pulmonary function via composite bridge regression. Ann. Appl. Statist. 10(4):1880.Crossref, Google Scholar
Cheng Y, Wang X, Xia Y (2020) Supervised t-distributed stochastic neighbor embedding for data visualization and classification. INFORMS J. Comput. 33(2):566–585.Google Scholar
Choi S, Hoffman EA, Wenzel SE, Castro M, Fain SB, Jarjour NN, Schiebler ML, et al. (2015) Quantitative assessment of multiscale structural and functional alterations in asthmatic populations. J. Appl. Physiol. 118(10):1286–1298.Crossref, Google Scholar
Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96(456):1348–1360.Crossref, Google Scholar
Fan Y, Lv J (2014) Asymptotic properties for combined l 1 and concave regularization. Biometrika 101(1):57–70.Crossref, Google Scholar
Fan J, Peng H (2004) Nonconcave penalized likelihood with a diverging number of parameters. Ann. Statist. 32(3):928–961.Crossref, Google Scholar
Hastie T, Tibshirani R, Friedman J (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Springer Science & Business Media, Boston).Crossref, Google Scholar
Huang J, Jiao Y, Liu Y, Lu X (2018) A constructive approach to l0 penalized regression. J. Machine Learn. Res. 19(1):403–439.Google Scholar
Izenman AJ (1975) Reduced-rank regression for the multivariate linear model. J. Multivariate Anal. 5(2):248–264.Crossref, Google Scholar
Jiang S, Fang SC, Jin Q (2020) Sparse solutions by a quadratically constrained ℓq(0<q<1) minimization model. INFORMS J. Comput. 33(2):511–530.Google Scholar
Krishnapuram B, Carin L, Figueiredo MA, Hartemink AJ (2005) Sparse multinomial logistic regression: Fast algorithms and generalization bounds. IEEE Trans. Pattern Anal. Machine Intelligence 27(6):957–968.Crossref, Google Scholar
Lange K (2016) MM Optimization Algorithms (Society for Industrial and Applied Mathematics, Philadelphia).Crossref, Google Scholar
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc. IEEE 86(11):2278–2324.Crossref, Google Scholar
Lee G, O’Leary JT, Lee SH, Morrison A (2002) Comparison and contrast of push and pull motivational effects on trip behavior: An application of a multinomial logistic regression model. Tourist Anal. 7(2):89–104.Crossref, Google Scholar
Li J, Bioucas-Dias JM, Plaza A (2010) Semisupervised hyperspectral image segmentation using multinomial logistic regression with active learning. IEEE Trans. Geosci. Remote Sensing 48(11):4085–4098.Google Scholar
Li Y, Yang Q, Yang L, Lei N, Zheng K (2019) A scalable electrochemical dehydrogenative cross-coupling of p(o)h compounds with rsh/roh. Chemical Comm. 55:4981–4984.Crossref, Google Scholar
Liu N, Ma Y, Topaloglu H (2020) Assortment optimization under the multinomial logit model with sequential offerings. INFORMS J. Comput. 32(3):835–853.Link, Google Scholar
Maxwell MS, Restrepo M, Henderson SG, Topaloglu H (2010) Approximate dynamic programming for ambulance redeployment. INFORMS J. Comput. 22(2):266–281.Link, Google Scholar
Meier L, Van De Geer S, Bühlmann P (2008) The group lasso for logistic regression. J. Royal Statist. Soc. Ser. B Statist. Methodology 70(1):53–71.Crossref, Google Scholar
Mistry M, Letsios D, Krennrich G, Lee RM, Misener R (2020) Mixed-integer convex nonlinear optimization with gradient-boosted trees embedded. INFORMS J. Comput. 33(3):1103–1119.Link, Google Scholar
Pandya D, Upadhyay S, Harsha SP (2014) Fault diagnosis of rolling element bearing by using multinomial logistic regression and wavelet packet transform. Soft Comput. 18(2):255–266.Crossref, Google Scholar
Połap D, Srivastava G, Yu K (2021) Agent architecture of an intelligent medical system based on federated learning and blockchain technology. J. Inform. Security Appl. 58:102748.Google Scholar
Ripley B, Venables W, Ripley MB (2016) Package ‘nnet’. R package version 7(3-12):700 (R, Vienna).Google Scholar
She Y (2017) Selective factor extraction in high dimensions. Biometrika 104(1):97–110.Google Scholar
Simon N, Friedman J, Hastie T (2013) A blockwise descent algorithm for group-penalized multiresponse and multinomial regression. Preprint, submitted November 26, https://arxiv.org/abs/1311.6529.Google Scholar
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J. Royal Statist. Soc. B 58:267–288.Crossref, Google Scholar
Tutz G, Pössnecker W, Uhlmann L (2015) Variable selection in general multinomial logit models. Comput. Statist. Data Anal. 82:207–222.Crossref, Google Scholar
Uematsu Y, Fan Y, Chen K, Lv J, Lin W (2019) Sofar: Large-scale association network learning. IEEE Trans. Inform. Theory 65(8):4924–4939.Crossref, Google Scholar
Van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J. Machine Learn. Res. 9(November):2579–2605.Google Scholar
Vincent M, Hansen NR (2014) Sparse group lasso and high dimensional multinomial classification. Comput. Statist. Data Anal. 71:771–786.Crossref, Google Scholar
Wang X (2009) Dimension reduction techniques in quasi-Monte Carlo methods for option pricing. INFORMS J. Comput. 21(3):488–504.Link, Google Scholar
Wang Y (2005) A multinomial logistic regression modeling approach for anomaly intrusion detection. Comput. Security 24(8):662–674.Crossref, Google Scholar
Wang L, Peng B, Bradic J, Li R, Wu Y (2020) A tuning-free robust and efficient approach to high-dimensional regression. J. Amer. Statist. Assoc. 115(532):1700–1714.Crossref, Google Scholar
Wen C, Zhang A, Quan S, Wang X (2020) Bess: An r package for best subset selection in linear, logistic and cox proportional hazards models. J. Statist. Software 94(4):1–24.Crossref, Google Scholar
Won D, Manzour H, Chaovalitwongse W (2020) Convex optimization for group feature selection in networked data. INFORMS J. Comput. 32(1):182–198.Link, Google Scholar
Yee TW, Hastie TJ (2003) Reduced-rank vector generalized linear models. Statist. Modeling 3(1):15–41.Crossref, Google Scholar
Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J. Royal Statist. Soc. Ser. B Statist. Methodology 68(1):49–67.Crossref, Google Scholar
Zheng Z, Fan Y, Lv J (2014) High dimensional thresholded regression and shrinkage effect. J. Royal Statist. Soc. Ser. B Statist. Methodology 76(3):627–649.Crossref, Google Scholar
Zheng Z, Bahadori MT, Liu Y, Lv J (2019) Scalable interpretable multi-response regression via seed. J. Machine Learn. Res. 20(107):1–34.Google Scholar
Zhu J, Wen C, Zhu J, Zhang H, Wang X (2020) A polynomial algorithm for best-subset selection problem. Proc. National Acad. Sci. USA 117(52):33117–33123.Crossref, Google Scholar

cover image INFORMS Journal on Computing

Volume 35, Issue 5

September-October 2023

Pages 909-1213, C2

Article Information

Supplemental Material

Metrics

Information

Received:May 12, 2022
Accepted:March 07, 2023
Published Online:May 09, 2023

Cite as

Canhong Wen, Zhenduo Li, Ruipeng Dong, Yijin Ni, Wenliang Pan (2023) Simultaneous Dimension Reduction and Variable Selection for Multinomial Logistic Regression. INFORMS Journal on Computing 35(5):1044-1060.

https://doi.org/10.1287/ijoc.2022.0132

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Simultaneous Dimension Reduction and Variable Selection for Multinomial Logistic Regression

References

Volume 35, Issue 5

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News