Simultaneous Dimension Reduction and Variable Selection for Multinomial Logistic Regression

Published Online:https://doi.org/10.1287/ijoc.2022.0132

References

  • Bayaga A (2010) Multinomial logistic regression: Usage and application in risk analysis. J. Appl. Quant. Methods 5(2):288–297.Google Scholar
  • Bertsimas D, Mundru N (2020) Sparse convex regression. INFORMS J. Comput. 33(1):262–279.LinkGoogle Scholar
  • Bertsimas D, King A, Mazumder R (2016) Best subset selection via a modern optimization lens. Annals Statist. 44(2):813–852.CrossrefGoogle Scholar
  • Bickel PJ, Ritov Y, Tsybakov AB (2009) Simultaneous analysis of lasso and dantzig selector. Ann. Statist. 37(4):1705–1732.CrossrefGoogle Scholar
  • Bunea F, She Y, Wegkamp MH (2012) Joint variable and rank selection for parsimonious estimation of high-dimensional matrices. Ann. Statist. 40(5):2359–2388.CrossrefGoogle Scholar
  • Cawley GC, Talbot NL, Girolami M (2006) Sparse multinomial logistic regression via Bayesian l1 regularisation. Proc. 19th Internat. Conf. Neural Inform. Processing Systems, 209–216.Google Scholar
  • Chen K (2016) Model diagnostics in reduced-rank estimation. Statist. Interface 9(4):469.CrossrefGoogle Scholar
  • Chen L, Huang JZ (2012) Sparse reduced-rank regression for simultaneous dimension reduction and variable selection. J. Amer. Statist. Assoc. 107(500):1533–1545.CrossrefGoogle Scholar
  • Chen K, Dong R, Xu W, Zheng Z (2022) Fast stagewise sparse factor regression. J. Machine Learn. Res. 23(271):1–45.Google Scholar
  • Chen K, Hoffman EA, Seetharaman I, Jiao F, Lin CL, Chan KS (2016) Linking lung airway structure to pulmonary function via composite bridge regression. Ann. Appl. Statist. 10(4):1880.CrossrefGoogle Scholar
  • Cheng Y, Wang X, Xia Y (2020) Supervised t-distributed stochastic neighbor embedding for data visualization and classification. INFORMS J. Comput. 33(2):566–585.Google Scholar
  • Choi S, Hoffman EA, Wenzel SE, Castro M, Fain SB, Jarjour NN, Schiebler ML, et al. (2015) Quantitative assessment of multiscale structural and functional alterations in asthmatic populations. J. Appl. Physiol. 118(10):1286–1298.CrossrefGoogle Scholar
  • Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96(456):1348–1360.CrossrefGoogle Scholar
  • Fan Y, Lv J (2014) Asymptotic properties for combined l 1 and concave regularization. Biometrika 101(1):57–70.CrossrefGoogle Scholar
  • Fan J, Peng H (2004) Nonconcave penalized likelihood with a diverging number of parameters. Ann. Statist. 32(3):928–961.CrossrefGoogle Scholar
  • Hastie T, Tibshirani R, Friedman J (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Springer Science & Business Media, Boston).CrossrefGoogle Scholar
  • Huang J, Jiao Y, Liu Y, Lu X (2018) A constructive approach to l0 penalized regression. J. Machine Learn. Res. 19(1):403–439.Google Scholar
  • Izenman AJ (1975) Reduced-rank regression for the multivariate linear model. J. Multivariate Anal. 5(2):248–264.CrossrefGoogle Scholar
  • Jiang S, Fang SC, Jin Q (2020) Sparse solutions by a quadratically constrained ℓq(0<q<1) minimization model. INFORMS J. Comput. 33(2):511–530.Google Scholar
  • Krishnapuram B, Carin L, Figueiredo MA, Hartemink AJ (2005) Sparse multinomial logistic regression: Fast algorithms and generalization bounds. IEEE Trans. Pattern Anal. Machine Intelligence 27(6):957–968.CrossrefGoogle Scholar
  • Lange K (2016) MM Optimization Algorithms (Society for Industrial and Applied Mathematics, Philadelphia).CrossrefGoogle Scholar
  • LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc. IEEE 86(11):2278–2324.CrossrefGoogle Scholar
  • Lee G, O’Leary JT, Lee SH, Morrison A (2002) Comparison and contrast of push and pull motivational effects on trip behavior: An application of a multinomial logistic regression model. Tourist Anal. 7(2):89–104.CrossrefGoogle Scholar
  • Li J, Bioucas-Dias JM, Plaza A (2010) Semisupervised hyperspectral image segmentation using multinomial logistic regression with active learning. IEEE Trans. Geosci. Remote Sensing 48(11):4085–4098.Google Scholar
  • Li Y, Yang Q, Yang L, Lei N, Zheng K (2019) A scalable electrochemical dehydrogenative cross-coupling of p(o)h compounds with rsh/roh. Chemical Comm. 55:4981–4984.CrossrefGoogle Scholar
  • Liu N, Ma Y, Topaloglu H (2020) Assortment optimization under the multinomial logit model with sequential offerings. INFORMS J. Comput. 32(3):835–853.LinkGoogle Scholar
  • Maxwell MS, Restrepo M, Henderson SG, Topaloglu H (2010) Approximate dynamic programming for ambulance redeployment. INFORMS J. Comput. 22(2):266–281.LinkGoogle Scholar
  • Meier L, Van De Geer S, Bühlmann P (2008) The group lasso for logistic regression. J. Royal Statist. Soc. Ser. B Statist. Methodology 70(1):53–71.CrossrefGoogle Scholar
  • Mistry M, Letsios D, Krennrich G, Lee RM, Misener R (2020) Mixed-integer convex nonlinear optimization with gradient-boosted trees embedded. INFORMS J. Comput. 33(3):1103–1119.LinkGoogle Scholar
  • Pandya D, Upadhyay S, Harsha SP (2014) Fault diagnosis of rolling element bearing by using multinomial logistic regression and wavelet packet transform. Soft Comput. 18(2):255–266.CrossrefGoogle Scholar
  • Połap D, Srivastava G, Yu K (2021) Agent architecture of an intelligent medical system based on federated learning and blockchain technology. J. Inform. Security Appl. 58:102748.Google Scholar
  • Ripley B, Venables W, Ripley MB (2016) Package ‘nnet’. R package version 7(3-12):700 (R, Vienna).Google Scholar
  • She Y (2017) Selective factor extraction in high dimensions. Biometrika 104(1):97–110.Google Scholar
  • Simon N, Friedman J, Hastie T (2013) A blockwise descent algorithm for group-penalized multiresponse and multinomial regression. Preprint, submitted November 26, https://arxiv.org/abs/1311.6529.Google Scholar
  • Tibshirani R (1996) Regression shrinkage and selection via the lasso. J. Royal Statist. Soc. B 58:267–288.CrossrefGoogle Scholar
  • Tutz G, Pössnecker W, Uhlmann L (2015) Variable selection in general multinomial logit models. Comput. Statist. Data Anal. 82:207–222.CrossrefGoogle Scholar
  • Uematsu Y, Fan Y, Chen K, Lv J, Lin W (2019) Sofar: Large-scale association network learning. IEEE Trans. Inform. Theory 65(8):4924–4939.CrossrefGoogle Scholar
  • Van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J. Machine Learn. Res. 9(November):2579–2605.Google Scholar
  • Vincent M, Hansen NR (2014) Sparse group lasso and high dimensional multinomial classification. Comput. Statist. Data Anal. 71:771–786.CrossrefGoogle Scholar
  • Wang X (2009) Dimension reduction techniques in quasi-Monte Carlo methods for option pricing. INFORMS J. Comput. 21(3):488–504.LinkGoogle Scholar
  • Wang Y (2005) A multinomial logistic regression modeling approach for anomaly intrusion detection. Comput. Security 24(8):662–674.CrossrefGoogle Scholar
  • Wang L, Peng B, Bradic J, Li R, Wu Y (2020) A tuning-free robust and efficient approach to high-dimensional regression. J. Amer. Statist. Assoc. 115(532):1700–1714.CrossrefGoogle Scholar
  • Wen C, Zhang A, Quan S, Wang X (2020) Bess: An r package for best subset selection in linear, logistic and cox proportional hazards models. J. Statist. Software 94(4):1–24.CrossrefGoogle Scholar
  • Won D, Manzour H, Chaovalitwongse W (2020) Convex optimization for group feature selection in networked data. INFORMS J. Comput. 32(1):182–198.LinkGoogle Scholar
  • Yee TW, Hastie TJ (2003) Reduced-rank vector generalized linear models. Statist. Modeling 3(1):15–41.CrossrefGoogle Scholar
  • Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J. Royal Statist. Soc. Ser. B Statist. Methodology 68(1):49–67.CrossrefGoogle Scholar
  • Zheng Z, Fan Y, Lv J (2014) High dimensional thresholded regression and shrinkage effect. J. Royal Statist. Soc. Ser. B Statist. Methodology 76(3):627–649.CrossrefGoogle Scholar
  • Zheng Z, Bahadori MT, Liu Y, Lv J (2019) Scalable interpretable multi-response regression via seed. J. Machine Learn. Res. 20(107):1–34.Google Scholar
  • Zhu J, Wen C, Zhu J, Zhang H, Wang X (2020) A polynomial algorithm for best-subset selection problem. Proc. National Acad. Sci. USA 117(52):33117–33123.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.