Truncated Fusion Learning on Supervised Clustering and Its Fast Stagewise Algorithm

Published Online: https://doi.org/10.1287/ijoc.2024.0840

References

  • Andrews JL, McNicholas PD (2012) Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributions. Statist. Comput. 22(5):1021–1029.
  • Antoniadis A (1997) Wavelets in statistics: A review. J. Italian Statist. Soc. 6(2):97–130.
  • Bertsekas DP (1997) Nonlinear programming. J. Oper. Res. Soc. 48(3):334–334.
  • Boyd S, Parikh N, Chu E (2011) Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers (Now Publishers Inc, Hanover).
  • Chen EY, Song R, Jordan MI (2024) Reinforcement learning in latent heterogeneous environments. J. Amer. Statist. Assoc. 119(548):3113–3126.
  • Chen K, Dong R, Xu W, Zheng Z (2022) Fast stagewise sparse factor regression. J. Machine Learn. Res. 23(271):1–45.
  • Chen K, Huang R, Chan NH, Yau CY (2019) Subgroup analysis of zero-inflated Poisson regression model with applications to insurance data. Insurance Math. Econom. 86:8–18.
  • Chen J, Tran-Dinh Q, Kosorok MR, Liu Y (2021) Identifying heterogeneous effect using latent supervised clustering with adaptive fusion. J. Comput. Graphical Statist. 30(1):43–54.
  • Delon J, Desolneux A (2020) A Wasserstein-type distance in the space of Gaussian mixture models. SIAM J. Imaging Sci. 13(2):936–970.
  • Efron B, Hastie T, Johnstone I, Tibshirani R (2004) Least angle regression. Ann. Statist. 32(1):407–499.
  • Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96(456):1348–1360.
  • Fan Y, Lv J (2013) Asymptotic equivalence of regularization methods in thresholded parameter space. J. Amer. Statist. Assoc. 108(503):1044–1061.
  • Fan J, Peng H (2004) Nonconcave penalized likelihood with a diverging number of parameters. Ann. Statist. 32(3):928–961.
  • Fan Y, Demirkaya E, Li G, Lv J (2020) RANK: Large-scale inference with graphical nonlinear knockoffs. J. Amer. Statist. Assoc. 115(529):362–379.
  • Gandhi RT, Tashima KT, Smeaton LM, Vu V, Ritz J, Andrade A, Eron JJ, Hogg E, Fichtenbaum CJ (2020) Long-term outcomes in a large randomized trial of HIV-1 salvage therapy: 96-week results of AIDS clinical trials group A5241 (OPTIONS). J. Infectious Diseases 221(9):1407–1415.
  • Ghosh A, Chung J, Yin D, Ramchandran K (2020) An efficient framework for clustered federated learning. Neural Inform. Processing Systems 33:19586–19597.
  • Greenland S (2009) Interactions in epidemiology: Relevance, identification, and estimation. Epidemiology 20(1):14–17.
  • Hastie TJ, Pregibon D (2017) Generalized linear models. Chambers JM, Hastie TJ, eds. Statistical Models in S (Routledge, Abingdon-on-Thames, UK), 195–247.
  • Hastie T, Tibshirani R (1996) Discriminant analysis by Gaussian mixtures. J. Roy. Statist. Soc. Series B. Methodological 58(1):155–176.
  • Hu X, Huang J, Liu L, Sun D, Zhao X (2021) Subgroup analysis in the heterogeneous Cox model. Statist. Medicine 40(3):739–757.
  • Jeon JJ, Kwon S, Choi H (2017) Homogeneity detection for the high-dimensional generalized linear model. Comput. Stat. Data Anal. 114:61–74.
  • Lambert D (1992) Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 34(1):1–14.
  • Lehmann EL, Casella G (2006) Theory of Point Estimation (Springer, New York).
  • Li L, Li Y, Zhang J, Zheng Z (2025) Truncated fusion learning on supervised clustering and its fast stagewise algorithm. https://doi.org/10.1287/ijoc.2024.0840.cd, https://github.com/INFORMSJoC/2024.0840.
  • Lin L, Shi W, Ye J, Li J (2023) Multisource single-cell data integration by MAW barycenter for Gaussian mixture models. Biometrics 79(2):866–877.
  • Liu Y, Wu Y (2007) Variable selection via a combination of the l0 and l1 penalties. J. Comput. Graphical Statist. 16(4):782–798.
  • Liu W, Mao X, Zhang X, Zhang X (2025) Robust personalized federated learning with sparse penalization. J. Amer. Statist. Assoc. 120(549):266–277.
  • Ma S, Huang J (2017) A concave pairwise fusion approach to subgroup analysis. J. Amer. Statist. Assoc. 112(517):410–423.
  • Ma S, Huang J, Zhang Z, Liu M (2019) Exploration of heterogeneous treatment effects via concave fusion. Internat. J. Biostatistics 16(1):1–26.
  • McCullagh P (1980) Regression models for ordinal data. J. Roy. Statist. Soc. Series B. Methodological 42(2):109–127.
  • Reynolds DA (2009) Gaussian mixture models. Encyclopedia of Biometrics (Springer, Boston), 659–663.
  • Sattler F, Muller KR, Samek W (2021) Clustered federated learning: Model-agnostic distributed multitask optimization under privacy constraints. IEEE Trans. Neural Networks Learn. Systems 32(8):3710–3722.
  • Shen J, He X (2015) Inference for subgroup analysis with a structured logistic-normal mixture model. J. Amer. Statist. Assoc. 110(509):303–312.
  • Shen X, Huang HC (2010) Grouping pursuit through a regularization solution surface. J. Amer. Statist. Assoc. 105(490):727–739.
  • Su X, Fan J, Levine RA, Nunn ME, Tsai CL (2018) Sparse estimation of generalized linear models (GLM) via approximated information criteria. Statist. Sinica 28(3):1561–1581.
  • Tang Y, Xiang L, Zhu Z (2014) Risk factor selection in rate making: EM adaptive LASSO for zero-inflated Poisson regression models. Risk Anal. 34(6):1112–1127.
  • Tibshirani RJ (2015) A general framework for fast stagewise algorithms. J. Machine Learn. Res. 16(1):2543–2588.
  • Tobin J (1958) Estimation of relationships for limited dependent variables. Econometrica 26(1):24–36.
  • Vaughan G, Aseltine R, Chen K, Yan J (2017) Stagewise generalized estimating equations with grouped variables. Biometrics 73(4):1332–1342.
  • Wager S, Athey S (2018) Estimation and inference of heterogeneous treatment effects using random forests. J. Amer. Statist. Assoc. 113(523):1228–1242.
  • Wang M, Yao T, Allen GI (2023) Supervised convex clustering. Biometrics 79(4):3846–3858.
  • Wei LJ (1992) The accelerated failure time model: A useful alternative to the Cox regression model in survival analysis. Statist. Medicine 11(14–15):1871–1879.
  • Yip KC, Yau KK (2005) On modeling claim frequency data in general insurance with extra zeros. Insurance Math. Econom. 36(2):153–163.
  • Yuan YX (2000) A review of trust region algorithms for optimization. Internat. Congress Indust. Appl. Math. 99(1):271–282.
  • Yuan YX (2015) Recent advances in trust region algorithms. Math. Programming 151:249–281.
  • Zhang C (2010) Nearly unbiased variable selection under minimax concave penalty. Ann. Statist. 38(2):894–942.
  • Zhang X, Liu J, Zhu Z (2024) Learning coefficient heterogeneity over networks: A distributed spanning-tree-based fused-lasso regression. J. Amer. Statist. Assoc. 119(545):485–497.
  • Zhang R, Xue L, Wang Q (2023) An ensemble credit scoring model based on logistic regression with heterogeneous balancing and weighting effects. Expert Systems Appl. 212:118732.
  • Zheng Z, Bahadori MT, Liu Y, Lv J (2019) Scalable interpretable multi-response regression via SEED. J. Machine Learn. Res. 20(107):1–34.
  • Zhou X (2018) On the Fenchel duality between strong convexity and Lipschitz continuous gradient. Preprint, submitted March 17, https://arxiv.org/abs/1803.06573.