Truncated Fusion Learning on Supervised Clustering and Its Fast Stagewise Algorithm
Published Online:31 Dec 2025https://doi.org/10.1287/ijoc.2024.0840
References
- (2012) Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributions. Statist. Comput. 22(5):1021–1029.Crossref, Google Scholar
- (1997) Wavelets in statistics: A review. J. Italian Statist. Soc. 6(2):97–130. Crossref, Google Scholar
- (1997) Nonlinear programming. J. Oper. Res. Soc. 48(3):334–334.Crossref, Google Scholar
- (2011) Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers (Now Publishers Inc, Hanover).Google Scholar
- (2024) Reinforcement learning in latent heterogeneous environments. J. Amer. Statist. Assoc. 119(548):3113–3126.Crossref, Google Scholar
- (2022) Fast stagewise sparse factor regression. J. Machine Learn. Res. 23(271):1–45.Google Scholar
- (2019) Subgroup analysis of zero-inflated Poisson regression model with applications to insurance data. Insurance Math. Econom. 86:8–18.Crossref, Google Scholar
- (2021) Identifying heterogeneous effect using latent supervised clustering with adaptive fusion. J. Comput. Graphical Statist. 30(1):43–54.Crossref, Google Scholar
- (2020) A Wasserstein-type distance in the space of Gaussian mixture models. SIAM J. Imaging Sci. 13(2):936–970.Crossref, Google Scholar
- (2004) Least angle regression. Ann. Statist. 32(1):407–499.Google Scholar
- (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96(456):1348–1360.Crossref, Google Scholar
- (2013) Asymptotic equivalence of regularization methods in thresholded parameter space. J. Amer. Statist. Assoc. 108(503):1044–1061.Crossref, Google Scholar
- (2004) Nonconcave penalized likelihood with a diverging number of parameters. Ann. Statist. 32(3):928–961.Crossref, Google Scholar
- (2020) Rank: Large-scale inference with graphical nonlinear knockoffs. J. Amer. Statist. Assoc. 115(529):362–379. Crossref, Google Scholar
- (2020) Long-term outcomes in a large randomized trial of HIV-1 salvage therapy: 96-week results of AIDS clinical trials group A5241 (OPTIONS). J. Infectious Diseases 221(9):1407–1415.Crossref, Google Scholar
- (2020) An efficient framework for clustered federated learning. Neural Inform. Processing Systems 33:19586–19597. Google Scholar
- (2009) Interactions in epidemiology: Relevance, identification, and estimation. Epidemiology 20(1):14–17.Crossref, Google Scholar
- (2017) Generalized linear models. Chambers JM, Hastie TJ, eds. Statistical Models in S (Routledge, Abingdon-on-Thames, UK), 195–247 .Google Scholar
- (1996) Discriminant analysis by Gaussian mixtures. J. Roy. Statist. Soc. Series B. Methodological 58(1):155–176.Crossref, Google Scholar
- (2021) Subgroup analysis in the heterogeneous cox model. Statist. Medicine 40(3):739–757.Crossref, Google Scholar
- (2017) Homogeneity detection for the high-dimensional generalized linear model. Comput. Stat. Data Anal. 114:61–74.Crossref, Google Scholar
- (1992) Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 34(1):1–14.Crossref, Google Scholar
- (2006) Theory of Point Estimation (Springer, New York).Google Scholar
- (2025) Truncated fusion learning on supervised clustering and its fast stagewise algorithm. https://doi.org/10.1287/ijoc.2024.0840.cd, https://github.com/INFORMSJoC/2024.0840.Google Scholar
- (2023) Multisource single-cell data integration by maw barycenter for Gaussian mixture models. Biometrics 79(2):866–877.Crossref, Google Scholar
- (2007) Variable selection via a combination of the l0 and l1 penalties. J. Comput. Graphical Statist. 16(4):782–798.Crossref, Google Scholar
- (2025) Robust personalized federated learning with sparse penalization. J. Amer. Statist. Assoc. 120(549):266–277. Crossref, Google Scholar
- (2017) A concave pairwise fusion approach to subgroup analysis. J. Amer. Statist. Assoc. 112(517):410–423.Crossref, Google Scholar
- (2019) Exploration of heterogeneous treatment effects via concave fusion. Internat. J. Biostatistics 16(1):1–26. Google Scholar
- (1980) Regression models for ordinal data. J. Roy. Statist. Soc. Series B. Methodological 42(2):109–127.Crossref, Google Scholar
- (2009) Gaussian mixture models. Encyclopedia of Biometrics (Springer, Boston), 659–663.Crossref, Google Scholar
- (2021) Clustered federated learning: Model-agnostic distributed multitask optimization under privacy constraints. IEEE Trans. Neural Networks Learn. Systems 32(8):3710–3722. Crossref, Google Scholar
- (2015) Inference for subgroup analysis with a structured logistic-normal mixture model. J. Amer. Statist. Assoc. 110(509):303–312.Crossref, Google Scholar
- (2010) Grouping pursuit through a regularization solution surface. J. Amer. Statist. Assoc. 105(490):727–739.Crossref, Google Scholar
- (2018) Sparse estimation of generalized linear models (GLM) via approximated information criteria. Statist. Sinica 28(3):1561–1581.Google Scholar
- (2014) Risk factor selection in rate making: EM adaptive LASSO for zero-inflated Poisson regression models. Risk Anal. 34(6):1112–1127.Crossref, Google Scholar
- (2015) A general framework for fast stagewise algorithms. J. Machine Learn. Res. 16(1):2543–2588.Google Scholar
- (1958) Estimation of relationships for limited dependent variables. Econometrica 26(1):24–36. Crossref, Google Scholar
- (2017) Stagewise generalized estimating equations with grouped variables. Biometrics 73(4):1332–1342.Crossref, Google Scholar
- (2018) Estimation and inference of heterogeneous treatment effects using random forests. J. Amer. Statist. Assoc. 113(523):1228–1242.Crossref, Google Scholar
- (2023) Supervised convex clustering. Biometrics 79(4):3846–3858.Crossref, Google Scholar
- (1992) The accelerated failure time model: A useful alternative to the Cox regression model in survival analysis. Statist. Medicine 11(14–15):1871–1879.Crossref, Google Scholar
- (2005) On modeling claim frequency data in general insurance with extra zeros. Insurance Math. Econom. 36(2):153–163.Crossref, Google Scholar
- (2000) A review of trust region algorithms for optimization. Internat. Congress Indust. Appl. Math. 99(1):271–282.Google Scholar
- (2015) Recent advances in trust region algorithms. Math. Programming 151:249–281.Crossref, Google Scholar
- (2010) Nearly unbiased variable selection under minimax concave penalty. Ann. Statist. 38(2):894–942.Crossref, Google Scholar
- (2024) Learning coefficient heterogeneity over networks: A distributed spanning-tree-based fused-lasso regression. J. Amer. Statist. Assoc. 119(545):485–497.Crossref, Google Scholar
- (2023) An ensemble credit scoring model based on logistic regression with heterogeneous balancing and weighting effects. Expert Systems. Appl. 212:118732.Crossref, Google Scholar
- (2019) Scalable interpretable multi-response regression via seed. J. Machine Learn. Res. 20(107):1–34.Google Scholar
- (2018) On the Fenchel duality between strong convexity and Lipschitz continuous gradient. Preprint, submitted March 17, https://arxiv.org/abs/1803.06573.Google Scholar

