Truncated Fusion Learning on Supervised Clustering and Its Fast Stagewise Algorithm

Letian Li
Letian Li
[email protected]
https://orcid.org/0009-0000-6981-3542
International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, P. R. China
Search for more papers by this author
,
Yang Li
Corresponding Author
Yang Li
[email protected]
https://orcid.org/0000-0002-1202-1082
International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, P. R. China
Search for more papers by this author
,
Jie Zhang
Jie Zhang
[email protected]
International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, P. R. China
Search for more papers by this author
,
Zemin Zheng
Corresponding Author
Zemin Zheng
[email protected]
https://orcid.org/0000-0002-0240-9411
International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, P. R. China
Search for more papers by this author

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, P. R. China

Search for more papers by this author

Yang Li

Corresponding Author

Yang Li

[email protected]

https://orcid.org/0000-0002-1202-1082

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, P. R. China

Search for more papers by this author

Jie Zhang

[email protected]

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, P. R. China

Search for more papers by this author

Zemin Zheng

Corresponding Author

Zemin Zheng

[email protected]

https://orcid.org/0000-0002-0240-9411

International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, P. R. China

Search for more papers by this author

Published Online:31 Dec 2025https://doi.org/10.1287/ijoc.2024.0840

References

Andrews JL, McNicholas PD (2012) Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributions. Statist. Comput. 22(5):1021–1029.Crossref, Google Scholar
Antoniadis A (1997) Wavelets in statistics: A review. J. Italian Statist. Soc. 6(2):97–130. Crossref, Google Scholar
Bertsekas DP (1997) Nonlinear programming. J. Oper. Res. Soc. 48(3):334–334.Crossref, Google Scholar
Boyd S, Parikh N, Chu E (2011) Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers (Now Publishers Inc, Hanover).Google Scholar
Chen EY, Song R, Jordan MI (2024) Reinforcement learning in latent heterogeneous environments. J. Amer. Statist. Assoc. 119(548):3113–3126.Crossref, Google Scholar
Chen K, Dong R, Xu W, Zheng Z (2022) Fast stagewise sparse factor regression. J. Machine Learn. Res. 23(271):1–45.Google Scholar
Chen K, Huang R, Chan NH, Yau CY (2019) Subgroup analysis of zero-inflated Poisson regression model with applications to insurance data. Insurance Math. Econom. 86:8–18.Crossref, Google Scholar
Chen J, Tran-Dinh Q, Kosorok MR, Liu Y (2021) Identifying heterogeneous effect using latent supervised clustering with adaptive fusion. J. Comput. Graphical Statist. 30(1):43–54.Crossref, Google Scholar
Delon J, Desolneux A (2020) A Wasserstein-type distance in the space of Gaussian mixture models. SIAM J. Imaging Sci. 13(2):936–970.Crossref, Google Scholar
Efron B, Hastie T, Johnstone I, Tibshirani R (2004) Least angle regression. Ann. Statist. 32(1):407–499.Google Scholar
Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96(456):1348–1360.Crossref, Google Scholar
Fan Y, Lv J (2013) Asymptotic equivalence of regularization methods in thresholded parameter space. J. Amer. Statist. Assoc. 108(503):1044–1061.Crossref, Google Scholar
Fan J, Peng H (2004) Nonconcave penalized likelihood with a diverging number of parameters. Ann. Statist. 32(3):928–961.Crossref, Google Scholar
Fan Y, Demirkaya E, Li G, Lv J (2020) Rank: Large-scale inference with graphical nonlinear knockoffs. J. Amer. Statist. Assoc. 115(529):362–379. Crossref, Google Scholar
Gandhi RT, Tashima KT, Smeaton LM, Vu V, Ritz J, Andrade A, Eron JJ, Hogg E, Fichtenbaum CJ (2020) Long-term outcomes in a large randomized trial of HIV-1 salvage therapy: 96-week results of AIDS clinical trials group A5241 (OPTIONS). J. Infectious Diseases 221(9):1407–1415.Crossref, Google Scholar
Ghosh A, Chung J, Yin D, Ramchandran K (2020) An efficient framework for clustered federated learning. Neural Inform. Processing Systems 33:19586–19597. Google Scholar
Greenland S (2009) Interactions in epidemiology: Relevance, identification, and estimation. Epidemiology 20(1):14–17.Crossref, Google Scholar
Hastie TJ, Pregibon D (2017) Generalized linear models. Chambers JM, Hastie TJ, eds. Statistical Models in S (Routledge, Abingdon-on-Thames, UK), 195–247 .Google Scholar
Hastie T, Tibshirani R (1996) Discriminant analysis by Gaussian mixtures. J. Roy. Statist. Soc. Series B. Methodological 58(1):155–176.Crossref, Google Scholar
Hu X, Huang J, Liu L, Sun D, Zhao X (2021) Subgroup analysis in the heterogeneous cox model. Statist. Medicine 40(3):739–757.Crossref, Google Scholar
Jeon JJ, Kwon S, Choi H (2017) Homogeneity detection for the high-dimensional generalized linear model. Comput. Stat. Data Anal. 114:61–74.Crossref, Google Scholar
Lambert D (1992) Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 34(1):1–14.Crossref, Google Scholar
Lehmann EL, Casella G (2006) Theory of Point Estimation (Springer, New York).Google Scholar
Li L, Li Y, Zhang J, Zheng Z (2025) Truncated fusion learning on supervised clustering and its fast stagewise algorithm. https://doi.org/10.1287/ijoc.2024.0840.cd, https://github.com/INFORMSJoC/2024.0840.Google Scholar
Lin L, Shi W, Ye J, Li J (2023) Multisource single-cell data integration by maw barycenter for Gaussian mixture models. Biometrics 79(2):866–877.Crossref, Google Scholar
Liu Y, Wu Y (2007) Variable selection via a combination of the l0 and l1 penalties. J. Comput. Graphical Statist. 16(4):782–798.Crossref, Google Scholar
Liu W, Mao X, Zhang X, Zhang X (2025) Robust personalized federated learning with sparse penalization. J. Amer. Statist. Assoc. 120(549):266–277. Crossref, Google Scholar
Ma S, Huang J (2017) A concave pairwise fusion approach to subgroup analysis. J. Amer. Statist. Assoc. 112(517):410–423.Crossref, Google Scholar
Ma S, Huang J, Zhang Z, Liu M (2019) Exploration of heterogeneous treatment effects via concave fusion. Internat. J. Biostatistics 16(1):1–26. Google Scholar
McCullagh P (1980) Regression models for ordinal data. J. Roy. Statist. Soc. Series B. Methodological 42(2):109–127.Crossref, Google Scholar
Reynolds DA (2009) Gaussian mixture models. Encyclopedia of Biometrics (Springer, Boston), 659–663.Crossref, Google Scholar
Sattler F, Muller KR, Samek W (2021) Clustered federated learning: Model-agnostic distributed multitask optimization under privacy constraints. IEEE Trans. Neural Networks Learn. Systems 32(8):3710–3722. Crossref, Google Scholar
Shen J, He X (2015) Inference for subgroup analysis with a structured logistic-normal mixture model. J. Amer. Statist. Assoc. 110(509):303–312.Crossref, Google Scholar
Shen X, Huang HC (2010) Grouping pursuit through a regularization solution surface. J. Amer. Statist. Assoc. 105(490):727–739.Crossref, Google Scholar
Su X, Fan J, Levine RA, Nunn ME, Tsai CL (2018) Sparse estimation of generalized linear models (GLM) via approximated information criteria. Statist. Sinica 28(3):1561–1581.Google Scholar
Tang Y, Xiang L, Zhu Z (2014) Risk factor selection in rate making: EM adaptive LASSO for zero-inflated Poisson regression models. Risk Anal. 34(6):1112–1127.Crossref, Google Scholar
Tibshirani RJ (2015) A general framework for fast stagewise algorithms. J. Machine Learn. Res. 16(1):2543–2588.Google Scholar
Tobin J (1958) Estimation of relationships for limited dependent variables. Econometrica 26(1):24–36. Crossref, Google Scholar
Vaughan G, Aseltine R, Chen K, Yan J (2017) Stagewise generalized estimating equations with grouped variables. Biometrics 73(4):1332–1342.Crossref, Google Scholar
Wager S, Athey S (2018) Estimation and inference of heterogeneous treatment effects using random forests. J. Amer. Statist. Assoc. 113(523):1228–1242.Crossref, Google Scholar
Wang M, Yao T, Allen GI (2023) Supervised convex clustering. Biometrics 79(4):3846–3858.Crossref, Google Scholar
Wei LJ (1992) The accelerated failure time model: A useful alternative to the Cox regression model in survival analysis. Statist. Medicine 11(14–15):1871–1879.Crossref, Google Scholar
Yip KC, Yau KK (2005) On modeling claim frequency data in general insurance with extra zeros. Insurance Math. Econom. 36(2):153–163.Crossref, Google Scholar
Yuan YX (2000) A review of trust region algorithms for optimization. Internat. Congress Indust. Appl. Math. 99(1):271–282.Google Scholar
Yuan YX (2015) Recent advances in trust region algorithms. Math. Programming 151:249–281.Crossref, Google Scholar
Zhang C (2010) Nearly unbiased variable selection under minimax concave penalty. Ann. Statist. 38(2):894–942.Crossref, Google Scholar
Zhang X, Liu J, Zhu Z (2024) Learning coefficient heterogeneity over networks: A distributed spanning-tree-based fused-lasso regression. J. Amer. Statist. Assoc. 119(545):485–497.Crossref, Google Scholar
Zhang R, Xue L, Wang Q (2023) An ensemble credit scoring model based on logistic regression with heterogeneous balancing and weighting effects. Expert Systems. Appl. 212:118732.Crossref, Google Scholar
Zheng Z, Bahadori MT, Liu Y, Lv J (2019) Scalable interpretable multi-response regression via seed. J. Machine Learn. Res. 20(107):1–34.Google Scholar
Zhou X (2018) On the Fenchel duality between strong convexity and Lipschitz continuous gradient. Preprint, submitted March 17, https://arxiv.org/abs/1803.06573.Google Scholar

cover image INFORMS Journal on Computing

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:July 07, 2024
Accepted:November 02, 2025
Published Online:December 31, 2025

Cite as

Letian Li, Yang Li, Jie Zhang, Zemin Zheng (2026) Truncated Fusion Learning on Supervised Clustering and Its Fast Stagewise Algorithm. INFORMS Journal on Computing 0(0).

https://doi.org/10.1287/ijoc.2024.0840

Keywords

Acknowledgments

The authors thank the editors and referees for their valuable comments that helped improve this article substantially.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Truncated Fusion Learning on Supervised Clustering and Its Fast Stagewise Algorithm

References

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News