On the Power of Linear Programming for K-Means Clustering

Antonio De Rosa
Antonio De Rosa
[email protected]
Department of Decision Sciences, Bocconi University, 20136 Milano, Italy; and The Bocconi Institute for Data Science and Analytics (BIDSA), Bocconi University, 20136 Milano, Italy
Search for more papers by this author
,
Aida Khajavirad
Corresponding Author
Aida Khajavirad
[email protected]
https://orcid.org/0000-0002-7097-1676
Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015
Search for more papers by this author
,
Yakun Wang
Yakun Wang
[email protected]
https://orcid.org/0009-0009-4995-0131
Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015
Search for more papers by this author

Department of Decision Sciences, Bocconi University, 20136 Milano, Italy; and The Bocconi Institute for Data Science and Analytics (BIDSA), Bocconi University, 20136 Milano, Italy

Search for more papers by this author

Aida Khajavirad

Corresponding Author

Aida Khajavirad

[email protected]

https://orcid.org/0000-0002-7097-1676

Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015

Search for more papers by this author

Yakun Wang

[email protected]

https://orcid.org/0009-0009-4995-0131

Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015

Search for more papers by this author

Published Online:19 May 2026https://doi.org/10.1287/ijoo.2025.0065

Abstract

De Rosa and Khajavirad [De Rosa A, Khajavirad A (2022) The ratio-cut polytope and K-means clustering. SIAM J. Optim. 32(1):173–203] introduced a new polynomial-size linear programming (LP) relaxation for K-means clustering. In this paper, we further investigate both theoretical and computational properties of this relaxation. As evident from our numerical experiments with both synthetic and real-world data sets, the proposed LP relaxation is almost always tight, i.e., its optimal solution is feasible for the original nonconvex problem. To better understand this unexpected behavior, on the theoretical side, we focus on K-means clustering with two clusters, and we obtain sufficient conditions under which a given partition of data is an optimal solution of the LP relaxation. We further analyze the sufficient conditions when the input is generated according to a popular stochastic model and obtain recovery guarantees for the LP. We conclude our theoretical study by constructing a family of inputs for which the LP relaxation is never tight. Denoting by n the number of data points to be clustered, the LP relaxation contains $Ω (n^{3})$ inequalities, making it impractical for large data sets. To address the scalability issue, by building upon a cutting-plane algorithm together with the GPU implementation of PDLP, a first-order method LP solver, we develop an efficient algorithm that solves the proposed LP and hence, the K-means clustering problem for up to $n \leq 4, 000$ data points.

Funding: The authors were partially funded by the Air Force Office of Scientific Research [Grant FA9550-23-1-0123].

cover image INFORMS Journal on Optimization

Articles In Advance

Article Information

Metrics

Information

Received:January 14, 2025
Accepted:April 20, 2026
Published Online:May 19, 2026

Cite as

Antonio De Rosa, Aida Khajavirad, Yakun Wang (2026) On the Power of Linear Programming for K-Means Clustering. INFORMS Journal on Optimization 0(0).

https://doi.org/10.1287/ijoo.2025.0065

Keywords

PDF download

Available Issues

Available Issues

Available Issues

On the Power of Linear Programming for K-Means Clustering

Abstract

Articles In Advance

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News