An Exact Solution Approach for Hierarchical Clustering
Published Online:17 Apr 2025https://doi.org/10.1287/ijoc.2024.0903
References
- (2024) Mixed integer linear programming formulation for k-means clustering problem. Central Eur. J. Oper. Res. 32(1):11–27.Crossref, Google Scholar
- (2009) A branch-and-cut SDP-based algorithm for minimum sum-of-squares clustering. Pesquisa Operacional 29(3):503–516.Crossref, Google Scholar
- (2012a) An improved column generation algorithm for minimum sum-of-squares clustering. Math. Programming 131:195–220.Crossref, Google Scholar
- (2012b) A column generation algorithm for semi-supervised minimum sum-of-squares clustering. Aloise D, Hansen P, Rocha C, eds. Proc. Global Optimi. Workshop 2012, 19–22.Google Scholar
- (2009) NP-hardness of Euclidean sum-of-squares clustering. Machine Learn. 75:245–248.Crossref, Google Scholar
- (2015) Relax, no need to round: Integrality of clustering formulations. Proc. 2015 Conf. Innovations Theoretical Comput. Sci. (Association for Computing Machinery, New York), 191–200.Google Scholar
- (2008) An exact algorithm for the vehicle routing problem based on the set partitioning formulation with additional cuts. Math. Programming 115:351–385.Crossref, Google Scholar
- (2006) A repetitive branch-and-bound procedure for minimum within-cluster sums of squares partitioning. Psychometrika 71(2):347–363.Crossref, Google Scholar
- (2023) Mixed-integer programming techniques for the minimum sum-of-squares clustering problem. J. Global Optim. 87(1):133–189.Crossref, Google Scholar
- (2020) From trees to continuous embeddings and back: Hyperbolic hierarchical clustering. Adv. Neural Inform. Processing Systems, vol. 33 (MIT Press, Cambridge, MA), 15065–15076.Google Scholar
- (2017) Approximate hierarchical clustering via sparsest cut and spreading metrics. Klein PN, ed. Proc. 2017 Annual ACM-SIAM Sympos. Discrete Algorithms (Society for Industrial and Applied Mathematics, Philadelphia), 841–854.Google Scholar
- (2019) Hierarchical clustering: Objective functions and algorithms. J. ACM 66(4):1–42.Crossref, Google Scholar
- (2018) Descriptive clustering: ILP and CP formulations with applications. Lang J, ed. Proc. 27th Internat. Joint Conf. Artificial Intelligence (AAAI Press, Washington, DC), 1263–1269.Google Scholar
- (2016) A cost function for similarity-based hierarchical clustering. Proc. 48th Annual ACM Sympos. Theory Comput. (Association for Computing Machinery, New York), 118–127.Google Scholar
- (1985) Evaluation of a branch and bound algorithm for clustering. SIAM J. Sci. Statist. Comput. 6(2):268–284.Crossref, Google Scholar
- (1967) On nonlinear fractional programming. Management Sci. 13(7):492–498.Link, Google Scholar
- (1999) An interior point algorithm for minimum sum-of-squares clustering. SIAM J. Sci. Comput. 21(4):1485–1505.Crossref, Google Scholar
- (1965) A method for cluster analysis. Biometrics 21(2):362–375.Crossref, Google Scholar
- (2017) A flexible ILP formulation for hierarchical clustering. Artificial Intelligence 244:95–109.Crossref, Google Scholar
- (2013) Formalizing hierarchical clustering as integer linear programming. 27th AAAI Conf. Artificial Intelligence, vol. 27 (AAAI Press, Washington, DC), 372–378.Google Scholar
- (2016) Large-scale optimization with the primal-dual column generation method. Math. Programming Comput. 8:47–82.Crossref, Google Scholar
- (1987) Parsimonious trees. J. Classification 4:85–101.Crossref, Google Scholar
- (2021) Exact and approximate hierarchical clustering using A*. Proc. 37th Conf. Uncertainty Artificial Intelligence, vol. 161 (PMLR, New York), 2061–2071.Google Scholar
- (2001) J-means: A new local search heuristic for minimum sum of squares clustering. Pattern Recognition 34(2):405–413.Crossref, Google Scholar
- (2000) A simple enumerative algorithm for unconstrained 0 - 1 quadratic programming. Cahiers du GERAD G-2000-59. https://numerique.banq.qc.ca/patrimoine/details/52327/3081864.Google Scholar
- (2022) An empirical comparison and characterisation of nine popular clustering methods. Adv. Data Anal. Classification 16(1):201–229.Crossref, Google Scholar
- (2015) Handbook of Cluster Analysis (Chapman & Hall, New York).Crossref, Google Scholar
- (1993) Solving airline crew scheduling problems by branch-and-cut. Management Sci. 39(6):657–682.Link, Google Scholar
- (1985) Comparing partitions. J. Classification 2:193–218.Crossref, Google Scholar
- (1969) A dynamic programming algorithm for cluster analysis. Oper. Res. 17(6):927–1092.Link, Google Scholar
- (1990) Finding Groups in Data: An Introduction to Cluster Analysis (Wiley, New York).Crossref, Google Scholar
- (1975) A branch and bound clustering algorithm. IEEE Trans. Comput. 24(9):908–915.Crossref, Google Scholar
- (1967) Some methods for classification and analysis of multivariate observations. Proc. Fifth Berkeley Sympos. Math. Statist. Probab. Vol. 1 Statist. (University of California Press, Berkeley, CA), 281–297.Google Scholar
- (2023) Approximation bounds for hierarchical clustering: Average linkage, bisecting k-means, and local search. J. Machine Learn. Res. 24(1):1–36.Google Scholar
- (2010) Integer linear programming models for constrained clustering. Internat. Conf. Discovery Sci. (Springer, New York), 159–173.Google Scholar
- (2021) Objective-based hierarchical clustering of deep embedding vectors. Proc. AAAI Conf. Artificial Intelligence, vol. 35 (AAAI Press, Washington, DC), 9055–9063.Crossref, Google Scholar
- (2017) Improved branch-cut-and-price for capacitated vehicle routing. Math. Programming Comput. 9:61–100.Crossref, Google Scholar
- (2005) A new theoretical framework for k-means-type clustering. Foundations Adv. Data Mining 180:79–96.Google Scholar
- (2022a) An exact algorithm for semi-supervised minimum sum-of-squares clustering. Comput. Oper. Res. 147:105958.Crossref, Google Scholar
- (2022b) SOS-SDP: An exact solver for minimum sum-of-squares clustering. INFORMS J. Comput. 34(4):2144–2162.Link, Google Scholar
- (2021) Hierarchical clustering of data streams: Scalable algorithms and approximation guarantees. Meila M, Zhang T, eds. Proc. 38th Internat. Conf. Machine Learn., vol. 139 (PMLR, New York), 8799–8809.Google Scholar
- (1971) Cluster analysis and mathematical programming. J. Amer. Statist. Assoc. 66(335):622–626.Crossref, Google Scholar
- (2017) Hierarchical clustering via spreading metrics. J. Machine Learn. Res. 18(88):1–35.Google Scholar
- (1981) An integer programming approach to scheduling. Wren A, ed. Computer Scheduling of Public Transport: Urban Passenger Vehicle and Crew Scheduling (North-Holland, Amsterdam), 269–280.Google Scholar
- (2017) A review of clustering techniques and developments. Neurocomputing 267:664–681.Crossref, Google Scholar
- (2003) Combinatorial Optimization (Springer, Berlin, Heidelberg).Google Scholar
- (2005) A branch-and-price approach to p-median location problems. Comput. Oper. Res. 32(6):1655–1664.Crossref, Google Scholar
- (2024) A column generation algorithm with dynamic constraint aggregation for minimum sum-of-squares clustering. Preprint, submitted October 8, https://arxiv.org/abs/2410.06187.Google Scholar
- (2020) Clustering benchmark datasets exploiting the fundamental clustering problems. Data Brief 30:105501.Crossref, Google Scholar
- (2022) Hierarchical means clustering. J. Classification 39(3):553–577.Crossref, Google Scholar
- (1969) Integer programming and the theory of grouping. J. Amer. Statist. Assoc. 65(326):506–519.Crossref, Google Scholar
- (2020) An objective for hierarchical clustering in Euclidean space and its connection to bisecting k-means. Proc. AAAI Conf. Artificial Intelligence, vol. 34 (AAAI Press, Washington, DC), 6307–6314.Crossref, Google Scholar
- (2025) An exact solution approach for hierarchical clustering. http://dx.doi.org/10.1287/ijoc.2024.0903.cd, https://github.com/INFORMSJoC/2024.0903.Google Scholar
- (2005) A cutting algorithm for the minimum sum-of-squared error clustering. Proc. SIAM Internat. Data Mining Conf. (Society for Industrial and Applied Mathematics, Philadelphia), 150–160.Google Scholar
- (2015) Planar ultrametrics for image segmentation. Adv. Neural Inform. Processing Systems, vol. 28 (MIT Press, Cambridge, MA), 64–72.Google Scholar

