On Sinkhorn’s Algorithm and Choice Modeling
Abstract
For a broad class of models widely used in practice for choice and ranking data based on the Luce choice axiom, including the Bradley–Terry–Luce and Plackett–Luce models, we show that the associated maximum likelihood estimation problems are equivalent to a classic matrix-balancing problem with target row and column sums. This perspective opens doors between two seemingly unrelated research areas and allows us to unify existing algorithms in the choice-modeling literature as special instances or analogs of Sinkhorn’s celebrated algorithm for matrix balancing. We draw inspirations from these connections and resolve some open problems on the study of Sinkhorn’s algorithm. We establish the global linear convergence of Sinkhorn’s algorithm for nonnegative matrices whenever finite scaling matrices exist and characterize its linear convergence rate in terms of the algebraic connectivity of a weighted bipartite graph. We further derive the sharp asymptotic rate of linear convergence, which generalizes a classic result of Knight. To our knowledge, these are the first quantitative linear convergence results for Sinkhorn’s algorithm for general nonnegative matrices and positive marginals. Our results highlight the importance of connectivity and orthogonality structures in matrix balancing and Sinkhorn’s algorithm, which could be of independent interest. More broadly, the connections that we establish in this paper between matrix balancing and choice modeling could also help motivate further transmission of ideas and lead to interesting results in both disciplines.
Funding: This work was supported in part by Stanford [an interdisciplinary graduate fellowship], the Division of Computing and Communication Foundations [Grant 2143176], and the European Research Council [Grant 866274].
Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2023.0596.

