Better Regularization for Sequential Decision Spaces: Fast Convergence Rates for Nash, Correlated, and Team Equilibria

Gabriele Farina
Gabriele Farina
[email protected]
https://orcid.org/0000-0002-3976-0061
MIT EECS, Cambridge, Massachusetts 02139
Search for more papers by this author
,
Christian Kroer
Corresponding Author
Christian Kroer
[email protected]
https://orcid.org/0000-0002-9009-8683
Industrial Engineering and Operations Research Department, Columbia University, New York, New York 10027
Search for more papers by this author
,
Tuomas Sandholm
Tuomas Sandholm
[email protected]
https://orcid.org/0000-0001-8861-9366
Computer Science Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213; and Strategy Robot, Inc., Pittsburgh, Pennsylvania 15232; and Optimized Markets, Inc., Pittsburgh, Pennsylvania 15232; and Strategic Machine, Inc., Pittsburgh, Pennsylvania 15232
Search for more papers by this author

MIT EECS, Cambridge, Massachusetts 02139

Search for more papers by this author

Christian Kroer

Corresponding Author

Christian Kroer

[email protected]

https://orcid.org/0000-0002-9009-8683

Industrial Engineering and Operations Research Department, Columbia University, New York, New York 10027

Search for more papers by this author

Tuomas Sandholm

[email protected]

https://orcid.org/0000-0001-8861-9366

Computer Science Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213; and Strategy Robot, Inc., Pittsburgh, Pennsylvania 15232; and Optimized Markets, Inc., Pittsburgh, Pennsylvania 15232; and Strategic Machine, Inc., Pittsburgh, Pennsylvania 15232

Search for more papers by this author

Published Online:28 Feb 2025https://doi.org/10.1287/opre.2021.0633

Abstract

We study the application of iterative first-order methods to the problem of computing equilibria of large-scale extensive-form games. First-order methods must typically be instantiated with a regularizer that serves as a distance-generating function (DGF) for the decision sets of the players. In this paper, we introduce a new weighted entropy-based distance-generating function. We show that this function is equivalent to a particular set of new weights for the dilated entropy distance–generating function on a treeplex while retaining the simpler structure of the regular entropy function for the unit cube. This function achieves significantly better strong-convexity properties than existing weight schemes for the dilated entropy while maintaining the same easily implemented closed-form proximal mapping as the prior state of the art. Extensive numerical simulations show that these superior theoretical properties translate into better numerical performance as well. We then generalize our new entropy distance function, as well as general dilated distance functions, to the scaled extension operator. The scaled extension operator is a way to recursively construct convex sets, which generalizes the decision polytope of extensive-form games as well as the convex polytopes corresponding to correlated and team equilibria. Correspondingly, we give the first efficiently computable distance-generating function for all those strategy polytopes. By instantiating first-order methods with our regularizers, we achieve several new results, such as the first method for computing ex ante correlated team equilibria with a guaranteed $1 / T$ rate of convergence and efficient proximal updates. Similarly, we show that our regularizers can be used to speed up the computation of correlated solution concepts.

Funding: G. Farina was supported by the National Science Foundations [Grant CCF-2443068] and by T. Sandholm’s grants listed below and a Facebook fellowship. C. Kroer was supported by the Office of Naval Research [Grants N00014-22-1-2530 and N00014-23-1-2374] and the National Science Foundation [Grants IIS-2147361 and IIS-2238960]. T. Sandholm was supported by the Vannevar Bush Faculty Fellowship, Office of Naval Research [Grant ONR N00014-23-1-2876], the National Science Foundation Division of Information and Intelligent Systems [Grants RI-1718457, RI-2312342, RI-1901403, and CCF-1733556], the Army Research Office [Grants W911NF2010081 and W911NF2210266], and the National Institutes of Health [Grant A240108S001]. This work was further supported by the National Science Foundation Division of Information and Intelligent Systems [Grant 1617590] and the Army Research Office [Grant W911NF-17-1-0082].

Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2021.0633.

Volume 73, Issue 5

September-October 2025

Pages iii-vii, 2297-2866, C2-C3

Article Information

Supplemental Material

Metrics

Information

Received:September 24, 2021
Accepted:November 26, 2024
Published Online:February 28, 2025

Cite as

Gabriele Farina, Christian Kroer, Tuomas Sandholm (2025) Better Regularization for Sequential Decision Spaces: Fast Convergence Rates for Nash, Correlated, and Team Equilibria. Operations Research 73(5):2430-2457.

https://doi.org/10.1287/opre.2021.0633

Keywords

Acknowledgments

The authors gratefully acknowledge the insightful comments from the anonymous reviewers at Operations Research and the ACM Conference on Economics and Computation. These comments led to a greatly improved manuscript.

This paper first appeared as an extended abstract at the ACM Conference on Economics and Computation in 2021.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Better Regularization for Sequential Decision Spaces: Fast Convergence Rates for Nash, Correlated, and Team Equilibria

Abstract

Volume 73, Issue 5

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News