Sampling from the Gibbs Distribution in Congestion Games
Abstract
Logit dynamics is a form of randomized game dynamics in which players have a bias toward strategic deviations that give a higher improvement in cost. It is used extensively in practice. In congestion (or potential) games, the dynamics converge to the so-called Gibbs distribution over the set of all strategy profiles when interpreted as a Markov chain. In general, logit dynamics can converge slowly to the Gibbs distribution, but beyond that, not much is known about its algorithmic aspects, nor that of the Gibbs distribution. In this work, we are interested in the following two questions for congestion games: (i) Is there an efficient algorithm for sampling from the Gibbs distribution? (ii) If yes, does there also exist natural randomized dynamics that converge quickly to the Gibbs distribution? We first study these questions in extension parallel congestion games, a well-studied special case of symmetric network congestion games. As our main result, we show that there is a simple variation on the logit dynamics that converges quickly to the Gibbs distribution in such games. We also address the first question for the class of so-called capacitated uniform congestion games and the second question for max cut games played on a complete graph. To prove our results, we rely on the recent breakthrough work of Anari et al. (2019) regarding the approximate sampling of a base of a matroid according to a strongly log-concave probability distribution.
1. Introduction
Congestion games constitute a rich class of games that have been studied extensively since their introduction by Rosenthal [55]. An (unweighted) congestion game consists of a set of players and a set of resources . Every player i has a strategy set , in which each strategy is a subset of resources. Furthermore, every resource is equipped with a cost function that we assume to be nonnegative and nondecreasing. The goal of a player is to choose a strategy that minimizes the player’s total cost , where is the number of players using resource e in profile . A well-known example is the class of symmetric network congestion games, in which we are given a directed graph with origin and destination . The common strategy set of all players is given by the set of all o, d-paths in G.
Rosenthal [55] proves that congestion games are (exact) potential games. He shows that the function given by
Here, is used to denote the cost of player i in the strategy profile in which i chooses , and all other players choose their strategy in s. The function is often referred to as Rosenthal’s potential. The main implication of (1) is the existence of a pure Nash equilibrium (PNE): a strategy profile in which no player can deviate to another strategy and obtain an improved cost (Rosenthal [55]). This follows directly from the observation that better (or best) response dynamics converge to a PNE in a finite number of steps. Better response dynamics is defined as the procedure by which, in every step, precisely one player deviates to another strategy that yields an improved cost (until a pure Nash equilibrium is reached). For best response dynamics, the deviating player always deviates to a strategy that yields the greatest possible improvement in cost.
In the last two decades, the algorithmic aspects of (pure) Nash equilibria are studied extensively in both general and special classes of congestion games. Two of the most prominent questions concerning pure Nash equilibria are the following.
Does (natural) player dynamics, such as better or best response dynamics, converge to a PNE in polynomial time?
If not, can one compute a PNE in polynomial time by other means?
Ieong et al. [38] consider the convergence time of better and best response dynamics for singleton congestion games. Here, every strategy of a player consists of a single resource. It is shown that better response dynamics converge in at most steps, and there exist instances in which convergence of better response dynamics might take steps (Ieong et al. [38]). The result of Ieong et al. [38] is generalized by Ackermann et al. [2] to matroid congestion games, in which the strategy set of every player is the collection of bases of a matroid on the ground set E (see Section 2.3 for a formal definition). They show that best response dynamics converge in at most steps, where r is the maximum rank of the matroids that form the strategy sets of the players. They also show that matroids are, in a sense, the maximal structure that allow for polynomial time convergence of best response dynamics. Furthermore, Ackermann et al. [2] show best response dynamics are not guaranteed to converge in symmetric network congestion games, in which the common strategy set of every player is the collection of paths in a given directed graph. Best response dynamics are also studied in more general congestion games, such as with player-specific cost functions (Ackermann and Röglin [1]) or with weighted players (Even-Dar et al. [24], Fanelli and Moscardelli [26]). Approximate versions of response dynamics are also studied; see, for example, Chien and Sinclair [19] and Skopalik and Vöcking [59].
The complexity of computing a pure Nash equilibrium is also well-understood. Fabrikant et al. [25] show that computing a pure Nash equilibrium is complete for the complexity class polynomial local search. Hardness of computing a PNE is also shown for various special cases of congestion games; see, for example, Ackermann et al. [2] and Del Pia et al. [21], as well as for the computation of approximate equilibria (Skopalik and Vöcking [59]). On the positive side, Fabrikant et al. [25] show that a PNE can be computed efficiently in symmetric network congestion games despite the fact that best response dynamics are not guaranteed to converge in these games (Ackermann et al. [2]). Del Pia et al. [21] generalize this result to totally unimodular congestion games; see also Kleer and Schäfer [43] for a more general framework.
In general, player dynamics roughly comes in two flavors: one deviates to another strategy profile according to either a deterministic rule or a probabilistic one. A well-known example of the latter case is noisy (randomized) best response dynamics, which has received a lot of attention in practice; see, for example, Camerer [18] and Mäs and Nax [47], but seems hard to analyze from a theoretical perspective. Here, instead of making a deviation to another strategy according to a deterministic rule, a player chooses a strategy from the player’s set according to a probability distribution that usually puts relatively more weight on strategies that result in a lower cost. Randomized dynamics can be studied from two perspectives: as either a randomized alternative for deterministic dynamics converging to a pure Nash equilibrium or as a dynamical system on its own.
One well-known example of player dynamics that can be studied as a dynamical system and which is the topic of this paper is logit dynamics. It has received a lot of attention in various communities, such as evolutionary game theory (e.g., Sandholm [56]) and experimental economics (e.g., Camerer [18]). The procedure was introduced by Blume [14] as a form of randomized game dynamics in which players update their strategy according to a logit update rule (McFadden [48]). The logit dynamics for congestion games can be formulated as follows. For a given strategy profile s and fixed rationality level (or inverse temperature in the physics literature) parameter , first choose a player uniformly at random and then have player i choose a strategy with probability
Note that the denominator in (2) is a normalizing constant (often called the partition function).
Equivalently, one can replace the by and, similarly, in the normalizing constant. We use as this is more convenient for our purposes. The equivalence follows from (1).
The rationality level is used to model the amount of noise players believe there to be in the system (Auletta et al. [12]). When , players effectively only assign positive probability to best responses, whereas when , the distribution in (2) approaches the uniform distribution over . Also note that the dynamics indeed puts relatively more probability mass on strategies that give a greater improvement in cost.
Logit dynamics gives rise to an ergodic, time-reversible Markov chain on the set of all strategy profiles that has the Gibbs distribution π given by
Auletta et al. [12] interpret the Gibbs distribution as a dynamic equilibrium concept, which they dubbed the logit equilibrium; see also Ferraioli [28]. This concept is well-defined for general finite games; see, for example, Auletta et al. [12]. The goal of this work is to study algorithmic aspects of the logit equilibrium/Gibbs distribution in congestion games. Auletta et al. [12] give a tight bound for the mixing time of the logit dynamics in general potential games, which can be exponential in the quantity (as in Example 1). They also study graphical coordination games, for which they identify various types of behavior of the logit dynamics; we refer to Auletta et al. [12] for more details. Because of the general slow mixing, Auletta et al. [10] also study the concept of metastability, which means a Markov chain stays close to some distribution on a timescale shorter than the mixing time. More work on the logit dynamics in potential games by these authors can also be found in Auletta et al. [9, 11]. We also refer to the survey article of Ferraioli [28] and references therein. There are also various results addressing the inefficiency of “long-term” equilibria in the context of logit dynamics; see, for example, the works of Asadpour and Saberi [8], Mamageishvili and Penna [45], and Penna [54].
A special case of potential games for which logit-like dynamics is studied extensively is the Glauber dynamics for the Ising model, which, in game-theoretical terms, can be seen as logit dynamics for max-2-cut games that are described in Section 1.1. Whether the logit dynamics are rapidly mixing in this case depends on the parameter and graph topology G; see, for example, the work of Levin et al. [44] and references therein. Jerrum and Sinclair [39] show that, nevertheless, there exists a polynomial time algorithm to sample from the Gibbs distribution for any rationality level and any graph topology. This work is partially motivated by studying a similar question for special cases of congestion games as explained later.
As said before, in general, the logit dynamics converge slowly to the Gibbs distribution (Auletta et al. [12]); in particular, the number of steps needed might be , where is the maximum value attained by Rosenthal’s potential. We next give a simple example illustrating this fact.
Consider the congestion game with players and two resources . Both resources can be used by both players, that is, , and have a cost function satisfying and for some . Note that .
We assume that T is fixed in this example, in particular, independent of . If is large, the Gibbs distribution assigns weight close to 1/2 to the strategy profiles and weight close to (1/2) to . We summarize these facts in Figure 1. Now, informally speaking, if we consider the logit dynamics with starting profile (a, b), then in order to reach the profile (b, a), we have to go through either the profile (a, a) or (b, b). The probability of transitioning to one of those profiles in one step of the logit dynamics is . As both profiles (a, b) and (b, a) appear with probability close to 1/2 in the Gibbs distribution, the probability of being in either of these profiles after a (large) number of steps of the logit dynamics should be approximately equal. However, escaping from the profile (a, b) happens with probability . The inverse of this probability then forms a lower bound on the number of steps of the logit dynamics needed to get close to the stationary Gibbs distribution.
What is causing the slow convergence in Example 1? The problem is that we need to use either the profile (a, a) or (b, b), both having very small probability in the Gibbs distribution, to move from (a, b) to (b, a). However, as it turns out, if one in addition with some probability is allowed to interchange the strategies of players 1 and 2, then the resulting dynamics converges quickly to the Gibbs distribution (being a special case of Theorem 1). Note that this enables the possibility to directly transition between (a, b) and (b, a). This motivates the following question: Is there a (simple) Markov chain on that converges rapidly to the Gibbs distribution over at any rationality level T?
This question may be interpreted as the natural analogue of looking for other local search procedures converging quickly to a PNE when best/better response dynamics does not have this property.
When the answer to the preceding question is not directly obvious, one can take another step back and first ask whether it is at all possible to efficiently sample from the Gibbs distribution. Informally speaking, can we take “snapshots” (according to the Gibbs distribution) from the system in equilibrium in polynomial time? More formally speaking, does there exist an efficient algorithm to sample (approximately) a strategy profile according to the Gibbs distribution over at any rationality level T? This question can be interpreted as a dynamic analogue of the second question posed earlier for the computation of pure Nash equilibria. That is, although (deterministic) better/best response dynamics might take a long time to converge to a pure Nash equilibrium, one still wants to know whether a PNE can be computed efficiently by other means. Similarly, although logit (or other natural) dynamics can take a long time to converge to the Gibbs distribution, we may still ask whether, by means of sampling, we can get an impression of what the Gibbs distribution over looks like. Note that a positive answer to the first question posed earlier gives a positive answer to the second question as one can simply run the Markov chain sufficiently long in order to generate a sample from the Gibbs distribution.
These questions are made precise in Section 2. We remark that one of the aspects that makes them nontrivial is the fact that we require the questions to hold for any rationality level T, that is, T is considered part of the input. For example, in Example 1, we could set to circumvent the problem arising there.

1.1. Our Contributions
In this work, we address the questions posed for various special cases of congestion games.
1.1.1. Extension Parallel Congestion Games.
The class of extension parallel (EP) congestion games is a well-studied special case of symmetric network congestion games; see, for example, Fotakis [29], Fujishige et al. [30], and Holzman and Law-Yone [37]. Here, the common strategy set of all players is given by the set of o, d-paths of an extension parallel graph; see Section 4 for a definition and example. Our main result is that there is a simple Markov chain converging quickly to the Gibbs distribution over , also implying that we can sample approximately from the Gibbs distribution. We show that, if one, in addition to the logit dynamics transitions, is allowed to randomly interchange the strategies of two players (akin to the explanation given after Example 1), the resulting Markov chain converges quickly to the Gibbs distribution. We call this the relaxed logit dynamics; see Section 4.2 for a formal definition. Note that Theorem 1 gives a doubly exponential improvement with respect to the dependence on compared with the lower bound as given in Example 1.
(Informal). The relaxed logit dynamics for EP congestion games, at rationality level T, converges to a distribution ϵ-close to the Gibbs distribution in at most
The notion of “ϵ-close” refers to the fact that the distribution seen after the indicated number of steps differs from the Gibbs distribution at most ϵ in total variation distance (see Section 2.5), a well-known distance measure for comparing probability distributions in Markov chain theory.
In a nutshell, Theorem 1 follows from the fact that, in EP congestion games, Rosenthal’s potential is M-convex as is shown by Fujishige et al. [30]. M-convexity is a property defined in the area of discrete convex analysis (Murota [51]) (see Section 2.3).2 The link between M-convexity and sampling is, roughly speaking, established in a series of (breakthrough) papers by Anari et al. [3, 5, 6] through the theory of strongly log-concave (SLC) polynomials. The theory of strongly log-concave (or Lorentzian) polynomials dates back to the work of Gurvits [34] and is further developed by Anariet al. [3] and Brändén and Huh [17]. In particular, Anari et al. [5] give the first polynomial-time algorithm for approximately sampling and counting the number of bases of a given matroid, resolving also an old conjecture by Mihail and Vazirani [50]. In this work, we rely on the sampling result from Anari et al. [5], albeit for relatively simple matroid structures.
Before proving the preceding result, we also give another way of sampling from the Gibbs distribution in Section 4.1 that essentially is a more direct approach than the sampler induced by the Markov chain result given (Theorem 4). The high-level approach used for this more direct sampler is given in Section 3 (and also used in Section 6). Finally, we give an application of our results to the problem of (approximate) uniformly sampling pure Nash equilibria in EP congestion games in Section 4.3, which is, to the best of our knowledge, the first of its kind.
1.1.2. Max-k-Cut Game on a (Unweighted) Complete Graph.
A max-k-cut game (see, e.g., Gourvès and Monnot [33]) is given by an undirected graph whose nodes V are the players. Every has strategy set whose elements are referred to as colors. The utility of a player is the number of neighbors that choose a different color. They can be modeled as a special case of extension parallel congestion games when G is the complete graph. This is explained in Section 5.
As an application of Theorem 1, we show that the relaxed logit dynamics is rapidly mixing at any rationality level T for max-k-cut games when G is the complete graph (Corollary 2). This stands in stark contrast to known results regarding the logit dynamics on complete graphs for the case k = 2, whose mixing time depends heavily on the value of the rationality level T (Levin et al. [44]). We elaborate on this in Section 5.
1.1.3. Capacitated Uniform Congestion Games.
Finally, we study the class of so-called u-capacitated k-uniform congestion games for given and . In such a game, the strategy set of player is given by all subsets of E of size ki. Furthermore, for every , we are given a capacity ue so that whenever .
The motivation for studying these games comes from the class of base-matroid congestion games, in which the strategy set of every player is the set of bases of a given matroid over the ground set of resources E. It is well-known that best response dynamics converge to a PNE in a polynomial number of steps in this class of games (Anari et al. [2]), and so, in particular, a PNE can be computed in polynomial time. Given the base-matroid sampling result of Anari et al. [5], a natural question that comes to mind is if a similar result exists for sampling from the Gibbs distribution in base-matroid congestion games. Here, we give a first result addressing this question. (Note that, in our setting, the strategies of player are the bases of the ki-uniform matroid.)
We next explain why there is a need for capacity constraints in our results. A strategy profile s can be seen as a bipartite graph in which the nodes on one side correspond to the players, having degrees ki, and the nodes on the other side correspond to the resources, having degrees (the resource load on e in profile s). For a given profile s with resource load profile , the bipartite graph is obtained in the natural way: there is an edge between player i and resource e if player i uses resource e in profile s. Very roughly speaking, in order to apply the sampling result in Anari et al. [5], we establish strong log-concavity of a certain polynomial associated to the vectors k and u. For this, we rely on an asymptotic enumeration formula for the number of bipartite graphs with a given degree sequence, which, in terms of congestion games, gives the number of strategy profiles that have a given resource load profile . (Asymptotic enumeration of graphs with given degrees is studied extensively in the area of combinatorics.) The formula that we use is only valid for the range of and resource load profiles satisfying the imposed capacity constraints as given in Theorem 2. Our main result is as follows.
(Informal). There is an (almost) polynomial time algorithm for approximately sampling from the Gibbs distribution in u-capacitated k-uniform congestion games assuming that when , where and .
See Remark 6 for an explanation of what we mean by “almost” polynomial time. The proof of Theorem 2 reveals an interesting connection between M-convexity and asymptotic enumeration formulas that might be of independent interest.
1.2. Technical Approach
In this section, we give an intuitive outline of the approach taken in order to prove the results as summarized in Section 1.1. For the sake of simplicity, we work with symmetric singleton congestion games in which the common strategy set contains every individual resource as a strategy, that is, for all . These games form a special case of EP and capacitated uniform congestion games. All technical notions not formally defined here can be found in Section 2.
Let us recall Example 1, which is a symmetric singleton congestion game with two resources. The cause of the long convergence time in this example is the fact that transitioning between strategy profiles in which both resources have the same load profile, that is, the number of players using the resource, is difficult. To be precise, in Example 1, both profiles (a, b) and (b, a) have the same load profile, but they correspond to different strategy profiles.
In order to still be able to sample from the Gibbs distribution, we take two approaches:
We first sample a load profile according to the correct probability and then sample a strategy profile corresponding to that load profile (Algorithm 1 in Section 3).
We augment the logit dynamics with additional transition possibilities, making it easier to transition between strategy profiles with the same load profile.
In the first approach, the “correct probability” with which a given load profile should be sampled is the sum of all the probabilities assigned to strategy profiles with the given profile. The resulting distribution is called the Gibbs distribution induced on the load profiles. For the applications in Sections 4–6, sampling a strategy profile with a given load profile can be done by either generating a random permutation of the players or relying on known sampling results in the literature. In particular, in the case of capacitated uniform congestion games, sampling a strategy profile for a given load profile corresponds to sampling a bipartite graph with a given degree sequence.
The main technical challenge lies in sampling a load profile with the correct probability. In the case of symmetric singleton congestion games, the collection of all load profiles can be modeled by the set , where xe denotes the load on resource . The set R is a special case of a discrete polymatroid. In order to sample a (resource) load vector x from R with the correct probability, we consider a Markov chain that can transition between load vectors x and y for which ; that is, x and y differ in one unit of load on precisely two resources (but have the same total load). The transition probabilities are chosen in such a way that the stationary distribution of the Markov chain is precisely the Gibbs distribution induced on the load profiles.
The Markov chain that we consider over the set R is a “polymatroid version” of the base-exchange Markov chain studied by Anari et al. [5] for sampling a base of a given matroid. The state space of this base-exchange Markov chain is the set of all bases of a given matroid. It randomly exchanges some element of the current base with another element not in the base, resulting in another base of the matroid. This idea relies on the well-known base-exchange property of matroids (Schrijver [57]). In their breakthrough work Anari et al. [5] show this Markov chain to be rapidly mixing when the stationary distribution is strongly log-concave. Strong log-concavity is a property of a polynomial associated with the stationary distribution of a Markov chain (defined for both discrete polymatroids and matroids). We apply the result from Anari et al. [5] by giving a reduction from sampling a base of a polymatroid, that is, a load vector from R, to the problem of sampling a base of a matroid using a well-known construction of Helgason [35]. For this reduction, we rely on some properties that preserve strong log-concavity (Brändén and Huh [17]). Whereas this reduction works for arbitrary discrete polymatroids, in our applications, we use relatively simple polymatroid structures. What is left then is to show that the Gibbs distribution induced on the load profiles indeed satisfies the notion of strong log-concavity. Based on results in Brändén and Huh [17], it turns out that, for symmetric singleton congestion games, this is the case because of the fact that Rosenthal’s potential is a separable convex function.
In order to obtain our sampler for EP congestion games (Theorem 4), we use a similar approach based on load profiles, but instead, we rely on the M-convexity of Rosenthal’s potential for such games (Fujishige et al. [30]). For Theorem 2, we also use a similar approach.
In order to show that the logit dynamics converge quickly to the Gibbs distribution when we additionally allow the interchanging of the strategy of two players, we use a Markov chain decomposition argument. This approach results in the proofs of Theorem 5 and Corollary 2. The idea of Markov chain decomposition is to divide the state space into smaller sets. If one can prove that the Markov chain converges quickly to its (induced) stationary distribution on the smaller sets and that it is easy to move between these smaller sets, then the original chain converges quickly to its stationary distribution as well. We elaborate more on this approach before giving the proof of Theorem 5.
1.3. Discussion and Further Related Work
To the best of our knowledge, ours are the first (polynomial-time) sampling results for the Gibbs distribution in congestion games beyond the well-studied case of max-cut games on general graphs, also known as the Glauber dynamics for the Ising model. Given the extensive attention that logit dynamics has received in various communities as well as the topic of approximate sampling in theoretical computer science, we believe this to be an interesting line of work at the intersection of algorithmic game theory, combinatorics, and approximate sampling to pursue further.
In particular, for special cases of congestion games with a positive answer to questions 1 and 2 posed in the introduction, do there also exist positive answers for their dynamic analogues? As a concrete open question, we ask whether it is always possible to efficiently sample from the Gibbs distribution in general base matroid congestion games (Ackermann et al. [2]). If true, this provides an interesting (qualitative) game-theoretical generalization of the sampling result of Anari et al. [5].
We conclude this section with some more background and related work concerning the result of Anari et al. [5]. A long-standing open question (Mihail and Vazirani [50]) in the area of approximate sampling and counting asks if there exists a rapidly mixing Markov chain to sample approximate uniformly at random a base of a given matroid (which, in turn, also yields a polynomial-time approximation scheme for counting the number of bases of a given matroid).3 Anari et al. [5] show that the base-exchange Markov chain is indeed rapidly mixing. In fact, they prove the more general result that this chain is rapidly mixing, using appropriate transition probabilities, for any strongly log-concave stationary distribution; see Section 2.4. An (up to constant factors) tight mixing time bound for the base-exchange Markov chain is later given by Anari et al. [6]. Before Anari et al. [5], rapid mixing of the base-exchange Markov chain (with uniform stationary distribution) was only known for special cases. In particular, Feder and Mihail [27] prove rapid mixing for balanced matroids, using Sinclair’s [58] multicommodity flow method. Examples of balanced matroids are the graphic and regular matroids.
2. Preliminaries
In this section, we give all the necessary preliminaries regarding congestion games, strongly log-concave polynomials, and the relevant Markov chain notions and results. We start with some general notation.
All logarithms in this work have Euler’s number e as their base unless specified otherwise. For , we write . For two vectors , we write if for , and x < y if strict inequality holds for at least one i. Furthermore, with , we denote the modulus of x. We use to denote the standard basis of , that is, if j = I and otherwise. For sets A, B, and C, we use the notation to denote the set .
2.1. Congestion Games
A capacitated congestion game Γ is given by a tuple , where is a finite set of players, a finite set of resources (or facilities), the set of strategies of player , and the cost function of resource that satisfies whenever for with W a sufficiently large number. Unless stated otherwise, the cost functions are assumed to be nonnegative and nondecreasing. Finally, ue is a nonnegative integer modeling the capacity on resource . If ue = n for every resource , we simply call Γ a congestion game. (Extension parallel and capacitated uniform congestion games are defined in Sections 4 and 6, respectively.) For a strategy profile , we define to be the number of players using resource e, that is, . A game is called symmetric if for all . We then write to denote the common strategy set of all players.
We call the resource load profile corresponding to strategy profile s. We say that a strategy is feasible if for every and write to denote the set of all feasible strategy profiles. We only consider games in which the set of feasible strategy profiles is nonempty. More generally, we say that is a (feasible) resource load profile for if there is some (feasible) strategy profile s such that . We write for the set of strategy profiles whose resource load profile is y.
Similarly, for symmetric congestion games, we define the notion of a strategy load profile that models how many players are using a strategy in a given strategy profile . More precisely, given a strategy profile , we define as the number of players choosing strategy in strategy profile s. The vector is called the strategy load profile of s. Similarly as for resource load profiles, we define (with a slight abuse of notation) to be the set of strategy profiles s for which .
The cost of player under a strategy profile is given by A strategy profile is called a (pure) Nash equilibrium if, for every and every , it holds that , where denotes the strategy profile in which player i plays and every other player plays sj. We write to denote the set of all pure Nash equilibria of Γ.
We say that is an exact potential function for a congestion game Γ if, for every strategy profile , for every player , and every unilateral deviation of i it holds that Rosenthal [55] shows that
A function is called convex if for all . A function is called separable convex if it is of the form , where the ψj, given by , are convex. We say that is concave if is convex, and similarly, ψ is separable concave if is separable convex. A simple but important observation that we use in this work is the fact that Rosenthal’s potential is a separable convex function when seen as a function from resource load profiles to the reals. That is, the function given by
If the cost functions are nondecreasing, then Rosenthal’s potential is a separable convex function.
2.2. Gibbs Distribution and Logit Dynamics
The Gibbs distribution over the strategy profiles of a congestion game Γ is given by
The logit dynamics Markov chain (formal Markov chain definitions are given in Section 2.5) with current state proceeds as
Select a player uniformly at random.
For , transition to with probability with normalizing constant
It is a standard fact, which can be shown by using (1), that this Markov chain is reversible with respect to the Gibbs distribution.
2.3. Matroids and M-Concavity
Let be a finite set called the ground set and a collection of subsets of E (called independent sets). The pair is a matroid if (i) ; (ii) and , then ; (iii) and , then there exists an such that . An independent set of maximum size is called a basis. We use to denote the set of all bases of . The set of bases satisfies the so-called base-exchange property: if and , then there exists an such that . It satisfies the strong base-exchange property if both . The rank of a matroid is the common cardinality r of all bases in . The -truncation of a matroid is the matroid with if and only if and . The partition matroid is given by a disjoint partition of the ground set E and upper bounds ui for . A set is independent if and only if for all . The k-uniform matroid is the matroid in which is independent if and only if .
A discrete polymatroid is a finite set of vectors with the properties that (i) ; (ii) if and , then ; and (iii) if with , then there is a vector such that (in which the maximum is taken coordinate-wise). The set of bases is given by all maximal vectors in R that have a common modulus r. A polymatroid satisfies the base-exchange property; if and xi > yi, then there exists an index j with yj > xj and .
Finally, as a generalization of discrete polymatroids, we describe the notion of M-convexity for functions Murota [51, 53]. As we mostly work with its negated counterpart of M-concavity, we describe this first. Let be a function. The effective domain of ν is given by
The function ν is called -concave if it satisfies the (symmetric) exchange property: for any and any satisfying , there exists a such that and
It is well-known that a separable concave function is -concave (Murota [52]). The function ν is called M-concave if it is -concave and, in addition, there is an such that . A function is called M-convex if is M-concave.
2.4. Strongly Log-Concave Polynomials
We consider polynomials with nonnegative real coefficients. For a vector , we write
For a constant with , we write . Let and . Let be a weight function. The generating polynomial of w is given by
The support of is the set . The generating polynomial g is called d-homogeneous if for all with . It is called multiaffine if every variable xi appears with at most multiplicity one in every monomial of p. For example, is multiaffine, but is not as the multiplicity of x1 in the first monomial is two. Finally, the elementary symmetric polynomial of degree d, for , is given by
(Strong Log-Concavity; Gurvits [34]). A polynomial with nonnegative coefficients is called log-concave on a subset if its Hessian is negative semidefinite on S. A polynomial p is called SLC on S if, for any , we have that is log-concave.
For convenience, the zero polynomial is defined to be strongly log-concave always. It is interesting to note that if a d-homogeneous multiaffine polynomial p is SLC, then the support of p must form the collection of bases of a matroid, and more generally, if a (not multiaffine) homogeneous polynomial is SLC, its support forms an M-convex set (Brändén and Huh [17]). Finally, if the generating polynomial is strongly log-concave, then the probability distribution is called strongly log-concave.
The definition of strong log-concavity is in this work is not really needed but included for completeness. In our proofs, we essentially only rely on properties of SLC polynomials from the literature that are reviewed as follows. For homogeneous generating polynomials the notion of strong log-concavity is equivalent to that of a polynomial being Lorentzian (Brändén and Huh [17]) or completely log-concave (Anari et al. [4]). These equivalences are shown in Brändén and Huh [16].
We next state all properties of SLC polynomials that are used in this work. First, it is easy to check that the SLC property is preserved under multiplication with a nonnegative scalar, which we state here for sake of reference.
(Brändén and Huh [17]). If is SLC and , then is SLC.
We continue with the polarization operator defined by Brändén and Huh [17]. The polarization operator introduces auxiliary variables in order to turn p into a multiaffine polynomial over a larger set of variables. (We elaborate on polarization in Section 3.1.) Formally, following Brändén and Huh [17], for , let
(Brändén and Huh [17]). If p is d-homogeneous and SLC over , then is d-homogeneous and SLC over .
We conclude this section by stating a large class of homogeneous polynomials that are known to be strongly log-concave.
(Brändén and Huh [17]). For and a nonnegative weight function, consider
2.5. Markov Chains
Let be an ergodic and time-reversible Markov chain with state space Ω, transition matrix P, and stationary distribution π (which is unique because of the ergodicity assumption). Reversibility means that for all . We write for the distribution over Ω at time step t with initial state . The total variation distance of two distributions π and σ over Ω is defined as
2.5.1. Markov Chain Decomposition.
Let be a disjoint partition of the state space Ω. Following Martin and Randall [46], consider and let be defined by
The Markov chain on with transition matrix is called the projection chain on the partition . It is time-reversible with respect to the distribution over . For the restriction chain on Ωi has transition matrix given by
It is time-reversible with respect to the distribution for . In Appendix A.1, we give a Markov chain decomposition result based on the modified log-Sobolev constant ρ.
2.5.2. Base-Exchange Markov Chain.
Let be a matroid, and let π be an SLC probability distribution over the set of bases given by for some nonnegative weight function . Here, means that . The base-exchange Markov chain on is defined by the following transitions, in which is the current state of the Markov chain:
Select an element uniformly at random and remove it.
Pick a base with with probability among all such bases .
It is not hard to see, using the base-exchange property, that this procedure defines an ergodic, time-reversible Markov chain with stationary distribution π. Anari et al. [4] show that this chain is rapidly mixing for any matroid . In particular, in a recent follow-up work, they give a tight mixing time bound (Anari et al. [6]).
(Anari et al. [6]). Let be matroid of rank r, and let π be an SLC probability distribution over the set of bases given by for some weight function . Then, the mixing time of the base-exchange random walk satisfies
We note that the mixing time is independent of the size n of the ground set E of the matroid as well as the stationary distribution π. In this work, Theorem 3 is essentially only applied to partition and uniform matroids. Furthermore, Cryan et al. [20] show that the modified log-Sobolev of the base-exchange random walk satisfies , where r is the rank of the matroid.
2.6. Sampling Algorithms
Consider a class of (capacitated) congestion games with n players and m resources and cost functions for . Let be a weight function. In this work, an algorithm for sampling according to a distribution ϵ-close to π with is said to run in (randomized) polynomial time if the number of arithmetic operations can be upper bounded by a polynomial in and . (We assume the strategy sets can be represented in a compact form.) The generation of a uniform random 0/1 bit is considered to be one arithmetic operation.
(Real Numbers). In this work, we use Markov chains whose transition probabilities are, in general, not rational numbers (in particular for the Gibbs distribution). Whenever we use real numbers, it is implicitly assumed that we use sufficiently accurate approximations to these numbers. All our results remain valid when (real-valued) transition probabilities are replaced by sufficiently accurate rational approximations. We note that, roughly speaking, whenever we want to generate Markov chain transitions with probabilities proportional to for , our algorithms run in pseudo-polynomial time in terms of the values of the cost functions of the congestion game under consideration.
All our algorithmic results are based on running Markov chains for a sufficiently long time. We usually write our running time bounds as the product of two factors: the number of steps that we need to run the Markov chain (before it’s close to stationarity) and the complexity of implementing one such step. In particular, in all cases, the transitions probabilities of one step are determined by a sequence of rational numbers and , and we want to sample an index with probability
Here, z as well as the encoding size of the qi are . We refer to as the computational complexity of sampling an index i according to (approximations of) the preceding probabilities in order not to overload our theorem statements. We say that probabilities of the form (8) are suitable.
2.7. Bipartite Graphs
An (undirected) bipartite graph is given by two disjoint sets of nodes and with . We say that G has degree sequence if for and for , where d(v) denotes the degree of node v in G. We write for the set of all bipartite graphs on with degree sequence .
3. General Approach
In Section 2.1, we give two possible definitions for the load profile of a strategy profile . For general congestion games, we define the resource load profile that keeps track of how many players use a particular resource e in s. For symmetric congestion games, we may in addition consider the strategy load profile that keeps track of how many players use a particular strategy t from the common strategy set .
A general approach for sampling a strategy profile according to the Gibbs distribution in Sections 4.1 and 6.1 is to first sample a (resource or strategy) load profile α according to approximately the right probability and then sample a strategy profile uniformly at random. Remember that the set is used to denote all strategy profiles with the given (resource or strategy) load profile. The approach is summarized in Algorithm 1, in which we give the formulation for resource load profiles because we use Rosenthal’s potential as given in (4). The formulation for strategy load profiles is exactly the same with replaced by as given in (3). We note that, in order to sample a strategy load profile uniformly at random in symmetric games, it suffices to generate a random permutation of the players in .
Sampling a load profile with the correct probability in our applications corresponds to sampling a base of a discrete polymatroid according to a strongly log-concave distribution. In order to do this, we present a reduction of this problem to that of sampling a base of a matroid according to a strongly log-concave distribution in Section 3.1 (after which we can rely on Theorem 3).
(Gibbs Sampler for Congestion Game Γ)
Input: Congestion game Γ, rationality level , and .
Output: Strategy profile according to distribution that is ϵ-close to Gibbs distribution π at rationality level T.
Step I: Sample load profile α according to a distribution that is ϵ-close to given by
Step II: Sample strategy profile (approximate) uniformly at random.
3.1. Sampling Bases of Discrete Polymatroids
In this section, we describe how to generate a discrete polymatroid base, according to a strongly log-concave distribution over the set of all polymatroid bases by reducing it to the problem of generating a base of a matroid. This follows more or less directly from Theorem 3 by using the notion of polarization. Polarization can be seen as a functional version of the classic reduction from discrete polymatroids to matroids as given by Helgason [35]. (See also Schrijver [57, chapter 44.6b] for this reduction.)
For a polymatroid , consider a d-homogeneous strongly log-concave polynomial
This is with positive coefficients and support the set of bases . Consider the matroid on ground set , where if and only if the vector given by satisfies . The fact that is indeed a matroid follows directly from the fact that R is a polymatroid. Note that, for a given , we have
We slightly abuse notation here and write for the all d-vector.
Then, with the set of bases of , the polarization of g can be written as
Polarization should be interpreted as spreading out the weight for equally over all bases . Proposition 3 implies that is also strongly log-concave.
Let so that and take . Then,
Note that, looking at the support of p, we have monomials corresponding to and monomials corresponding to .
Corollary 1 now follows directly from Theorem 3. It says the following. Suppose the current state of the base-exchange Markov chain after t steps, starting from any state , is the base , and suppose we output the polymatroid base . If t is large enough such that we are in state B with probability close to for every , then is outputted with probability close to with and as in (10).
Let π be the distribution over with , and let be the distribution over with . Let , and let be the distribution over after t steps of the base-exchange Markov chain . Let σt be the induced distribution over given by If , then .
We have
This gives the desired result. □
It is possible to define a more direct Markov chain on the set of all bases of a given discrete polymatroid and prove that this chain is rapidly mixing (also based on Theorem 3), but this is not needed for our results.
4. Extension Parallel Congestion Games
An extension parallel congestion game is a symmetric congestion game in which the common strategy set of the players consists of the o, d-paths in a (directed) extension parallel network with source o and target d. For two given networks with source oi and target di for i = 1, 2, let be the union of G1 and G2. The parallel composition of G1 and G2 is the network obtained by identifying o1 with o2 and d1 with d2. These nodes are the source and target of , respectively. The series composition of G1 and G2 is obtained by identifying d1 with o2. The node o1 becomes the source of and d2 its target. An extension parallel network consists of (i) a single arc (o, d), (ii) two extension parallel networks in parallel, or (iii) a single arc in series with an extension parallel network. An example is given in Figure 2. For a given extension parallel graph G, we use to denote all o, d-paths in G. Note that, for an extension parallel network, we have .

In this section we are always working with strategy load profiles (and so these are sometimes simply referred to as load profiles). The set of all possible strategy load profiles is denoted by . We consider the potential defined by for some . This is well-defined as the potential value is the same for any choice of . The main result that we need in this section is the M-convexity of Rosenthal’s potential for EP congestion games.
(Fujishige et al. [30]). Let Γ be an extension parallel congestion game. Then, the potential defined by for is M-convex.
The M-convexity of follows, in a nutshell, from the fact that the collection of sets , where is the set of all paths containing arc a, form a laminar family in the case of extension parallel networks. This implies that the potential is laminar convex, which yields the M-convexity property of the potential. We refer the reader to Fujishige et al. [30] for more details.
Before giving our main result as sketched in Section 1.1, we first give a more direct approach for sampling from the Gibbs distribution in EP congestion games.
4.1. Sampling from the Gibbs Distribution
As mentioned in Section 3 and sketched in Algorithm 1, the high-level algorithmic idea for sampling a strategy profile according to the Gibbs distribution consists of first sampling a load profile α with the correct probability and then a strategy profile from uniformly at random. The main result of this section based on this approach is stated in Theorem 4.
Let and , and let Γ be an extension parallel congestion game with n players. There is a randomized algorithm with output distribution over that is ϵ-close to the Gibbs distribution π at rationality level T and runs in (expected) time with C the complexity of implementing one step of a base-exchange Markov chain with suitable probabilities (see Section 2.6).
Note that, for an extension parallel congestion game, the number of strategy profiles corresponding to a given load profile α is . (This is the number of ways in which we can assign n labeled balls to bins , where bi contains αi balls.)
The n-homogeneous generating polynomial
Strong log-concavity is preserved under scalar multiplication by Proposition 2, so it suffices to show that
Because of Lemma 1, the polarization of g in (11) is also strongly log-concave. The support of can be seen as the bases of the n-uniform matroid on ground set . Our algorithm now consists of first running the base-exchange Markov chain for steps, starting from any initial base. We output , where B is the state we are in after the steps that were carried out. The resulting distribution over satisfies by Corollary 1. We then uniformly at random choose a strategy profile from . Let be the resulting output distribution over . It remains to show that , which the following calculation shows. Note that
We conclude with analyzing the running time of the algorithm. One step of the base-exchange Markov chain can be implemented in time O(C) by definition. Generating an uniformly at random can be done by generating a uniform random permutation μ of . We set for players for players , and so on. Generating a uniform random permutation can be done in time . □
4.2. Relaxed Logit Dynamics
In this section, we give the proof of Theorem 1, reformulated in Theorem 5. Formally, the relaxed logit dynamics Markov chain with current state proceeds by
With probability 1/2: Select two players uniformly at random and transition to given by
With probability 1/2: Perform a transition according to the logit dynamics (as in Section 2.2).
We note that, for any symmetric congestion game, this is a well-defined ergodic, time-reversible Markov chain with the Gibbs distribution as stationary distribution.
For an extension parallel congestion game Γ with common strategy set and initial state , the mixing time of the relaxed logit dynamics Markov chain at rationality level satisfies
Compared with the mixing time of the (nonrelaxed) logit dynamics for general games (Auletta et al. [12]), we get a doubly exponential improvement in terms of the dependence on (at the cost of a small polynomial increase in the dependence on n).
The proof of Theorem 5 relies on a Markov chain decomposition argument. The idea underlying Markov chain decomposition is to divide the state space into a disjoint union of smaller sets. If one can show that the Markov chain is rapidly mixing when restricted to every smaller set and that it is easy to transition between the different smaller sets, then the original chain is rapidly mixing as well. In our setting, every load profile is identified with a smaller set, which is formed by all strategy profiles inducing the given load profile. To prove rapid mixing of the smaller sets, we compare the restricted chains to the so-called random transposition Markov chain, which is known to be rapidly mixing; see, for example, Goel [32]. To prove rapid mixing between the smaller sets, that is, between load profiles, we use again the connection to the base-exchange Markov chain but in a different way than in the proof of Theorem 4. In order to prove rapid mixing of all the Markov chains involved, we show that their modified log-Sobolev constant can be bounded appropriately. Recall that this is a quantity that can be used to upper bound the mixing time of a Markov chain (see Appendix A).
We use a Markov chain decomposition argument based on the two operations that define the relaxed logit dynamics Markov chain. We first partition the state space naturally based on load profiles by setting for , whereas before we have . Our proof approach is to apply the Markov chain decomposition theorem of Hermon and Salez [36] as given in Theorem A.1. In particular, for this, we need to bound the modified log-Sobolev constants of the projection and restriction chains. We start with the modified log-Sobolev constant of the projection chain.
The projection chain has state space and stationary distribution for . Let such that ; that is, there exist paths p and such that
In this case, we say that α and β are adjacent load profiles differing on paths p and . Note that, if and are such that they differ by a deviation of some player i from path p to path , then , and this expression is the same for every such player i with si = p. Moreover, note that for any two strategy profiles .
For some fixed choice of strategy profile and player i using path p, that is, si = p, the transition probabilities for adjacent load profiles can then be seen to equal (with Z and are the normalizing constants as in Section 2.2)
The lower bound of serves as our lower bound on χ as defined in Appendix A.1. In order to bound the modified log-Sobolev constant of the projection chain, one can use a comparison argument (as defined in Appendix A.2) with the base-exchange Markov chain on the support of the polarization of as in (11). In this section, the support corresponds to the set of bases of an n-uniform matroid. In particular, it holds that
We continue with bounding the modified log-Sobolev constant of the restriction chains. In order to do this, we use a comparison argument with the random transposition Markov chain on the set Sk of all permutations of . Given a permutation σ, this chain proceeds by selecting two positions a and b uniformly at random and interchanging the positions of the elements and . With ρrt denoting the modified log-Sobolev constant of this chain, it follows that, for every , we have
4.3. Uniform Sampling of Pure Nash Equilibria
In Theorem 6, we show that the result in Theorem 4 also implies that, for an extension parallel congestion game Γ, we can (approximate) uniformly at random sample a pure Nash equilibrium from the set of all pure Nash equilibria of Γ in pseudo-polynomial time. That is, we sample every with probability . The (approximate) uniform sampling of combinatorial objects has received a lot of attention in the last 30 years, in particular within the area of theoretical computer science. However, to the best of our knowledge, no nontrivial results for (pure) Nash equilibria are known despite the fact that the problem of computing Nash equilibria has received much attention.
For the proof of Theorem 6, we use the fact that Nash equilibria are precisely the strategy profiles minimizing Rosenthal’s potential in EP congestion games. Furthermore, we exploit the fact that Theorem 4 holds for any rationality level . In particular, if we set T large enough, then most weight in the stationary distribution is assigned to profiles minimizing Rosenthal’s potential (under the assumption that the cost functions are integer-valued). This means that, with high probability, Algorithm 1 outputs a strategy profile minimizing Rosenthal’s potential with domain . (Whenever we refer to Algorithm 1, we mean the implementation of the high-level approach as given in the previous section.) We use the Gibbs distribution with base two instead of e to avoid having to work with real numbers.
Let , and let Γ be an extension parallel congestion game with integer-valued cost functions and n players. There is a randomized algorithm with output distribution over that is ϵ-close to the uniform distribution over and runs in (expected) time polynomial in and , where is the maximum value attained by Rosenthal’s potential.
For the proof of Theorem 6, we use the following correspondence between Nash equilibria and strategy profiles minimizing Rosenthal’s potential.
(Fotakis [29], Holzman and Law-Yone [37]). The set of strategy profiles of an extension parallel congestion game Γ coincides with the set of strategy profiles that minimize Rosenthal’s potential as in (3).
We first show that, for T sufficiently large in the algorithm used to prove Theorem 4, most weight is assigned to strategy profiles minimizing Rosenthal’s potential. We apply the idea in Algorithm 1 used to prove Theorem 4 with base two instead of base e. Remember that q is the number of (o, d)-paths in the extension parallel network of the game Γ. Let be the common potential value of all strategy profiles . For any other strategy profile , we have
The algorithm for sampling an (almost) uniform sample from now works as follows. First, compute a strategy profile minimizing Rosenthal’s potential in order to determine . This can be done efficiently; see, for example, Fotakis [29]. Then, run Algorithm 1 with and . If the resulting strategy profile has potential value , output this strategy profile, and otherwise, rerun Algorithm 1 until it does. Note that, with probability at least , Algorithm 1 outputs a strategy profile with potential value in one run. A simple argument then shows that the output distribution is ϵ-close to the uniform distribution over as desired. □
The pseudo-polynomial dependence coming from the polynomial dependence on rather than arises from the fact that we have to compute transition probabilities of the form , where the ai are integers, which requires random 0/1 bits (following the notion of suitable probabilities in Section 2.6). However, there is no a priori reason that the problem of (approximately) sampling pure Nash equilibria according to the uniform distribution requires pseudo-polynomial (in the input size of the original congestion game) time as opposed to sampling from the Gibbs distribution. We leave open the question of finding a (truly) polynomial time algorithm.
5. Max-k-Cut Games
In this section, we give an application of Theorem 5 to max-k-cut games. We first repeat their definition and show how they can be modeled as a special case of extension-parallel congestion games in case the graph G of the game is complete.
A max-k-cut game is given by an undirected graph whose nodes V are the players. Every has strategy set whose elements are referred to as colors. With the set of neighbors of node , the utility of player i is the number of neighbors that choose a different color (hence, the name max-k-cut game, referring to the well-known max-cut problem (Garey and Johnson [31])). Equivalently, the cost of a player is given by the number of neighbors that choose the same color; that is, for , we have and in terms of (relaxed) logit dynamics, considering the cost of a player is equivalent to considering its utility because is independent of the chosen color in s.
When is the complete graph, a max-k-cut game can naturally be modeled as a so-called symmetric singleton congestion game in which is the set of resources and every resource is equipped with the cost function for . Indeed, if players are using resource/color e, then the cost for every player choosing color is , which is precisely the number of neighbors choosing that color for every such node (because of the fact that G is the complete graph). Such games are easily seen to be special cases of extension parallel congestion games.
The relaxed logit dynamics may then be interpreted as the process by which either a player changes its color (according to the transitions as specified by the logit dynamics) or the colors of two players are interchanged. As , which is the total number of edges of the graph G, we obtain the following corollary of Theorem 5.
For a max-k-cut game played on the complete graph with initial state , where , the mixing time of the relaxed logit dynamics Markov chain at rationality level satisfies
Levin et al. [44] show that the mixing time of the (nonrelaxed) logit dynamics on the complete graph depends on the parameter in the case k = 2. They show there is a critical value Tc so that, when , the logit dynamics converge quickly to the Gibbs distribution, whereas when , the dynamics converge slowly. What Corollary 2 shows is that, if one in addition is allowed to randomly interchange the colors of two nodes, the slow mixing for can be resolved. We believe this to be of independent interest.
6. Capacitated Uniform Congestion Games
In this section, we consider u-capacitated k-uniform congestion games for vectors and . We write and . The vector u models the capacities of the resources , that is, the variables as defined in Section 2.1. The strategy set of player is given by all subsets of cardinality , that is, the bases of the ki-uniform matroid on E. We write for the collection of all u-capacitated k-uniform congestion games. We remark that, in this section, load profiles refer to resource load profiles as defined in Section 2.1 and no longer to path load profiles as considered in Section 4.
Note that we can naturally model a feasible strategy profile in of a capacitated uniform congestion game as a (simple) bipartite graph on : there is an edge if and only if player uses resource in its strategy si. The main result needed in this section is stated in Proposition 7. We use the notation for . For a bipartite degree sequence , we then write and . Note that .
(McKay [49]). Let be the collection of all bipartite degree sequences for which . Then,
6.1. Sampling from the Gibbs Distribution
In this section, we give an (almost) polynomial time sampling algorithm that samples from a distribution that is close to the Gibbs distribution provided the game is sufficiently large. That is, we show that, for a large class of pairs (u, k), we can sample from a distribution close to the Gibbs distribution.
We follow again the high-level approach in Algorithm 1. The set of all feasible load profiles is now given by and for any feasible load profile . Recall that we want to sample an with probability proportional to (approximately) and then sample a strategy profile with probability .
A couple of problems arise here compared with the case of extension parallel congestion games. First of all, there is no polynomial time algorithm known to compute the numbers . (In fact, it is still an open question whether this problem is P-complete; see, e.g., Jerrum et al. [40].) Instead, we rely on a fully polynomial randomized approximation scheme for computing approximations to the numbers up to arbitrary precision (Bezáková et al. [13], Jerrum et al. [40]). Second, in this case, the polynomial
Finally, the problem of sampling a strategy profile now corresponds to that of sampling a bipartite graph with degree sequence for which many algorithms are known. The main result of this section is given in Theorem 7.
Let , let π be the Gibbs distribution at rationality level , and let be the class of all congestion games satisfying
There is a randomized algorithm for the class and a constant such that the output distribution over has the property that whenever . The algorithm runs in (expected) time
Setting
The next step is now to show that replacing by in (20) gives rise to a strongly log-concave polynomial. The crucial observation here is to see that is a separable convex function.
The K-homogeneous generating polynomial
Following the proof of Lemma 1, first observe that (and note that k and all quantities involving k are considered fixed)
One can now carry out similar steps as in the proof of Theorem 4, albeit with some modifications. Again, the polarization of g as in (20) is strongly log-concave as well. The support of is now the set of bases of the n-truncation of a partition matroid on ground set , where with is independent if and only if for . It follows that the modified log-Sobolev constant of this chain satisfies . Using a Markov chain comparison argument as described in Remark A.1 in Appendix A.2 with because of (19) then yields that the modified log-Solev constant ρ of the Markov chain in which we use the original quantities , instead of the approximation , satisfies .
The algorithmic problem is now that we cannot compute the quantities exactly in polynomial time, so one step of this base-exchange Markov chain cannot be implemented efficiently. Nevertheless we can use the approximation scheme of Bezáková et al. [13] to compute approximations up to arbitrary precision in time polynomial in n, m and (this gives the dependence of in C). A Markov chain comparison argument (again as in Remark A.1) then implies that the base-exchange Markov chain on using these approximations is also rapidly mixing, and in particular, it is sufficient to run the chain
We conclude with the sampling of a strategy profile from the set uniformly at random, which now requires sampling a bipartite graph from the set uniformly at random. One algorithm to do this for degree sequences satisfying the condition in Proposition 7 is that of Arman et al. [7] that runs in expected polynomial time. For an overview of algorithms that can be used to (approximately) sample a bipartite graph with a given degree sequence, see, for example, Dyer et al. [23]. □
We remark here that the algorithm of Theorem 7 does not run in polynomial time as described in Section 2.6 because of the dependence of C on (as opposed to the required ). This dependence arises because of the algorithmic approximations of the numbers used in the proof of Theorem 7. Alternatively, we could just use the approximations straight away. However, these predictions only become accurate when , so this gives a weaker result in terms of closeness to the Gibbs distribution.
The author is grateful to Prasad Tetali for pointing him to Hermon and Salez [36] and to the anonymous reviewers of the 22nd ACM Conference on Economics and Computation and Mathematics of Operations Research for their useful comments. A large part of this work has been carried out while the author was a postdoctoral fellow at the Max Planck Institute for Informatics in Saarbrücken, Germany. An abstract of this work appears in the Proceedings of the 22nd ACM Conference on Economics and Computation (Kleer [42]).
Appendix A. Markov Chains and Functional Inequalities
Three well-known quantities that can be used to upper bound the mixing time of a Markov chain are the Poincaré constant, the log-Sobolev constant, and the modified log-Sobolev constant. In this section, we define the modified log-Sobolev constant.
Let be a time-reversible Markov chain with stationary distribution π and . Let . Furthermore, define the entropy-like quantity
The modified log-Sobolev constant of the Markov chain is defined by
As also mentioned in Section 2.5, it holds that
A.1. Markov Chain Decomposition
Let be a time-reversible Markov chain with stationary distribution π, and let be a disjoint partition of the state space. We write ρ for the modified log-Sobolev constant of this chain, for the modified log-Sobolev constant of the projection chain, and ρi for that of the restriction chain Pi (and define ).
Hermon and Salez [36] recently give a Markov chain decomposition theorem that applies to the Poincaré constant, the log-Sobolev constant, and the modified log-Sobolev constant. For other Markov chain decomposition theorems, see, for example, the work of Jerrum et al. [41], who, in particular, give stronger theorems for the Poincaré and log-Sobolev constants. We next describe the necessary objects to formulate their result.
Assume that, for each with and , we are given a coupling of the probability distributions πi and πj. That is, κij is such that
Based on the couplings κij, we define
(Hermon and Salez [36]). With the preceding notation, it holds that Furthermore, the parameter χ satisfies
A.2. Markov Chain Comparison
Another useful property of proving mixing time bounds through Poincaré and (modified) log-Sobolev constants is that it is easy to see that small perturbations in the transition probabilities and the stationary distribution only result in mild variations in these constants by means of a Markov chain comparison argument. Goel [32] shows the following for the modified log-Sobolev constant based on similar results for the other constants by Diaconis and Saloff-Coste [22].
(based on Goel [32, Lemma 4.1]). Let and be two finite, reversible Markov chains with stationary distributions π and , respectively, and modified log-Sobolev constant ρ and , respectively. Assume there is a mapping mapping any (arbitrary) to and constants and such that, for all f, we have
Then,
In particular, if and there exists a such that for all and for , it directly follows that
Appendix B. Comparison Arguments Omitted in Proof of Theorem 5
B.1. First Inequality in (14)
We start with showing the first inequality in (14), which is
Furthermore, is the modified log-Sobolev constant of the base-exchange Markov chain on with stationary distribution . Similar to what is explained in Section 3.1, we may write , where for , and is then the set of bases of the n-uniform matroid on . Roughly speaking, for every path , we introduce n auxiliary elements (corresponding to the n auxiliary variables introduced when polarizing).
For every , there are bases corresponding to it (as in Section 3.1). We denote this set of bases by
We start with defining the required mapping needed in Theorem A.2. For , we define simply by setting for . It can then easily be checked that as . This means that we can take c = 1 and B = 0 in Theorem A.2. In order to show the desired Dirichlet form inequality in the statement of Theorem A.2, it suffices to prove that, for any adjacent , it holds that
Now, fix and assume that they are adjacent (the case can be dealt with similarly). Remember that is adjacent to α if , that is, there exist paths p and such that
Let r be the path for which and write for all load profiles γ adjacent to α for which (including β). Following the definition of the base-exchange Markov chain, it then holds that, for any and , we have
Continuing the estimate in (B.2), we then get
B.2. First Inequality in (15)
In order to show the inequality in (15), we again use a Markov chain comparison between two chains on different state spaces. We want to show that
It is convenient to study these chains in terms of bipartite graphs with given degrees on node partition . For , we consider the degree sequence with xi = 1 for every , and the sequence with for , where one should remember that q is the number of strategies, that is, paths, available in the common strategy set (denoted by A here) of all players. It follows directly that there is a one-to-one correspondence between and , where, for a given strategy profile , there is an edge {i, p} if and only if si = p. (This is similar to the setting we consider in Section 6.) Given , our restriction chain can be interpreted as randomly selecting two edges from the bipartite graph Gs corresponding to the profile s and switching them if possible. That is, if we select {i, p} and with , we delete the edges {i, p} and and add the edges and (note that always holds as the nodes in B have degree one).
In order to introduce the random transposition Markov chain, we split up every node into nodes and consider bipartite graphs on two sets of n nodes and with , where every node has degree one. That is, every such graph is a perfect matching between and B. Note that there are precisely
We can now use a similar type of comparison argument as for the first inequality in (14) given earlier. We define the mapping for a given function by setting whenever for . Let σ be the uniform distribution over and let the uniform distribution over . Note that . It then follows that, for every , we have
This implies that we can take C = 1. As before, we can take a = 1 and B = 0.
1 If one drops the assumption that the cost functions are nonnegative (see Section 2), the parameter can be replaced by , where is the minimum value attained by Rosenthal’s potential. This also holds for all subsequent results.
2 In fact, the result generalizes directly to “symmetric congestion games for which Rosenthal’s potential is M-convex,” but we are not aware of any other interesting class of congestion games for which this is true (and, therefore, choose to formulate our results in terms of EP congestion games). M-convexity of Rosenthal’s potential already fails to hold for the smallest nonextension parallel network congestion game, which is the graph that has two graphs, both consisting of two parallel edges, in series.
3 In fact, no efficient algorithm was known for the approximate sampling and counting of the bases of a matroid.
References
- [1] (2008) On the convergence time of the best response dynamics in player-specific congestion games. Preprint, submitted May 8, https://arxiv.org/abs/0805.1130.Google Scholar
- [2] (2008) On the impact of combinatorial structure on congestion games. J. ACM 55(6):1–22.Google Scholar
- [3] (2018) Log-concave polynomials, entropy, and a deterministic approximation algorithm for counting bases of matroids. Proc. 59th Annual Sympos. Foundations Comput. Sci., 35–46.Google Scholar
- [4] (2018) Log-concave polynomials III: Mason’s ultra-log-concavity conjecture for independent sets of matroids. Preprint, submitted July 2, https://arxiv.org/abs/1807.00929.Google Scholar
- [5] (2019) Log-concave polynomials II: High-dimensional walks and an FPRAS for counting bases of a matroid. Proc. 51st Annual ACM SIGACT Sympos. Theory Comput., 1–12.Google Scholar
- [6] (2021) Log-concave polynomials IV: Approximate exchange, tight mixing times, and near-optimal sampling of forests. Proc. 53rd Annual ACM SIGACT Sympos. Theory Comput., 408–420.Google Scholar
- [7] (2019) Fast uniform generation of random graphs with given degree sequences. Proc. 60th Annual Sympos. Foundations Comput. Sci., 1371–1379.Google Scholar
- [8] (2009) On the inefficiency ratio of stable equilibria in congestion games. Internet and Network Economics, 545–552.Crossref, Google Scholar
- [9] (2013) Mixing time and stationary expected social welfare of logit dynamics. Theory Comput. Systems 53(1):3–40.Crossref, Google Scholar
- [10] (2018) Metastability of logit dynamics for coordination games. Algorithmica 80(11):3078–3131.Crossref, Google Scholar
- [11] (2015) Logit dynamics with concurrent updates for local interaction potential games. Algorithmica 73(3):511–546.Crossref, Google Scholar
- [12] (2016) Convergence to equilibrium of logit dynamics for strategic games. Algorithmica 76(1):110–142.Crossref, Google Scholar
- [13] (2007) Sampling binary contingency tables with a greedy start. Random Structures Algorithms 30(1–2):168–205.Crossref, Google Scholar
- [14] (1993) The statistical mechanics of strategic interaction. Games Econom. Behav. 5(3):387–424.Crossref, Google Scholar
- [15] (2006) Modified logarithmic Sobolev inequalities in discrete settings. J. Theoretical Probab. 19(2):289–336.Crossref, Google Scholar
- [16] (2018) Hodge-Riemann relations for Potts model partition functions. Preprint, submitted November 5, https://arxiv.org/abs/1811.01696.Google Scholar
- [17] (2020) Lorentzian polynomials. Ann. Math. 192(3):821–891.Crossref, Google Scholar
- [18] (2010) Behavioural game theory. Behavioural and Experimental Economics, 42–50.Google Scholar
- [19] (2011) Convergence to approximate Nash equilibria in congestion games. Games Econom. Behav. 71(2):315–327.Crossref, Google Scholar
- [20] (2019) Modified log-Sobolev inequalities for strongly log-concave distributions. Proc. 60th Annual Sympos. Foundations Comput. Sci., 1358–1370.Google Scholar
- [21] (2017) Totally unimodular congestion games. Proc. 28th Annual ACM-SIAM Sympos. Discrete Algorithms, 577–588.Google Scholar
- [22] (1996) Logarithmic Sobolev inequalities for finite Markov chains. Ann. Appl. Probab. 6(3):695–750.Crossref, Google Scholar
- [23] (2021) Sampling hypergraphs with given degrees. Discrete Math. 344(11):112566.Crossref, Google Scholar
- [24] (2007) Convergence time to Nash equilibrium in load balancing. ACM Trans. Algorithms 3(3):32.Crossref, Google Scholar
- [25] (2004) The complexity of pure Nash equilibria. Proc. 36th Annual ACM Sympos. Theory Comput., 604–612.Google Scholar
- [26] (2011) On best response dynamics in weighted congestion games with polynomial delays. Distributed Comput. 24(5):245–254.Crossref, Google Scholar
- [27] (1992) Balanced matroids. Proc. 24th Annual ACM Sympos. Theory Comput., 26–38.Google Scholar
- [28] (2013) Logit dynamics: A model for bounded rationality. ACM SIGecom Exchanges 12(1):34–37.Crossref, Google Scholar
- [29] (2010) Congestion games with linearly independent paths: Convergence time and price of anarchy. Theory Comput. Systems 47(1):113–136.Crossref, Google Scholar
- [30] (2015) Congestion games viewed from m-convexity. Oper. Res. Lett. 43(3):329–333.Crossref, Google Scholar
- [31] (1990) Computers and Intractability: A Guide to the Theory of NP-Completeness (W.H. Freeman & Co., New York).Google Scholar
- [32] (2004) Modified logarithmic Sobolev inequalities for some models of random walk. Stochastic Processes Appl. 114(1):51–79.Crossref, Google Scholar
- [33] (2009) On strong equilibria in the max cut game. Proc. Fifth Internat. Workshop Internet Network Econom., 608–615.Google Scholar
- [34] (2010) On multivariate Newton-like inequalities. Advances in Combinatorial Mathematics, 61–78.Google Scholar
- [35] (1974) Aspects of the theory of hypermatroids. Hypergraph Seminar (Springer, Berlin/Heidelberg), 191–213.Google Scholar
- [36] (2019) Modified log-Sobolev inequalities for strong-Rayleigh measures. Preprint, submitted February 7, https://arxiv.org/abs/1902.02775.Google Scholar
- [37] (1997) Strong equilibrium in congestion games. Games Econom. Behav. 21(1–2):85–101.Crossref, Google Scholar
- [38] (2005) Fast and compact: A simple class of congestion games. Proc. 20th Natl. Conf. Artificial Intelligence, 489–494.Google Scholar
- [39] (1993) Polynomial-time approximation algorithms for the Ising model. SIAM J. Comput. 22(5):1087–1116.Crossref, Google Scholar
- [40] (2004) A polynomial-time approximation algorithm for the permanent of a matrix with nonnegative entries. J. ACM 51(4):671–697.Crossref, Google Scholar
- [41] (2004) Elementary bounds on Poincaré and log-Sobolev constants for decomposable Markov chains. Ann. Appl. Probab. 14(4):1741–1765.Crossref, Google Scholar
- [42] (2021) Sampling from the Gibbs distribution in congestion games. Proc. 22nd ACM Conf. Econom. Comput., 679–680.Google Scholar
- [43] (2017) Potential function minimizers of combinatorial congestion games: Efficiency and computation. Proc. 18th ACM Conf. Econom. Comput., 223–240.Google Scholar
- [44] (2010) Glauber dynamics for the mean-field Ising model: Cut-off, critical power law, and metastability. Probab. Theory Related Fields 146(1–2):223–265.Crossref, Google Scholar
- [45] (2016) Tighter bounds on the inefficiency ratio of stable equilibria in load balancing games. Oper. Res. Lett. 44(5):645–648.Crossref, Google Scholar
- [46] (2000) Sampling adsorbing staircase walks using a new Markov chain decomposition method. Proc. 41st Annual Sympos. Foundations Comput. Sci., 492–502.Google Scholar
- [47] (2016) A behavioral study of “noise” in coordination games. J. Econom. Theory 162:195–208.Crossref, Google Scholar
- [48] (1973) Conditional logit analysis of qualitative choice behavior. Frontiers in Econometrics, 105–142.Google Scholar
- [49] (1984)
Asymptotics for 0-1 matrices with prescribed line sums . Enumeration and Design (Academic Press, Cambridge, MA), 225–238.Google Scholar - [50] (1989) On the expansion of 0-1 polytopes. J. Combin. Theory Ser. BGoogle Scholar
- [51] (1998) Discrete convex analysis. Math. Programming 83(1–3):313–371.Crossref, Google Scholar
- [52] (2003) Discrete convex analysis. SIAM Monograph on Discrete Mathematics and Applications (Society for Industrial and Applied Mathematics).Crossref, Google Scholar
- [53] (2009) Recent developments in discrete convex analysis. Research Trends in Combinatorial Optimization, 219–260.Google Scholar
- [54] (2018) The price of anarchy and stability in general noisy best-response dynamics. Internat. J. Game Theory 47(3):839–855.Crossref, Google Scholar
- [55] (1973) A class of games possessing pure-strategy Nash equilibria. Internat. J. Game Theory 2:65–67.Crossref, Google Scholar
- [56] (2010) Population Games and Evolutionary Dynamics (MIT Press, Cambridge, MA).Google Scholar
- [57] (2003)
Combinatorial optimization: Polyhedra and efficiency . Matroids, Trees, Stable Sets, vol. B, Algorithms and Combinatorics (Springer-Verlag, Berlin).Google Scholar - [58] (1992) Improved bounds for mixing rates of Markov chains and multicommodity flow. Combin. Probab. Comput. 1(4):351–370.Crossref, Google Scholar
- [59] (2008) Inapproximability of pure Nash equilibria. Proc. 40th Annual ACM Sympos. Theory Comput., 355–364.Google Scholar

