Open Access

Sampling from the Gibbs Distribution in Congestion Games

Pieter Kleer
[email protected]
https://orcid.org/0000-0003-4304-7282
Department of Econometrics and Operations Research, Tilburg University, 5037 AB Tilburg, Netherlands

Published Online: 4 Apr 2023, https://doi.org/10.1287/moor.2022.1322

Abstract

Logit dynamics is a form of randomized game dynamics in which players have a bias toward strategic deviations that give a higher improvement in cost. It is used extensively in practice. In congestion (or potential) games, the dynamics converge to the so-called Gibbs distribution over the set of all strategy profiles when interpreted as a Markov chain. In general, logit dynamics can converge slowly to the Gibbs distribution, but beyond that, not much is known about its algorithmic aspects, nor that of the Gibbs distribution. In this work, we are interested in the following two questions for congestion games: (i) Is there an efficient algorithm for sampling from the Gibbs distribution? (ii) If yes, does there also exist natural randomized dynamics that converge quickly to the Gibbs distribution? We first study these questions in extension parallel congestion games, a well-studied special case of symmetric network congestion games. As our main result, we show that there is a simple variation on the logit dynamics that converges quickly to the Gibbs distribution in such games. We also address the first question for the class of so-called capacitated uniform congestion games and the second question for max cut games played on a complete graph. To prove our results, we rely on the recent breakthrough work of Anari et al. (2019) regarding the approximate sampling of a base of a matroid according to a strongly log-concave probability distribution.

1. Introduction

Congestion games constitute a rich class of games that have been studied extensively since their introduction by Rosenthal [55]. An (unweighted) congestion game $Γ = (N, E, {(S_{i})}_{i \in N}, {(c_{e})}_{e \in E})$ consists of a set of players $N = \{1, \dots, n\}$ and a set of resources $E = \{1, \dots, m\}$. Every player i has a strategy set $S_{i} \subseteq 2^{E}$, in which each strategy is a subset of resources. Furthermore, every resource $e \in E$ is equipped with a cost function $c_{e} : \mathbb{R}_{\geq 0} \to \mathbb{R}$ that we assume to be nonnegative and nondecreasing. The goal of a player is to choose a strategy that minimizes the player’s total cost $C_{i} (s) = \sum_{e \in s_{i}} c_{e} (ℓ_{e} (s))$, where $ℓ_{e} (s)$ is the number of players using resource e in profile $s \in \times_{i} S_{i} = S$. A well-known example is the class of symmetric network congestion games, in which we are given a directed graph $G = (V, E)$ with origin $o \in V$ and destination $d \in V$. The common strategy set of all players is given by the set of all o, d-paths in G.

Rosenthal [55] proves that congestion games are (exact) potential games. He shows that the function $Φ : \times_{i} S_{i} \to R$ given by

Φ (s) = \sum_{e \in E} \sum_{k = 1}^{ℓ_{e} (s)} c_{e} (k),

satisfies, for every $s \in \times_{i} S_{i}$ and $s_{i}^{'} \in S_{i}$, the equality

C_{i} (s) - C_{i} (s_{i}^{'}, s_{- i}) = Φ (s) - Φ (s_{i}^{'}, s_{- i}) .

(1)

Here, $C_{i} (s_{i}^{'}, s_{- i})$ is used to denote the cost of player i in the strategy profile in which i chooses $s_{i}^{'}$ , and all other players choose their strategy in s. The function $Φ$ is often referred to as Rosenthal’s potential. The main implication of (1) is the existence of a pure Nash equilibrium (PNE): a strategy profile in which no player can deviate to another strategy and obtain an improved cost (Rosenthal [55]). This follows directly from the observation that better (or best) response dynamics converge to a PNE in a finite number of steps. Better response dynamics is defined as the procedure by which, in every step, precisely one player deviates to another strategy that yields an improved cost (until a pure Nash equilibrium is reached). For best response dynamics, the deviating player always deviates to a strategy that yields the greatest possible improvement in cost.
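Identity (1) can be checked by brute force on a small instance. The sketch below is only an illustration: the three-player game, its resource names, and its integer cost functions are hypothetical and chosen so that the comparison is exact.

```python
from itertools import product

def loads(profile, resources):
    """Number of players using each resource in the given strategy profile."""
    return {e: sum(1 for s_i in profile if e in s_i) for e in resources}

def player_cost(i, profile, resources, cost):
    """C_i(s): sum of c_e(load_e(s)) over the resources in player i's strategy."""
    load = loads(profile, resources)
    return sum(cost[e](load[e]) for e in profile[i])

def rosenthal_potential(profile, resources, cost):
    """Phi(s) = sum over resources e of c_e(1) + ... + c_e(load_e(s))."""
    load = loads(profile, resources)
    return sum(cost[e](k) for e in resources for k in range(1, load[e] + 1))

def check_identity(strategy_sets, resources, cost):
    """True if identity (1) holds for every profile and every unilateral deviation."""
    for profile in product(*strategy_sets):
        for i, options in enumerate(strategy_sets):
            for alt in options:
                deviated = profile[:i] + (alt,) + profile[i + 1:]
                lhs = (player_cost(i, profile, resources, cost)
                       - player_cost(i, deviated, resources, cost))
                rhs = (rosenthal_potential(profile, resources, cost)
                       - rosenthal_potential(deviated, resources, cost))
                if lhs != rhs:
                    return False
    return True

# Hypothetical toy instance (not from the paper): 3 players, resources a, b, c.
RESOURCES = ("a", "b", "c")
COST = {"a": lambda k: 2 * k, "b": lambda k: k * k, "c": lambda k: 3}
STRATEGIES = [
    (frozenset("a"), frozenset("bc")),
    (frozenset("ab"), frozenset("c")),
    (frozenset("a"), frozenset("b"), frozenset("c")),
]
IDENTITY_HOLDS = check_identity(STRATEGIES, RESOURCES, COST)
```

The check enumerates all profiles, so it is only practical for very small games.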

In the last two decades, the algorithmic aspects of (pure) Nash equilibria have been studied extensively in both general and special classes of congestion games. Two of the most prominent questions concerning pure Nash equilibria are the following.

1. Do (natural) player dynamics, such as better or best response dynamics, converge to a PNE in polynomial time?
2. If not, can one compute a PNE in polynomial time by other means?

Ieong et al. [38] consider the convergence time of better and best response dynamics for singleton congestion games. Here, every strategy of a player consists of a single resource. It is shown that better response dynamics converge in at most $O (n^{2} m)$ steps, and there exist instances in which convergence of better response dynamics might take $Ω (n^{\frac{3}{2}} m)$ steps (Ieong et al. [38]). The result of Ieong et al. [38] is generalized by Ackermann et al. [2] to matroid congestion games, in which the strategy set of every player is the collection of bases of a matroid on the ground set E (see Section 2.3 for a formal definition). They show that best response dynamics converge in at most $O (n^{2} m r)$ steps, where r is the maximum rank of the matroids that form the strategy sets of the players. They also show that matroids are, in a sense, the maximal structure that allows for polynomial time convergence of best response dynamics. Furthermore, Ackermann et al. [2] show that best response dynamics are not guaranteed to converge in polynomial time in symmetric network congestion games, in which the common strategy set of every player is the collection of paths in a given directed graph. Best response dynamics have also been studied in more general congestion games, such as with player-specific cost functions (Ackermann and Röglin [1]) or with weighted players (Even-Dar et al. [24], Fanelli and Moscardelli [26]). Approximate versions of response dynamics have also been studied; see, for example, Chien and Sinclair [19] and Skopalik and Vöcking [59].

The complexity of computing a pure Nash equilibrium is also well understood. Fabrikant et al. [25] show that computing a pure Nash equilibrium is complete for the complexity class polynomial local search (PLS). Hardness of computing a PNE is also shown for various special cases of congestion games (see, for example, Ackermann et al. [2] and Del Pia et al. [21]) as well as for the computation of approximate equilibria (Skopalik and Vöcking [59]). On the positive side, Fabrikant et al. [25] show that a PNE can be computed efficiently in symmetric network congestion games despite the fact that best response dynamics are not guaranteed to converge in polynomial time in these games (Ackermann et al. [2]). Del Pia et al. [21] generalize this result to totally unimodular congestion games; see also Kleer and Schäfer [43] for a more general framework.

In general, player dynamics roughly come in two flavors: a player deviates to another strategy according to either a deterministic rule or a probabilistic one. A well-known example of the latter case is noisy (randomized) best response dynamics, which have received a lot of attention in practice (see, for example, Camerer [18] and Mäs and Nax [47]) but seem hard to analyze from a theoretical perspective. Here, instead of making a deviation to another strategy according to a deterministic rule, a player chooses a strategy from the player’s set according to a probability distribution that usually puts relatively more weight on strategies that result in a lower cost. Randomized dynamics can be studied from two perspectives: as either a randomized alternative for deterministic dynamics converging to a pure Nash equilibrium or as a dynamical system on its own.

One well-known example of player dynamics that can be studied as a dynamical system and which is the topic of this paper is logit dynamics. It has received a lot of attention in various communities, such as evolutionary game theory (e.g., Sandholm [56]) and experimental economics (e.g., Camerer [18]). The procedure was introduced by Blume [14] as a form of randomized game dynamics in which players update their strategy according to a logit update rule (McFadden [48]). The logit dynamics for congestion games can be formulated as follows. For a given strategy profile s and fixed rationality level (or inverse temperature in the physics literature) parameter $T \geq 0$ , first choose a player $i \in N$ uniformly at random and then have player i choose a strategy $s_{i}^{'} \in S_{i}$ with probability

\frac{e^{- T Φ (s_{i}^{'}, s_{- i})}}{\sum_{t \in S_{i}} e^{- T Φ (t, s_{- i})}} .

(2)

Note that the denominator in (2) is a normalizing constant (often called the partition function).

Remark 1.

Equivalently, one can replace the $Φ (s_{i}^{'}, s_{- i})$ by $C_{i} (s_{i}^{'}, s_{- i})$ and, similarly, in the normalizing constant. We use $Φ$ as this is more convenient for our purposes. The equivalence follows from (1).

The rationality level $T \geq 0$ is used to model the amount of noise players believe there to be in the system (Auletta et al. [12]). When $T \to \infty$ , players effectively only assign positive probability to best responses, whereas when $T \to 0$ , the distribution in (2) approaches the uniform distribution over $S_{i}$ . Also note that the dynamics indeed puts relatively more probability mass on strategies that give a greater improvement in cost.
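A single step of the logit dynamics, as described by the update rule (2), can be sketched as follows. The function names and the callable `potential` are assumptions made for illustration; subtracting the minimum potential before exponentiating is a standard numerical safeguard and does not change the probabilities.

```python
import math
import random

def logit_probabilities(potentials, T):
    """Probabilities from (2): weight exp(-T * Phi) for each candidate strategy."""
    shift = min(potentials)  # shifting by the minimum avoids overflow and underflow
    weights = [math.exp(-T * (phi - shift)) for phi in potentials]
    total = sum(weights)
    return [w / total for w in weights]

def logit_step(profile, strategy_sets, potential, T, rng=random):
    """Pick a player uniformly at random, then resample that player's strategy via (2)."""
    i = rng.randrange(len(profile))
    candidates = [profile[:i] + (t,) + profile[i + 1:] for t in strategy_sets[i]]
    probs = logit_probabilities([potential(c) for c in candidates], T)
    return rng.choices(candidates, weights=probs, k=1)[0]
```

With T = 0 every candidate receives the same probability, and for large T almost all mass sits on the strategies minimizing the potential, matching the two limits discussed above.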

Logit dynamics gives rise to an ergodic, time-reversible Markov chain on the set $S$ of all strategy profiles that has the Gibbs distribution π given by

π (s) = \frac{e^{- T Φ (s)}}{\sum_{t \in \times_{i} S_{i}} e^{- T Φ (t)}}

for $s \in S$ as its unique stationary distribution. This simply means that, if one runs the logit dynamics for a sufficiently long time, the distribution over $S$ converges to the Gibbs distribution. The time it takes to converge to the stationary distribution is called the mixing time of the Markov chain.

Auletta et al. [12] interpret the Gibbs distribution as a dynamic equilibrium concept, which they dubbed the logit equilibrium; see also Ferraioli [28]. This concept is well-defined for general finite games; see, for example, Auletta et al. [12]. The goal of this work is to study algorithmic aspects of the logit equilibrium/Gibbs distribution in congestion games. Auletta et al. [12] give a tight bound for the mixing time of the logit dynamics in general potential games, which can be exponential in the quantity $T Φ_{\max}$ (as in Example 1). They also study graphical coordination games, for which they identify various types of behavior of the logit dynamics; we refer to Auletta et al. [12] for more details. Because of the general slow mixing, Auletta et al. [10] also study the concept of metastability, which means a Markov chain stays close to some distribution on a timescale shorter than the mixing time. More work on the logit dynamics in potential games by these authors can also be found in Auletta et al. [9, 11]. We also refer to the survey article of Ferraioli [28] and references therein. There are also various results addressing the inefficiency of “long-term” equilibria in the context of logit dynamics; see, for example, the works of Asadpour and Saberi [8], Mamageishvili and Penna [45], and Penna [54].

A special case of potential games for which logit-like dynamics is studied extensively is the Glauber dynamics for the Ising model, which, in game-theoretical terms, can be seen as logit dynamics for max-2-cut games that are described in Section 1.1. Whether the logit dynamics are rapidly mixing in this case depends on the parameter $T \geq 0$ and graph topology G; see, for example, the work of Levin et al. [44] and references therein. Jerrum and Sinclair [39] show that, nevertheless, there exists a polynomial time algorithm to sample from the Gibbs distribution for any rationality level $T \geq 0$ and any graph topology. This work is partially motivated by studying a similar question for special cases of congestion games as explained later.

As said before, in general, the logit dynamics converge slowly to the Gibbs distribution (Auletta et al. [12]); in particular, the number of steps needed might be $Ω (e^{T Φ_{\max}})$ , where $Φ_{\max}$ is the maximum value attained by Rosenthal’s potential. We next give a simple example illustrating this fact.

Example 1.

Consider the congestion game with players $N = \{1, 2\}$ and two resources $E = \{a, b\}$. Both resources $e \in \{a, b\}$ can be used by both players, that is, $S_{1} = S_{2} = \{\{a\}, \{b\}\}$, and have a cost function satisfying $c_{e} (1) = 0$ and $c_{e} (2) = ϕ$ for some $ϕ \geq 0$. Note that $Φ_{\max} = ϕ$.

We assume that T is fixed in this example, in particular, independent of $ϕ$. If $ϕ$ is large, the Gibbs distribution assigns weight close to 1/2 to the strategy profiles $(s_{1}, s_{2}) \in \{(a, b), (b, a)\}$ and weight close to $\frac{1}{2} e^{- T ϕ} \approx 0$ to $(s_{1}, s_{2}) \in \{(a, a), (b, b)\}$. We summarize these facts in Figure 1. Now, informally speaking, if we consider the logit dynamics with starting profile (a, b), then in order to reach the profile (b, a), we have to go through either the profile (a, a) or (b, b). The probability of transitioning to one of those profiles in one step of the logit dynamics is $O (e^{- ϕ T})$. As both profiles (a, b) and (b, a) appear with probability close to 1/2 in the Gibbs distribution, the probability of being in either of these profiles after a (large) number of steps of the logit dynamics should be approximately equal. However, escaping from the profile (a, b) happens with probability $O (e^{- ϕ T})$. The inverse $Ω (e^{T ϕ})$ of this probability then forms a lower bound on the number of steps of the logit dynamics needed to get close to the stationary Gibbs distribution.
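The quantities in Example 1 are small enough to compute exactly. The following sketch builds the Gibbs distribution and the one-step transition probabilities of the logit dynamics for this game; the numerical values of T and ϕ are hypothetical and only serve to make the escape probability visibly small.

```python
import math
from itertools import product

T, PHI = 1.0, 10.0          # hypothetical values of T and of the cost c_e(2) = phi
STRATS = ("a", "b")

def potential(profile):
    """Rosenthal's potential in Example 1: phi if both players share a resource, else 0."""
    return PHI if profile[0] == profile[1] else 0.0

profiles = list(product(STRATS, repeat=2))
Z = sum(math.exp(-T * potential(s)) for s in profiles)
gibbs = {s: math.exp(-T * potential(s)) / Z for s in profiles}

def transition(s, t):
    """Probability that one step of the logit dynamics moves from s to t."""
    prob = 0.0
    for i in range(2):                       # player i is chosen with probability 1/2
        if any(s[j] != t[j] for j in range(2) if j != i):
            continue                         # only player i may change strategy
        norm = sum(math.exp(-T * potential(s[:i] + (r,) + s[i + 1:])) for r in STRATS)
        prob += 0.5 * math.exp(-T * potential(t)) / norm
    return prob

# Total probability of leaving (a, b) in one step.
escape = sum(transition(("a", "b"), t) for t in profiles if t != ("a", "b"))
```

In this sketch, `escape` equals $e^{- T ϕ} / (1 + e^{- T ϕ})$ and the direct transition from (a, b) to (b, a) has probability zero, which is the bottleneck described in the example.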

What is causing the slow convergence in Example 1? The problem is that we need to use either the profile (a, a) or (b, b), both having very small probability in the Gibbs distribution, to move from (a, b) to (b, a). However, as it turns out, if one is additionally allowed, with some probability, to interchange the strategies of players 1 and 2, then the resulting dynamics converges quickly to the Gibbs distribution (being a special case of Theorem 1). Note that this makes it possible to transition directly between (a, b) and (b, a). This motivates the following question: Is there a (simple) Markov chain on $S$ that converges rapidly to the Gibbs distribution over $S$ at any rationality level T?

This question may be interpreted as the natural analogue of looking for other local search procedures converging quickly to a PNE when best/better response dynamics does not have this property.

When the answer to the preceding question is not directly obvious, one can take another step back and first ask whether it is at all possible to efficiently sample from the Gibbs distribution. Informally speaking, can we take “snapshots” (according to the Gibbs distribution) from the system in equilibrium in polynomial time? More formally speaking, does there exist an efficient algorithm to sample (approximately) a strategy profile $s \in S$ according to the Gibbs distribution over $S$ at any rationality level T? This question can be interpreted as a dynamic analogue of the second question posed earlier for the computation of pure Nash equilibria. That is, although (deterministic) better/best response dynamics might take a long time to converge to a pure Nash equilibrium, one still wants to know whether a PNE can be computed efficiently by other means. Similarly, although logit (or other natural) dynamics can take a long time to converge to the Gibbs distribution, we may still ask whether, by means of sampling, we can get an impression of what the Gibbs distribution over $S$ looks like. Note that a positive answer to the first question posed earlier gives a positive answer to the second question as one can simply run the Markov chain sufficiently long in order to generate a sample from the Gibbs distribution.

These questions are made precise in Section 2. We remark that one of the aspects that makes them nontrivial is the fact that we require the answers to hold for any rationality level T, that is, T is considered part of the input. For example, in Example 1, we could set $T = Θ (1 / ϕ)$ to circumvent the problem arising there.

**Figure 1. Overview of the four possible strategy profiles in Example 1.**

1.1. Our Contributions

In this work, we address the questions posed for various special cases of congestion games.

1.1.1. Extension Parallel Congestion Games.

The class of extension parallel (EP) congestion games is a well-studied special case of symmetric network congestion games; see, for example, Fotakis [29], Fujishige et al. [30], and Holzman and Law-Yone [37]. Here, the common strategy set of all players is given by the set of o, d-paths $P$ of an extension parallel graph; see Section 4 for a definition and example. Our main result is that there is a simple Markov chain converging quickly to the Gibbs distribution over $S$ , also implying that we can sample approximately from the Gibbs distribution. We show that, if one, in addition to the logit dynamics transitions, is allowed to randomly interchange the strategies of two players (akin to the explanation given after Example 1), the resulting Markov chain converges quickly to the Gibbs distribution. We call this the relaxed logit dynamics; see Section 4.2 for a formal definition. Note that Theorem 1 gives a doubly exponential improvement with respect to the dependence on $T Φ_{\max}$ compared with the lower bound as given in Example 1.

Theorem 1

(Informal). The relaxed logit dynamics for EP congestion games, at rationality level T, converges to a distribution ϵ-close to the Gibbs distribution in at most

n^{3} \left( \log n + \log \log | P | + \log \left( \frac{2 T Φ_{\max}}{ϵ^{2}} \right) \right)

steps, where n is the number of players, $| P |$ the number of paths in the EP graph, and $Φ_{\max}$ the maximum value attained by Rosenthal’s potential.¹

The notion of “ϵ-close” refers to the fact that the distribution seen after the indicated number of steps differs from the Gibbs distribution at most ϵ in total variation distance (see Section 2.5), a well-known distance measure for comparing probability distributions in Markov chain theory.
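To get a feeling for the gap between the bound in Theorem 1 and the lower bound $Ω (e^{T Φ_{\max}})$ for the plain logit dynamics, one can evaluate both expressions numerically. The sketch below simply plugs numbers into the informal statement; the function name and the parameter values are hypothetical, and any constants hidden in the formal statement are ignored.

```python
import math

def relaxed_logit_bound(n, num_paths, T, phi_max, eps):
    """Evaluate n^3 (log n + log log |P| + log(2 T Phi_max / eps^2)) with natural logs.

    Assumes num_paths > 1 so that log log |P| is defined.
    """
    return n ** 3 * (
        math.log(n)
        + math.log(math.log(num_paths))
        + math.log(2 * T * phi_max / eps ** 2)
    )

# Hypothetical parameters, chosen only for illustration.
upper = relaxed_logit_bound(n=10, num_paths=100, T=1.0, phi_max=50.0, eps=0.1)
lower_plain_logit = math.exp(1.0 * 50.0)   # order of the lower bound from Example 1
```

For these values the first quantity is on the order of $10^{4}$, whereas the second exceeds $10^{21}$.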

In a nutshell, Theorem 1 follows from the fact that, in EP congestion games, Rosenthal’s potential is M-convex as is shown by Fujishige et al. [30]. M-convexity is a property defined in the area of discrete convex analysis (Murota [51]) (see Section 2.3).² The link between M-convexity and sampling is, roughly speaking, established in a series of (breakthrough) papers by Anari et al. [3, 5, 6] through the theory of strongly log-concave (SLC) polynomials. The theory of strongly log-concave (or Lorentzian) polynomials dates back to the work of Gurvits [34] and is further developed by Anari et al. [3] and Brändén and Huh [17]. In particular, Anari et al. [5] give the first polynomial-time algorithm for approximately sampling and counting the number of bases of a given matroid, resolving also an old conjecture by Mihail and Vazirani [50]. In this work, we rely on the sampling result from Anari et al. [5], albeit for relatively simple matroid structures.

Before proving the preceding result, we also give another way of sampling from the Gibbs distribution in Section 4.1 (Theorem 4), which is essentially a more direct approach than the sampler induced by the Markov chain result. The high-level approach used for this more direct sampler is given in Section 3 (and also used in Section 6). Finally, we give an application of our results to the problem of (approximately) uniformly sampling pure Nash equilibria in EP congestion games in Section 4.3, which is, to the best of our knowledge, the first of its kind.

1.1.2. Max-k-Cut Game on a (Unweighted) Complete Graph.

A max-k-cut game (see, e.g., Gourvès and Monnot [33]) is given by an undirected graph $G = (V, E)$ whose nodes V are the players. Every $v \in V$ has strategy set $S_{v} = \{1, \dots, k\}$ whose elements are referred to as colors. The utility of a player is the number of neighbors that choose a different color. These games can be modeled as a special case of extension parallel congestion games when G is the complete graph. This is explained in Section 5.

As an application of Theorem 1, we show that the relaxed logit dynamics is rapidly mixing at any rationality level T for max-k-cut games when G is the complete graph (Corollary 2). This stands in stark contrast to known results regarding the logit dynamics on complete graphs for the case k = 2, whose mixing time depends heavily on the value of the rationality level T (Levin et al. [44]). We elaborate on this in Section 5.

1.1.3. Capacitated Uniform Congestion Games.

Finally, we study the class of so-called u-capacitated k-uniform congestion games for given $k = (k_{1}, \dots, k_{n})$ and $u = (u_{1}, \dots, u_{m})$. In such a game, the strategy set of player $i \in N$ is given by all subsets of E of size $k_{i}$. Furthermore, for every $e \in E$, we are given a capacity $u_{e}$ so that $c_{e} (x) = \infty$ whenever $x > u_{e}$.
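As a small illustration of this definition, the strategy set of a player can be enumerated with `itertools.combinations`, and a profile can be checked against the capacities. The helper names below are hypothetical, and explicit enumeration is only feasible for small instances.

```python
from itertools import combinations

def uniform_strategies(resources, k_i):
    """All subsets of the resource set of size k_i (the strategy set of player i)."""
    return [frozenset(c) for c in combinations(resources, k_i)]

def respects_capacities(profile, capacity):
    """True if no resource e is used by more than capacity[e] players."""
    load = {e: 0 for e in capacity}
    for s_i in profile:
        for e in s_i:
            load[e] += 1
    return all(load[e] <= capacity[e] for e in capacity)
```

For instance, with four resources and $k_{i} = 2$, a player has six strategies; a profile in which two players both pick a resource of capacity 1 violates the capacity constraint and has infinite cost.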

The motivation for studying these games comes from the class of base-matroid congestion games, in which the strategy set of every player is the set of bases $B_{i}$ of a given matroid $M_{i}$ over the ground set of resources E. It is well known that best response dynamics converge to a PNE in a polynomial number of steps in this class of games (Ackermann et al. [2]), and so, in particular, a PNE can be computed in polynomial time. Given the base-matroid sampling result of Anari et al. [5], a natural question that comes to mind is whether a similar result exists for sampling from the Gibbs distribution in base-matroid congestion games. Here, we give a first result addressing this question. (Note that, in our setting, the strategies of player $i \in N$ are the bases of the $k_{i}$-uniform matroid.)

We next explain why there is a need for capacity constraints in our results. A strategy profile s can be seen as a bipartite graph in which the nodes on one side correspond to the players, having degrees $k_{i}$, and the nodes on the other side correspond to the resources, having degrees $ℓ_{e} (s)$ (the resource load on e in profile s). For a given profile s with resource load profile $ℓ (s)$, the bipartite graph is obtained in the natural way: there is an edge between player i and resource e if player i uses resource e in profile s. Very roughly speaking, in order to apply the sampling result in Anari et al. [5], we establish strong log-concavity of a certain polynomial associated with the vectors k and u. For this, we rely on an asymptotic enumeration formula for the number of bipartite graphs with a given degree sequence, which, in terms of congestion games, gives the number of strategy profiles that have a given resource load profile $ℓ$. (Asymptotic enumeration of graphs with given degrees is studied extensively in the area of combinatorics.) The formula that we use is only valid for the range of $k = (k_{1}, \dots, k_{n})$ and resource load profiles $ℓ (s) = (ℓ_{e} (s))$ satisfying the imposed capacity constraints as given in Theorem 2. Our main result is as follows.

Theorem 2

(Informal). There is an (almost) polynomial time algorithm for approximately sampling from the Gibbs distribution in u-capacitated k-uniform congestion games assuming that $1 \leq k_{\max} u_{\max} = o (U^{1 / 4})$ when $n \to \infty$ , where $k_{\max} = \max_{i} k_{i}, u_{\max} = \max_{j} u_{j}$ and $U = \sum_{j} u_{j}$ .

See Remark 6 for an explanation of what we mean by “almost” polynomial time. The proof of Theorem 2 reveals an interesting connection between M-convexity and asymptotic enumeration formulas that might be of independent interest.

1.2. Technical Approach

In this section, we give an intuitive outline of the approach taken in order to prove the results as summarized in Section 1.1. For the sake of simplicity, we work with symmetric singleton congestion games in which the common strategy set contains every individual resource as a strategy, that is, $S_{i} = \{\{e_{1}\}, \{e_{2}\}, \dots, \{e_{m}\}\}$ for all $i \in N$. These games form a special case of EP and capacitated uniform congestion games. All technical notions not formally defined here can be found in Section 2.

Let us recall Example 1, which is a symmetric singleton congestion game with two resources. The cause of the long convergence time in this example is the fact that transitioning between strategy profiles that have the same load profile, that is, the same number of players on every resource, is difficult. To be precise, in Example 1, the profiles (a, b) and (b, a) have the same load profile, but they are different strategy profiles.

In order to still be able to sample from the Gibbs distribution, we take two approaches:

1. We first sample a load profile according to the correct probability and then sample a strategy profile corresponding to that load profile (Algorithm 1 in Section 3).
2. We augment the logit dynamics with additional transition possibilities, making it easier to transition between strategy profiles with the same load profile.

In the first approach, the “correct probability” with which a given load profile should be sampled is the sum of all the probabilities assigned to strategy profiles with the given profile. The resulting distribution is called the Gibbs distribution induced on the load profiles. For the applications in Sections 4–6, sampling a strategy profile with a given load profile can be done by either generating a random permutation of the players or relying on known sampling results in the literature. In particular, in the case of capacitated uniform congestion games, sampling a strategy profile for a given load profile corresponds to sampling a bipartite graph with a given degree sequence.
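For symmetric singleton congestion games, the two-stage idea of the first approach can be sketched directly. In such a game, the number of strategy profiles with load vector x is the multinomial coefficient $n! / (x_{1}! \cdots x_{m}!)$, so the induced weight of x is that count times $e^{- T Φ}$. The sketch below performs the first stage by brute-force enumeration of all load vectors, which is exponential in general and only meant as an illustration; it is not the efficient procedure developed in the paper, and all function names are hypothetical.

```python
import math
import random

def load_profiles(n, m):
    """All vectors x in Z_{>=0}^m with x_1 + ... + x_m = n."""
    if m == 1:
        return [(n,)]
    return [(k,) + rest for k in range(n + 1) for rest in load_profiles(n - k, m - 1)]

def potential_of_load(x, cost):
    """Rosenthal's potential, which depends on a profile only through its loads."""
    return sum(cost[e](k) for e in range(len(x)) for k in range(1, x[e] + 1))

def induced_weight(x, cost, T):
    """Unnormalized induced Gibbs weight: (#profiles with load x) * exp(-T * Phi)."""
    count = math.factorial(sum(x))
    for x_e in x:
        count //= math.factorial(x_e)
    return count * math.exp(-T * potential_of_load(x, cost))

def sample_profile(n, cost, T, rng=random):
    """Stage 1: draw a load vector; stage 2: assign players via a random permutation."""
    m = len(cost)
    candidates = load_profiles(n, m)
    weights = [induced_weight(x, cost, T) for x in candidates]
    x = rng.choices(candidates, weights=weights, k=1)[0]
    players = list(range(n))
    rng.shuffle(players)
    slots = [e for e in range(m) for _ in range(x[e])]
    profile = [None] * n
    for player, e in zip(players, slots):
        profile[player] = e            # player uses the single resource e
    return tuple(profile)
```

Here `cost` is a list of callables, one per resource, and a strategy profile is returned as a tuple assigning one resource index to each player.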

The main technical challenge lies in sampling a load profile with the correct probability. In the case of symmetric singleton congestion games, the collection of all load profiles can be modeled by the set $R = \{x \in ℤ_{\geq 0}^{m} : x_{1} + x_{2} + \dots + x_{m} = n\}$, where $x_{e}$ denotes the load on resource $e \in E$. The set R is a special case of a discrete polymatroid. In order to sample a (resource) load vector x from R with the correct probability, we consider a Markov chain that can transition between load vectors x and y for which $\sum_{i} | x_{i} - y_{i} | = 2$; that is, x and y differ in one unit of load on precisely two resources (but have the same total load). The transition probabilities are chosen in such a way that the stationary distribution of the Markov chain is precisely the Gibbs distribution induced on the load profiles.
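To make the kind of move described here concrete, the following sketch implements a chain on load vectors that shifts one unit of load between two resources. Note the substitution: it uses a generic Metropolis acceptance rule with a symmetric proposal, which does have the induced Gibbs distribution as its stationary distribution, but it is not the base-exchange-type chain analyzed in the paper and comes with no mixing time guarantee. The multinomial count assumes a symmetric singleton congestion game, and all names are hypothetical.

```python
import math
import random

def potential_of_load(x, cost):
    """Rosenthal's potential as a function of the load vector x."""
    return sum(cost[e](k) for e in range(len(x)) for k in range(1, x[e] + 1))

def log_weight(x, cost, T):
    """log of (n! / prod_e x_e!) * exp(-T * Phi(x)), computed via lgamma."""
    n = sum(x)
    log_count = math.lgamma(n + 1) - sum(math.lgamma(x_e + 1) for x_e in x)
    return log_count - T * potential_of_load(x, cost)

def metropolis_step(x, cost, T, rng=random):
    """Propose moving one unit of load from resource e to f; accept by the Metropolis rule."""
    m = len(x)
    e, f = rng.randrange(m), rng.randrange(m)
    if e == f or x[e] == 0:
        return x                       # void proposal, or the move would leave R
    y = list(x)
    y[e] -= 1
    y[f] += 1
    y = tuple(y)
    log_ratio = log_weight(y, cost, T) - log_weight(x, cost, T)
    if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
        return y
    return x
```

Every accepted move changes the load vector by exactly one unit on two resources, so consecutive states x and y satisfy $\sum_{i} | x_{i} - y_{i} | \in \{0, 2\}$ and the total load stays equal to n.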

The Markov chain that we consider over the set R is a “polymatroid version” of the base-exchange Markov chain studied by Anari et al. [5] for sampling a base of a given matroid. The state space of this base-exchange Markov chain is the set of all bases of a given matroid. It randomly exchanges some element of the current base with another element not in the base, resulting in another base of the matroid. This idea relies on the well-known base-exchange property of matroids (Schrijver [57]). In their breakthrough work Anari et al. [5] show this Markov chain to be rapidly mixing when the stationary distribution is strongly log-concave. Strong log-concavity is a property of a polynomial associated with the stationary distribution of a Markov chain (defined for both discrete polymatroids and matroids). We apply the result from Anari et al. [5] by giving a reduction from sampling a base of a polymatroid, that is, a load vector from R, to the problem of sampling a base of a matroid using a well-known construction of Helgason [35]. For this reduction, we rely on some properties that preserve strong log-concavity (Brändén and Huh [17]). Whereas this reduction works for arbitrary discrete polymatroids, in our applications, we use relatively simple polymatroid structures. What is left then is to show that the Gibbs distribution induced on the load profiles indeed satisfies the notion of strong log-concavity. Based on results in Brändén and Huh [17], it turns out that, for symmetric singleton congestion games, this is the case because of the fact that Rosenthal’s potential is a separable convex function.

In order to obtain our sampler for EP congestion games (Theorem 4), we use a similar approach based on load profiles, but instead, we rely on the M-convexity of Rosenthal’s potential for such games (Fujishige et al. [30]). For Theorem 2, we also use a similar approach.

In order to show that the logit dynamics converge quickly to the Gibbs distribution when we additionally allow the interchanging of the strategy of two players, we use a Markov chain decomposition argument. This approach results in the proofs of Theorem 5 and Corollary 2. The idea of Markov chain decomposition is to divide the state space into smaller sets. If one can prove that the Markov chain converges quickly to its (induced) stationary distribution on the smaller sets and that it is easy to move between these smaller sets, then the original chain converges quickly to its stationary distribution as well. We elaborate more on this approach before giving the proof of Theorem 5.

1.3. Discussion and Further Related Work

To the best of our knowledge, ours are the first (polynomial-time) sampling results for the Gibbs distribution in congestion games beyond the well-studied case of max-cut games on general graphs, also known as the Glauber dynamics for the Ising model. Given the extensive attention that logit dynamics has received in various communities as well as the topic of approximate sampling in theoretical computer science, we believe this to be an interesting line of work at the intersection of algorithmic game theory, combinatorics, and approximate sampling to pursue further.

In particular, for special cases of congestion games with a positive answer to questions 1 and 2 posed in the introduction, do there also exist positive answers for their dynamic analogues? As a concrete open question, we ask whether it is always possible to efficiently sample from the Gibbs distribution in general base matroid congestion games (Ackermann et al. [2]). If true, this provides an interesting (qualitative) game-theoretical generalization of the sampling result of Anari et al. [5].

We conclude this section with some more background and related work concerning the result of Anari et al. [5]. A long-standing open question (Mihail and Vazirani [50]) in the area of approximate sampling and counting asks if there exists a rapidly mixing Markov chain to sample, approximately uniformly at random, a base of a given matroid (which, in turn, also yields a polynomial-time approximation scheme for counting the number of bases of a given matroid).³ Anari et al. [5] show that the base-exchange Markov chain is indeed rapidly mixing. In fact, they prove the more general result that this chain is rapidly mixing, using appropriate transition probabilities, for any strongly log-concave stationary distribution; see Section 2.4. An (up to constant factors) tight mixing time bound for the base-exchange Markov chain is later given by Anari et al. [6]. Before Anari et al. [5], rapid mixing of the base-exchange Markov chain (with uniform stationary distribution) was only known for special cases. In particular, Feder and Mihail [27] prove rapid mixing for balanced matroids, using Sinclair’s [58] multicommodity flow method. Examples of balanced matroids are the graphic and regular matroids.

2. Preliminaries

In this section, we give all the necessary preliminaries regarding congestion games, strongly log-concave polynomials, and the relevant Markov chain notions and results. We start with some general notation.

All logarithms in this work have Euler’s number e as their base unless specified otherwise. For $k \in ℤ_{> 0}$ , we write $[k] = \{1, \dots, k\}$ . For two vectors $x, y \in ℤ^{n}$ , we write $x \leq y$ if $x_{i} \leq y_{i}$ for $i = 1, \dots, n$ , and $x < y$ if strict inequality holds for at least one i. Furthermore, with $| x | = \sum_{i = 1}^{n} | x_{i} |$ , we denote the modulus of x. We use ${(e_{i})}_{i = 1, \dots, n}$ to denote the standard basis of $R^{n}$ , that is, $e_{i} (j) = 1$ if $j = i$ and $e_{i} (j) = 0$ otherwise. For sets A, B, and C, we use the notation $A - B + C$ to denote the set $(A \setminus B) \cup C$ .

2.1. Congestion Games

A capacitated congestion game Γ is given by a tuple $(N, E, {(S_{i})}_{i \in N}, {(c_{e})}_{e \in E}, {(u_{e})}_{e \in E})$ , where $N = [n]$ is a finite set of players, $E = [m]$ a finite set of resources (or facilities), $S_{i} \subseteq 2^{E}$ the set of strategies of player $i \in N$ , and $c_{e} : ℤ_{\geq 0} \to ℚ$ the cost function of resource $e \in E$ that satisfies $c_{e} (x) = W$ whenever $x > u_{e}$ for $e \in E$ with W a sufficiently large number. Unless stated otherwise, the cost functions are assumed to be nonnegative and nondecreasing. Finally, u_e is a nonnegative integer modeling the capacity on resource $e \in E$ . If u_e = n for every resource $e \in E$ , we simply call Γ a congestion game. (Extension parallel and capacitated uniform congestion games are defined in Sections 4 and 6, respectively.) For a strategy profile $s = (s_{1}, \dots, s_{n}) \in \times_{i} S_{i} = S$ , we define $ℓ_{e} (s)$ to be the number of players using resource e, that is, $ℓ_{e} (s) = | {i \in N : e \in s_{i}} |$ . A game is called symmetric if $S_{i} = S_{j}$ for all $i, j \in [n]$ . We then write $P$ to denote the common strategy set of all players.

We call $ℓ (s) = {(ℓ_{e} (s))}_{e \in E}$ the resource load profile corresponding to strategy profile s. We say that a strategy profile $s \in S$ is feasible if $ℓ_{e} (s) \leq u_{e}$ for every $e \in E$ and write $S_{f}$ to denote the set of all feasible strategy profiles. We only consider games in which the set of feasible strategy profiles is nonempty. More generally, we say that $y \in N^{m}$ is a (feasible) resource load profile for $(N, E, {(S_{i})}_{i \in N}, {(u_{e})}_{e \in E})$ if there is some (feasible) strategy profile s such that $y = ℓ (s)$ . We write $S (y)$ for the set of strategy profiles $s \in \times_{i} S_{i}$ whose resource load profile is y.

Similarly, for symmetric congestion games, we define the notion of a strategy load profile that models how many players are using a strategy $p \in P$ in a given strategy profile $s \in P^{n}$ . More precisely, given a strategy profile $s \in P^{n}$ , we define $z_{p} (s) = | {i \in N : s_{i} = p} |$ as the number of players choosing strategy $p \in P$ in strategy profile s. The vector $z (s) = {(z_{p} (s))}_{p \in P}$ is called the strategy load profile of s. Similarly as for resource load profiles, we define (with a slight abuse of notation) $S (x)$ to be the set of strategy profiles s for which $x = z (s)$ .

The cost of player $i \in N$ under a strategy profile $s = (s_{1}, \dots, s_{n}) \in \times_{i} S_{i}$ is given by $C_{i} (s) = \sum_{e \in s_{i}} c_{e} (ℓ_{e} (s)) .$ A strategy profile $s \in S$ is called a (pure) Nash equilibrium if, for every $i \in N$ and every $s_{i}^{'} \in S_{i}$ , it holds that $C_{i} (s) \leq C_{i} (s_{i}^{'}, s_{- i})$ , where $(s_{i}^{'}, s_{- i})$ denotes the strategy profile in which player i plays $s_{i}^{'}$ and every other player $j \neq i$ plays s_j. We write $NE (Γ)$ to denote the set of all pure Nash equilibria of Γ.

We say that $Φ : \times_{i} S_{i} \to R$ is an exact potential function for a congestion game Γ if, for every strategy profile $s \in \times_{i} S_{i}$ , for every player $i \in N$ , and every unilateral deviation $s_{i}^{'} \in S_{i}$ of i it holds that $Φ (s) - Φ (s_{- i}, s_{i}^{'}) = C_{i} (s) - C_{i} (s_{- i}, s_{i}^{'}) .$ Rosenthal [55] shows that

Φ (s) = \sum_{e \in E} \sum_{k = 1}^{ℓ_{e} (s)} c_{e} (k)

(3)

is an exact potential function for any congestion game. Subsequently, we refer to this potential function as Rosenthal’s potential.
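As a small illustration of (3), the following Python sketch computes Rosenthal’s potential and checks the exact potential property by brute force. The toy game (three resources, the cost functions, and the strategy sets) and all function names are hypothetical and chosen only for illustration.

```python
from itertools import product

def loads(profile, resources):
    """Resource load profile: number of players using each resource."""
    return {e: sum(1 for s_i in profile if e in s_i) for e in resources}

def player_cost(i, profile, cost, resources):
    """C_i(s): sum of c_e(load_e(s)) over the resources e used by player i."""
    load = loads(profile, resources)
    return sum(cost[e](load[e]) for e in profile[i])

def rosenthal(profile, cost, resources):
    """Phi(s) = sum over e of sum_{k=1}^{load_e(s)} c_e(k)."""
    load = loads(profile, resources)
    return sum(cost[e](k) for e in resources for k in range(1, load[e] + 1))

# Hypothetical toy game with integer costs (so comparisons are exact).
resources = [0, 1, 2]
cost = {0: lambda x: x, 1: lambda x: 2 * x, 2: lambda x: 3}
strategies = [frozenset({0}), frozenset({1}), frozenset({0, 2})]

def is_exact_potential(n_players):
    """Check Phi(s) - Phi(s') = C_i(s) - C_i(s') for all unilateral deviations."""
    for s in product(strategies, repeat=n_players):
        for i in range(n_players):
            for dev in strategies:
                t = s[:i] + (dev,) + s[i + 1:]
                lhs = rosenthal(s, cost, resources) - rosenthal(t, cost, resources)
                rhs = (player_cost(i, s, cost, resources)
                       - player_cost(i, t, cost, resources))
                if lhs != rhs:
                    return False
    return True
```

The check enumerates all strategy profiles and is therefore only feasible for very small games.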

A function $ϕ : {0, \dots, n} \to R$ is called convex if $ϕ (i) - ϕ (i - 1) \leq ϕ (i + 1) - ϕ (i)$ for all $i = 1, \dots, n - 1$ . A function $ψ : {0, \dots, n}^{m} \to R$ is called separable convex if it is of the form $(x_{1}, \dots, x_{m}) \mapsto \sum_{j = 1}^{m} ψ_{j} (x_{j})$ , where the ψ_j, given by $x_{j} \mapsto ψ_{j} (x_{j})$ , are convex. We say that $ϕ$ is concave if $- ϕ$ is convex, and similarly, ψ is separable concave if $- ψ$ is separable convex. A simple but important observation that we use in this work is the fact that Rosenthal’s potential is a separable convex function when seen as a function from resource load profiles to the reals. That is, the function $\tilde{Φ} : ℤ_{\geq 0}^{m} \to R$ given by

\tilde{Φ} (β) = \sum_{e \in E} \sum_{k = 1}^{β_{e}} c_{e} (k)

(4)

for $β \in ℤ_{\geq 0}^{m}$ is separable convex.

Proposition 1.

If the cost functions ${(c_{e})}_{e \in E}$ are nondecreasing, then Rosenthal’s potential $\tilde{Φ}$ is a separable convex function.
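The following short derivation is not taken from the source; it is included as a sanity check of Proposition 1. Writing $ψ_{e}$ for the per-resource term of (4), the increments of $ψ_{e}$ are exactly the cost values, so nondecreasing costs give nondecreasing increments.

```latex
\psi_e(x) = \sum_{k=1}^{x} c_e(k), \qquad
\psi_e(i) - \psi_e(i-1) = c_e(i) \;\le\; c_e(i+1) = \psi_e(i+1) - \psi_e(i)
\quad \text{for } i \ge 1 .
```

Hence every $ψ_{e}$ is convex in the sense defined above, and $\tilde{Φ} (β) = \sum_{e \in E} ψ_{e} (β_{e})$ is separable convex.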

2.2. Gibbs Distribution and Logit Dynamics

The Gibbs distribution $π : S \to R_{\geq 0}$ over the strategy profiles of a congestion game Γ is given by

π (s) = \frac{e^{- T Φ (s)}}{Z},

where $Φ$ is Rosenthal’s potential, $T \geq 0$ is the rationality level, and Z is the normalizing constant (or partition function)

Z = \sum_{t \in S_{f}} e^{- T Φ (t)} .

The logit dynamics Markov chain (formal Markov chain definitions are given in Section 2.5) with current state $s \in S$ proceeds as follows:

Select a player $i \in N$ uniformly at random.
For $s_{i}^{'} \in S_{i}$ , transition to $(s_{i}^{'}, s_{- i})$ with probability $e^{- T Φ (s_{i}^{'}, s_{- i})} / Z_{i}^{'}$ , where $Z_{i}^{'} = \sum_{r \in S_{i}} e^{- T Φ (r, s_{- i})}$ is the normalizing constant.

It is a standard fact, which can be shown by using (1), that this Markov chain is reversible with respect to the Gibbs distribution.
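One transition of the logit dynamics described above can be sketched in Python as follows. This is a minimal floating-point sketch, not the rational-arithmetic implementation assumed in the running time analysis; the function names and the data layout (strategies as frozensets of resources, cost functions as callables) are assumptions made for illustration.

```python
import math
import random

def rosenthal(profile, cost, resources):
    """Rosenthal's potential of a strategy profile (tuple of resource sets)."""
    total = 0.0
    for e in resources:
        load = sum(1 for s_i in profile if e in s_i)
        total += sum(cost[e](k) for k in range(1, load + 1))
    return total

def logit_step(profile, strategy_sets, cost, resources, T, rng):
    """One transition of the logit dynamics from the given profile."""
    i = rng.randrange(len(profile))  # player chosen uniformly at random
    candidates = [profile[:i] + (r,) + profile[i + 1:] for r in strategy_sets[i]]
    phis = [rosenthal(c, cost, resources) for c in candidates]
    # Subtracting a constant from all potentials leaves the probabilities
    # unchanged and avoids underflow for large T.
    shift = min(phis)
    weights = [math.exp(-T * (phi - shift)) for phi in phis]
    return rng.choices(candidates, weights=weights, k=1)[0]
```

Each step evaluates the potential once per strategy of the selected player, so this sketch is only practical when the strategy sets are small or given explicitly.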

2.3. Matroids and M-Concavity

Let $E = [n]$ be a finite set called the ground set and $\mathcal{I} \subseteq 2^{E} = \{X : X \subseteq E\}$ a collection of subsets of E (called independent sets). The pair $N = (E, \mathcal{I})$ is a matroid if (i) $\emptyset \in \mathcal{I}$ ; (ii) if $A \in \mathcal{I}$ and $B \subseteq A$ , then $B \in \mathcal{I}$ ; and (iii) if $A, B \in \mathcal{I}$ and $| A | > | B |$ , then there exists an $a \in A \setminus B$ such that $B + a \in \mathcal{I}$ . An independent set $B \in \mathcal{I}$ of maximum size is called a basis. We use $\mathcal{B}$ to denote the set of all bases of $N$ . The set of bases $\mathcal{B}$ satisfies the so-called base-exchange property: if $B, B^{'} \in \mathcal{B}$ and $e \in B \setminus B^{'}$ , then there exists an $e^{'} \in B^{'} \setminus B$ such that $B + e^{'} - e \in \mathcal{B}$ . It satisfies the strong base-exchange property if both $B + e^{'} - e, B^{'} - e^{'} + e \in \mathcal{B}$ . The rank of a matroid is the common cardinality r of all bases in $\mathcal{B}$ . The $ℓ$ -truncation $M_{ℓ} = (E, \mathcal{I}_{ℓ})$ of a matroid $M = (E, \mathcal{I})$ is the matroid with $A \in \mathcal{I}_{ℓ}$ if and only if $A \in \mathcal{I}$ and $| A | \leq ℓ$ . The partition matroid is given by a disjoint partition $E = E_{1} \cup \dots \cup E_{q}$ of the ground set E and upper bounds $u_{i}$ for $i = 1, \dots, q$ . A set $A \subseteq E$ is independent if and only if $| A \cap E_{i} | \leq u_{i}$ for all $i = 1, \dots, q$ . The k-uniform matroid is the matroid in which $A \subseteq E$ is independent if and only if $| A | \leq k$ .

A discrete polymatroid is a finite set of vectors $R \subset ℤ_{\geq 0}^{n}$ with the properties that (i) $0 \in R$ ; (ii) if $y \in R$ and $x \leq y$ , then $x \in R$ ; and (iii) if $x, y \in R$ with $| y | > | x |$ , then there is a vector $w \in R$ such that $x < w < \max \{x, y\}$ (in which the maximum is taken coordinate-wise). The set of bases $B_{R}$ is given by all maximal vectors in R that have a common modulus r. A polymatroid satisfies the base-exchange property: if $x, y \in B_{R}$ and $x_{i} > y_{i}$ , then there exists an index j with $y_{j} > x_{j}$ and $x - e_{i} + e_{j} \in B_{R}$ .

Finally, as a generalization of discrete polymatroids, we describe the notion of M-convexity for functions (Murota [51, 53]). As we mostly work with its negated counterpart of M-concavity, we describe this first. Let $ν : ℤ_{\geq 0}^{n} \to R \cup \{- \infty\}$ be a function. The effective domain of ν is given by

dom (ν) = \{α \in ℤ_{\geq 0}^{n} : ν (α) > - \infty\} .

The function ν is called $M^{♯}$ -concave if it satisfies the (symmetric) exchange property: for any $α, β \in dom (ν)$ and any $i \in [n]$ satisfying $α_{i} > β_{i}$ , there exists a $j \in [n]$ such that $α_{j} < β_{j}$ and

ν (α) + ν (β) \leq ν (α - e_{i} + e_{j}) + ν (β + e_{i} - e_{j}) .

(5)

It is well known that a separable concave function is $M^{♯}$ -concave (Murota [52]). The function ν is called M-concave if it is $M^{♯}$ -concave and, in addition, there is an $r \in ℤ_{\geq 0}$ such that $dom (ν) \subseteq \{α : \sum_{i} α_{i} = r\}$ . A function $ν : ℤ_{\geq 0}^{n} \to R \cup \{\infty\}$ is called M-convex if $- ν$ is M-concave.

2.4. Strongly Log-Concave Polynomials

We consider polynomials $p \in R [x_{1}, \dots, x_{n}]$ with nonnegative real coefficients. For a vector $β = (β_{1}, \dots, β_{n}) \in ℤ_{\geq 0}^{n}$ , we write

\partial^{β} = \prod_{i = 1}^{n} \partial_{x_{i}}^{β_{i}}

to denote the partial differential operator that differentiates a function $β_{i}$ times with respect to $x_{i}$ for $i = 1, \dots, n$ . For $α \in ℤ_{\geq 0}^{n}$ , we write $x^{α}$ to denote $\prod_{i = 1}^{n} x_{i}^{α_{i}}$ . Furthermore, we write $α! = \prod_{i} α_{i}!$ , and for $α, κ \in ℤ_{\geq 0}^{n}$ with $α_{i} \leq κ_{i}$ for all i, we write

\binom{κ}{α} = \prod_{i = 1}^{n} \binom{κ_{i}}{α_{i}} .

For a constant $t \in ℤ_{\geq 0}$ with $t \geq \max_{i} α_{i}$ , we write $\binom{t}{α} = \prod_{i = 1}^{n} \binom{t}{α_{i}}$ . Let $κ \in ℤ_{\geq 0}^{n}$ and $K = \times_{i} \{0, \dots, κ_{i}\}$ . Let $w : K \to R_{\geq 0}$ be a weight function. The generating polynomial of w is given by

g_{κ} (x) = \sum_{α \in K} w (α) \cdot x^{α} .

The support of $g_{κ}$ is the set $supp (g_{κ}) = \{α \in K : w (α) > 0\}$ . The generating polynomial $g_{κ}$ is called d-homogeneous if $| α | = \sum_{i} α_{i} = d$ for all $α \in K$ with $w (α) > 0$ . It is called multiaffine if every variable $x_{i}$ appears with multiplicity at most one in every monomial of $g_{κ}$ . For example, $q (x_{1}, x_{2}) = x_{1} x_{2}$ is multiaffine, but $r (x_{1}, x_{2}) = x_{1}^{2} + x_{1} x_{2}$ is not as the multiplicity of x₁ in the first monomial is two. Finally, the elementary symmetric polynomial of degree d, for $κ = (1, 1, \dots, 1)$ , is given by

h_{κ} (x) = \sum_{α \in \{0, 1\}^{n} : | α | = d} x^{α} .

Definition 1

(Strong Log-Concavity; Gurvits [34]). A polynomial $p \in R [x_{1}, \dots, x_{n}]$ with nonnegative coefficients is called log-concave on a subset $S \subseteq R_{\geq 0}^{n}$ if the Hessian $\nabla^{2} \log (p)$ is negative semidefinite on S. A polynomial p is called strongly log-concave (SLC) on S if, for any $β \in ℤ_{\geq 0}^{n}$ , we have that $\partial^{β} p$ is log-concave on S.

For convenience, the zero polynomial is defined to be strongly log-concave always. It is interesting to note that if a d-homogeneous multiaffine polynomial p is SLC, then the support of p must form the collection of bases of a matroid, and more generally, if a (not multiaffine) homogeneous polynomial is SLC, its support forms an M-convex set (Brändén and Huh [17]). Finally, if the generating polynomial $g_{κ}$ is strongly log-concave, then the probability distribution $π (α) \propto w (α)$ is called strongly log-concave.

Remark 2.

The definition of strong log-concavity is not really needed in this work but is included for completeness. In our proofs, we essentially only rely on properties of SLC polynomials from the literature that are reviewed as follows. For homogeneous generating polynomials, the notion of strong log-concavity is equivalent to that of a polynomial being Lorentzian (Brändén and Huh [17]) or completely log-concave (Anari et al. [4]). These equivalences are shown in Brändén and Huh [16].

We next state all properties of SLC polynomials that are used in this work. First, it is easy to check that the SLC property is preserved under multiplication with a nonnegative scalar, which we state here for sake of reference.

Proposition 2

(Brändén and Huh [17]). If $p \in R [x_{1}, \dots, x_{n}]$ is SLC and $γ \in R_{\geq 0}$ , then $γ \cdot p$ is SLC.

We continue with the polarization operator defined by Brändén and Huh [17]. The polarization operator introduces auxiliary variables in order to turn p into a multiaffine polynomial over a larger set of variables. (We elaborate on polarization in Section 3.1.) Formally, following Brändén and Huh [17], for $κ \in ℤ_{\geq 0}^{n}$ , let

R_{κ} [x_{i}] = \{\text{polynomials in } R [x_{i}]_{1 \leq i \leq n} \text{ of degree at most } κ_{i} \text{ in } x_{i} \text{ for every } i\},

and

R_{κ}^{a} [x_{i j}] = \{\text{multiaffine polynomials in } R [x_{i j}]_{1 \leq i \leq n, 1 \leq j \leq κ_{i}}\} .

The polarization operator $Π_{κ} : R_{κ} [x_{i}] \to R_{κ}^{a} [x_{i j}]$ replaces every factor $x^{α}$ with

\frac{1}{\binom{κ}{α}} \prod_{i = 1}^{n} (\text{elementary symmetric polynomial of degree } α_{i} \text{ in the variables } \{x_{i j}\}_{1 \leq j \leq κ_{i}}) .

Proposition 3

(Brändén and Huh [17]). If p is d-homogeneous and SLC over $R_{κ} [x_{i}]$ , then $Π_{κ} (p)$ is d-homogeneous and SLC over $R_{κ}^{a} [x_{i j}]$ .

We conclude this section by stating a large class of homogeneous polynomials that are known to be strongly log-concave.

Proposition 4

(Brändén and Huh [17]). For $κ \in ℤ_{\geq 0}^{n}$ and $w : K \to R_{\geq 0}$ a nonnegative weight function, consider

g_{κ} (x) = \sum_{α \in K} \frac{w (α)}{α!} x^{α},

(6)

and assume that $g_{κ}$ is d-homogeneous. Let $ν : ℤ_{\geq 0}^{n} \to R \cup \{- \infty\}$ be defined by $ν (α) = \log (w (α))$ for $α \in K$ and $ν (α) = - \infty$ otherwise. If ν is M-concave, then $g_{κ}$ is SLC.

2.5. Markov Chains

Let $M = (Ω, P)$ be an ergodic and time-reversible Markov chain with state space Ω, transition matrix P, and stationary distribution π (which is unique because of the ergodicity assumption). Reversibility means that $π (x) P (x, y) = π (y) P (y, x)$ for all $x, y \in Ω$ . We write $P^{t} (x, \cdot)$ for the distribution over Ω at time step t with initial state $x \in Ω$ . The total variation distance $d_{T V} (π, σ)$ of two distributions π and σ over Ω is defined as

d_{T V} (π, σ) = \max_{S \subseteq Ω} | π (S) - σ (S) | = \frac{1}{2} \sum_{x \in Ω} | π (x) - σ (x) |,

where, for a distribution σ over Ω, we write $σ (S) = \sum_{x \in S} σ (x)$ . We say that two distributions π and σ are ϵ-close if $d_{T V} (π, σ) \leq ϵ$ . The total variation distance of the distribution $P^{t} (x, \cdot)$ from π at time t with initial state x is denoted by $Δ_{x} (t)$ . The mixing time of $M$ with initial state $x \in Ω$ is

τ_{x} (ϵ) = \min \{t : Δ_{x} (t^{'}) \leq ϵ \text{ for all } t^{'} \geq t\} .

Informally, $τ_{x} (ϵ)$ is the number of steps until the Markov chain is ϵ-close to its stationary distribution given that it starts in x. A treatment of some more advanced Markov chain notions is given in Appendix A, including the definition of the modified log-Sobolev constant $ρ = ρ (P)$ , which can be used to bound the mixing time of a Markov chain. (The definition of ρ is deferred to Appendix A as we actually do not need it in the main body; we only rely on lower bounds on this constant.) It holds that (see, e.g., Bobkov and Tetali [15])

τ_{x} (ϵ) \leq \frac{1}{ρ (P)} (\log \log π {(x)}^{- 1} + \log (\frac{1}{2 ϵ^{2}})) .

(7)
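To make the definitions of total variation distance and mixing time concrete, the following Python sketch computes $τ_{x} (ϵ)$ for a small chain given as an explicit transition matrix. It relies on the standard fact that the total variation distance to the stationary distribution is nonincreasing in t, so the first time the distance drops below ϵ is the mixing time. The function names and the two-state example chain are hypothetical.

```python
def tv_distance(p, q):
    """Total variation distance: half the L1 distance between two distributions."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))

def step(dist, P):
    """One step of the chain: row vector dist multiplied by transition matrix P."""
    n = len(P)
    return [sum(dist[x] * P[x][y] for x in range(n)) for y in range(n)]

def mixing_time(P, pi, x, eps, t_max=10_000):
    """Smallest t with TV(P^t(x, .), pi) <= eps, searched up to t_max steps."""
    dist = [1.0 if y == x else 0.0 for y in range(len(P))]
    for t in range(t_max + 1):
        if tv_distance(dist, pi) <= eps:
            return t
        dist = step(dist, P)
    raise ValueError("chain did not mix within t_max steps")

# Hypothetical example: a lazy walk on two states with uniform stationary law.
P = [[0.75, 0.25], [0.25, 0.75]]
pi = [0.5, 0.5]
```

For this example chain, the distance from state 0 after t steps is $2^{- (t + 1)}$ , so `mixing_time(P, pi, 0, 0.01)` evaluates to 6.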

2.5.1. Markov Chain Decomposition.

Let $Ω = Ω_{1} \cup \dots \cup Ω_{m}$ be a disjoint partition of the state space Ω. Following Martin and Randall [46], consider $\bar{π} (i) = π (Ω_{i}) = \sum_{x \in Ω_{i}} π (x)$ and let $\bar{P} : [m] \times [m] \to [0, 1]$ be defined by

\bar{P} (i, j) = \bar{π} {(i)}^{- 1} \sum_{x \in Ω_{i}, y \in Ω_{j}} π (x) P (x, y) .

The Markov chain on $[m]$ with transition matrix $\bar{P}$ is called the projection chain on the partition ${Ω_{i}}_{i = 1, \dots, m}$ . It is time-reversible with respect to the distribution $\bar{π}$ over $[m]$ . For $i \in [m]$ the restriction chain on Ω_i has transition matrix $P_{i} : Ω_{i} \times Ω_{i} \to [0, 1]$ given by

P_{i} (x, y) = \begin{cases} P (x, y) & \text{if } x \neq y, \\ 1 - \sum_{z \in Ω_{i} \setminus \{x\}} P (x, z) & \text{if } x = y . \end{cases}

It is time-reversible with respect to the distribution $π_{i} (x) = π (x) / \bar{π} (i)$ for $x \in Ω_{i}$ . In Appendix A.1, we give a Markov chain decomposition result based on the modified log-Sobolev constant ρ.

2.5.2. Base-Exchange Markov Chain.

Let $N$ be a matroid, and let π be an SLC probability distribution over the set of bases $\mathcal{B}$ given by $π (α) \propto w (α)$ for some nonnegative weight function $w : K \to R$ . Here, $π (α) \propto w (α)$ means that $π (α) = w (α) / (\sum_{β \in K} w (β))$ . The base-exchange Markov chain on $\mathcal{B}$ is defined by the following transitions, in which $B \in \mathcal{B}$ is the current state of the Markov chain:

Select an element $e \in B$ uniformly at random and remove it.
Pick a base $B^{'} \in \mathcal{B}$ with $B^{'} \supset B - e$ with probability $\propto w (B^{'})$ among all such bases $B^{'}$ .

It is not hard to see, using the base-exchange property, that this procedure defines an ergodic, time-reversible Markov chain with stationary distribution π. Anari et al. [4] show that this chain is rapidly mixing for any matroid $N$ . In particular, in a recent follow-up work, they give a tight mixing time bound (Anari et al. [6]).
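The two transitions of the base-exchange Markov chain can be sketched in Python for the special case of a uniform matroid, in which every k-subset of the ground set is a base. This is an illustrative sketch only: the function name is hypothetical, the weight function is passed in as a callable, and the code does not check that the induced distribution is strongly log-concave.

```python
import random

def base_exchange_step(base, ground, weight, rng):
    """One step of the base-exchange chain on the bases of a uniform matroid.

    `base` is a frozenset of size k; every k-subset of `ground` is a base.
    """
    e = rng.choice(sorted(base))  # element to remove, chosen uniformly at random
    rest = base - {e}
    # All bases containing `rest`: add back any element outside `rest`
    # (this includes e itself, so the chain may stay in place).
    candidates = [rest | {f} for f in ground if f not in rest]
    weights = [weight(c) for c in candidates]
    return rng.choices(candidates, weights=weights, k=1)[0]
```

For a general matroid, the list of candidates would have to be restricted to those sets `rest | {f}` that are bases, which requires an independence oracle that is not modeled here.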

Theorem 3

(Anari et al. [6]). Let $N$ be a matroid of rank r, and let π be an SLC probability distribution over the set of bases $\mathcal{B}$ given by $π (α) \propto w (α)$ for some weight function $w : \mathcal{B} \to R_{\geq 0}$ . Then, the mixing time of the base-exchange random walk satisfies $τ (ϵ) \leq O (r \log (r / ϵ)) .$

We note that the mixing time is independent of the size n of the ground set E of the matroid $N$ as well as the stationary distribution π. In this work, Theorem 3 is essentially only applied to partition and uniform matroids. Furthermore, Cryan et al. [20] show that the modified log-Sobolev constant of the base-exchange random walk satisfies $ρ \geq 1 / r$ , where r is the rank of the matroid.

2.6. Sampling Algorithms

Consider a class of (capacitated) congestion games $Γ = (N, E, {(S_{i})}_{i \in N}, {(c_{e})}_{e \in E}, {(u_{e})}_{e \in E})$ with n players and m resources and cost functions $c_{e} : ℤ_{\geq 0} \to ℚ$ for $e \in E$ . Let $w : S \to ℚ_{\geq 0}$ be a weight function. In this work, an algorithm for sampling $s \in S$ according to a distribution ϵ-close to π with $π (s) \propto w (s)$ is said to run in (randomized) polynomial time if the number of arithmetic operations can be upper bounded by a polynomial in $n, m, \log (1 / ϵ), \max_{e, j} \log (c_{e} (j))$ and $\max_{s} \log (w (s))$ . (We assume the strategy sets can be represented in a compact form.) The generation of a uniform random 0/1 bit is considered to be one arithmetic operation.

Remark 3

(Real Numbers). In this work, we use Markov chains whose transition probabilities are, in general, not rational numbers (in particular for the Gibbs distribution). Whenever we use real numbers, it is implicitly assumed that we use sufficiently accurate approximations to these numbers. All our results remain valid when (real-valued) transition probabilities are replaced by sufficiently accurate rational approximations. We note that, roughly speaking, whenever we want to generate Markov chain transitions with probabilities proportional to $e^{- T Φ (s)}$ for $s \in S$ , our algorithms run in pseudo-polynomial time in terms of the values of the cost functions of the congestion game under consideration.

All our algorithmic results are based on running Markov chains for a sufficiently long time. We usually write our running time bounds as the product of two factors: the number of steps that we need to run the Markov chain (before it is close to stationarity) and the complexity of implementing one such step. In particular, in all cases, the transition probabilities of one step are determined by sequences of rational numbers $a = (a_{1}, \dots, a_{z})$ and $q = (q_{1}, \dots, q_{z})$ , and we want to sample an index $i \in [z]$ with probability

\frac{q_{i} e^{a_{i}}}{\sum_{j = 1}^{z} q_{j} e^{a_{j}}} .

(8)

Here, z as well as the encoding size of the q_i are $poly (n, m)$ . We refer to $C = C (n, m, a)$ as the computational complexity of sampling an index i according to (approximations of) the preceding probabilities in order not to overload our theorem statements. We say that probabilities of the form (8) are suitable.
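Sampling an index according to probabilities of the form (8) can be sketched as follows. The sketch uses floating-point arithmetic and subtracts the largest exponent before exponentiating so that large values of $a_{i}$ do not overflow; it is not the rational approximation scheme assumed in the complexity bound C, and both function names are hypothetical.

```python
import math
import random

def suitable_probabilities(a, q):
    """Return the values q_i * exp(a_i) / sum_j q_j * exp(a_j).

    Shifting all exponents by max(a) rescales numerator and denominator by
    the same factor, so the probabilities are unchanged.
    """
    shift = max(a)
    weights = [q_i * math.exp(a_i - shift) for a_i, q_i in zip(a, q)]
    total = sum(weights)
    return [w / total for w in weights]

def sample_index(a, q, rng):
    """Sample an index (0-based) according to the probabilities above."""
    probs = suitable_probabilities(a, q)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

With `a = [0.0, 0.0]` and `q = [1, 3]`, the two probabilities are 0.25 and 0.75.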

2.7. Bipartite Graphs

An (undirected) bipartite graph $G = (A \cup B, F)$ is given by two disjoint sets of nodes $A = \{a_{1}, \dots, a_{n}\}$ and $B = \{b_{1}, \dots, b_{m}\}$ with $F \subseteq \{\{a, b\} : a \in A, b \in B\}$ . We say that G has degree sequence $(x, y)$ if $d (a_{i}) = x_{i}$ for $i = 1, \dots, n$ and $d (b_{j}) = y_{j}$ for $j = 1, \dots, m$ , where d(v) denotes the degree of node v in G. We write $G (x, y)$ for the set of all bipartite graphs on $A \cup B$ with degree sequence $(x, y)$ .

3. General Approach

In Section 2.1, we give two possible definitions for the load profile of a strategy profile $s = (s_{1}, \dots, s_{n}) \in \times_{i} S_{i}$ . For general congestion games, we define the resource load profile $ℓ (s) = (ℓ_{e} (s))$ that keeps track of how many players use a particular resource e in s. For symmetric congestion games, we may in addition consider the strategy load profile $z (s) = {(z_{t} (s))}_{t \in P}$ that keeps track of how many players use a particular strategy t from the common strategy set $P$ .

A general approach for sampling a strategy profile according to the Gibbs distribution in Sections 4.1 and 6.1 is to first sample a (resource or strategy) load profile α according to approximately the right probability and then sample a strategy profile $s \in S (α)$ uniformly at random. Remember that the set $S (α)$ is used to denote all strategy profiles with the given (resource or strategy) load profile. The approach is summarized in Algorithm 1, in which we give the formulation for resource load profiles because we use Rosenthal’s potential $\tilde{Φ} : ℤ_{\geq 0}^{m} \to R$ as given in (4). The formulation for strategy load profiles is exactly the same with $\tilde{Φ}$ replaced by $Φ$ as given in (3). We note that, in order to sample a strategy profile $s \in S (α)$ uniformly at random in symmetric games, it suffices to generate a random permutation of the players in $N = \{1, \dots, n\}$ .

Sampling a load profile with the correct probability in our applications corresponds to sampling a base of a discrete polymatroid according to a strongly log-concave distribution. In order to do this, we present a reduction of this problem to that of sampling a base of a matroid according to a strongly log-concave distribution in Section 3.1 (after which we can rely on Theorem 3).

Algorithm 1

(Gibbs Sampler for Congestion Game Γ)

Input: Congestion game Γ, rationality level $T \geq 0$ , and $ϵ \geq 0$ .

Output: Strategy profile $s \in S$ according to distribution $\bar{π}$ that is ϵ-close to Gibbs distribution π at rationality level T.

Step I: Sample load profile α according to a distribution $σ^{'}$ that is ϵ-close to $π^{'}$ given by
$π^{'} (α) \propto | S (α) | e^{- T \tilde{Φ} (α)} .$
Step II: Sample strategy profile $s \in S (α)$ (approximately) uniformly at random.
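The two steps of Algorithm 1 can be illustrated on a tiny symmetric game in which every strategy is a single resource (parallel links). In the sketch below, Step I is carried out exactly by enumerating all strategy load profiles, which takes exponential time and merely stands in for the Markov chain used in this work; Step II shuffles the players. All names are hypothetical, and load profiles are encoded as vectors of length q with entries in $\{0, \dots, n\}$ .

```python
import math
import random
from itertools import product

def load_profiles(n, q):
    """All strategy load profiles: vectors in {0..n}^q whose entries sum to n."""
    return [a for a in product(range(n + 1), repeat=q) if sum(a) == n]

def potential(alpha, cost):
    """Rosenthal's potential when strategy p is the single resource p."""
    return sum(cost[p](k) for p, a_p in enumerate(alpha) for k in range(1, a_p + 1))

def multinomial(alpha):
    """n! / (alpha_1! * ... * alpha_q!), the number of profiles with load alpha."""
    out = math.factorial(sum(alpha))
    for a_p in alpha:
        out //= math.factorial(a_p)
    return out

def gibbs_sample(n, cost, T, rng):
    profiles = load_profiles(n, len(cost))
    # Step I (exact, by enumeration): weight proportional to |S(alpha)| exp(-T Phi).
    weights = [multinomial(a) * math.exp(-T * potential(a, cost)) for a in profiles]
    alpha = rng.choices(profiles, weights=weights, k=1)[0]
    # Step II: uniformly random strategy profile with strategy load profile alpha.
    s = [p for p, a_p in enumerate(alpha) for _ in range(a_p)]
    rng.shuffle(s)
    return tuple(s)
```

Here `cost` is a list of q cost functions, and the returned tuple assigns a strategy index to each of the n players.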

3.1. Sampling Bases of Discrete Polymatroids

In this section, we describe how to generate a discrete polymatroid base according to a strongly log-concave distribution over the set of all polymatroid bases by reducing it to the problem of generating a base of a matroid. This follows more or less directly from Theorem 3 by using the notion of polarization. Polarization can be seen as a functional version of the classic reduction from discrete polymatroids to matroids as given by Helgason [35]. (See also Schrijver [57, chapter 44.6b] for this reduction.)

For a polymatroid $R \subset ℤ_{\geq 0}^{n}$ , consider a d-homogeneous strongly log-concave polynomial

g_{R} (x_{1}, \dots, x_{n}) = \sum_{α \in B_{R}} w (α) x^{α} .

Here, $g_{R}$ has positive coefficients, and its support is the set of bases $B_{R}$ . Consider the matroid $N_{R} = (E, \mathcal{I})$ on ground set $E = \{(i, j) : 1 \leq i \leq n, 1 \leq j \leq d\}$ , where $I \in \mathcal{I}$ if and only if the vector $α (I) \in ℤ_{\geq 0}^{n}$ given by $α_{e} = | \{f : (e, f) \in I\} |$ satisfies $α (I) \in R$ . The fact that $N_{R}$ is indeed a matroid follows directly from the fact that R is a polymatroid. Note that, for a given $α \in R$ , we have

| \{I : α (I) = α\} | = \binom{d}{α} .

(9)

We slightly abuse notation here and write $d = (d, \dots, d) \in ℤ^{n}$ for the all-d vector.
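The counting identity (9) can be checked by brute force for small instances. The following Python sketch enumerates all subsets I of the ground set of pairs (i, j) of the right size and counts those whose vector α(I) equals a given α; the function names are hypothetical.

```python
from itertools import combinations
from math import comb

def count_lifts(alpha, d):
    """Number of sets I of pairs (i, j), 0 <= j < d, with alpha(I) = alpha."""
    n = len(alpha)
    ground = [(i, j) for i in range(n) for j in range(d)]
    count = 0
    for I in combinations(ground, sum(alpha)):
        # alpha(I)_e counts the pairs in I whose first coordinate is e.
        if all(sum(1 for (i, _) in I if i == e) == alpha[e] for e in range(n)):
            count += 1
    return count

def binom_vec(d, alpha):
    """Product over i of binom(d, alpha_i), the right-hand side of (9)."""
    out = 1
    for a in alpha:
        out *= comb(d, a)
    return out
```

For example, with d = 3 and α = (2, 1), both functions return 9.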

Then, with $Π (B_{R})$ the set of bases of $N_{R}$ , the polarization $Π (g)$ of g can be written as

\begin{array}{rl} Π (g) (y_{11}, \dots, y_{1 d}, \dots, y_{n 1}, \dots, y_{n d}) & = \sum_{B \in Π (B_{R})} {\binom{d}{α (B)}}^{- 1} w (α (B)) \cdot y^{B} \\ & = \sum_{B \in Π (B_{R})} w_{Π} (α (B)) \cdot y^{B}, \end{array}

where, for $B \in Π (B_{R})$ , we define

w_{Π} (α (B)) = {\binom{d}{α (B)}}^{- 1} w (α (B)) .

(10)

Polarization should be interpreted as spreading out the weight $w (α)$ for $α \in R$ equally over all bases $B \in {A : α (A) = α} \subseteq Π (B_{R})$ . Proposition 3 implies that $Π (g)$ is also strongly log-concave.

Example 2.

Let $p (x_{1}, x_{2}) = x_{1}^{2} x_{2} + x_{1} x_{2}$ so that $supp (p) = \{(2, 1), (1, 1)\}$ and take $d = (2, 2)$ . Then,

\begin{array}{rl} Π_{d} (p) & = \frac{1}{2} x_{11} x_{12} (x_{21} + x_{22}) + \frac{1}{4} (x_{11} + x_{12}) (x_{21} + x_{22}) \\ & = \frac{1}{2} x_{11} x_{12} x_{21} + \frac{1}{2} x_{11} x_{12} x_{22} + \frac{1}{4} x_{11} x_{21} + \frac{1}{4} x_{11} x_{22} + \frac{1}{4} x_{12} x_{21} + \frac{1}{4} x_{12} x_{22} . \end{array}

Note that, looking at the support of p, we have $\binom{d}{α} = 2$ monomials corresponding to $α = (2, 1)$ and $\binom{d}{α} = 4$ monomials corresponding to $α = (1, 1)$ .
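The polarization in Example 2 can be reproduced mechanically. In the Python sketch below, a polynomial is encoded as a dictionary from exponent vectors to coefficients, and a multiaffine monomial as a frozenset of index pairs (i, j) standing for the variables $x_{i j}$ . This encoding and the function name are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb

def polarize(poly, kappa):
    """Polarize {alpha: coeff} into {frozenset of (i, j): coeff}.

    Each monomial x^alpha is replaced by 1 / binom(kappa, alpha) times the
    product over i of the elementary symmetric polynomial of degree alpha_i
    in the variables x_{i1}, ..., x_{i kappa_i}.
    """
    out = {}
    for alpha, coeff in poly.items():
        scale = Fraction(1)
        for k_i, a_i in zip(kappa, alpha):
            scale /= comb(k_i, a_i)
        # For each variable i, choose which alpha_i of its kappa_i copies appear.
        choices = [list(combinations(range(1, k_i + 1), a_i))
                   for k_i, a_i in zip(kappa, alpha)]
        for pick in product(*choices):
            mono = frozenset((i + 1, j) for i, js in enumerate(pick) for j in js)
            out[mono] = out.get(mono, 0) + coeff * scale
    return out

# The polynomial x_1^2 x_2 + x_1 x_2 of Example 2, polarized with d = (2, 2).
example = polarize({(2, 1): 1, (1, 1): 1}, (2, 2))
```

The result has six monomials: two with coefficient 1/2 and four with coefficient 1/4, so the coefficients still sum to 2, as for the original polynomial.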

Corollary 1 now follows directly from Theorem 3. It says the following. Suppose the current state of the base-exchange Markov chain after t steps, starting from any state $B_{0} \in Π (B_{R})$ , is the base $B \in Π (B_{R})$ , and suppose we output the polymatroid base $α (B)$ . If t is large enough such that we are in state B with probability close to $w_{Π} (α (B))$ for every $B \in Π (B_{R})$ , then $α (B)$ is output with probability close to $w (α (B))$ with $w_{Π} (α (B))$ and $w (α (B))$ as in (10).

Corollary 1.

Let π be the distribution over $B_{R}$ with $π (α) \propto w (α)$ , and let $Π_{π}$ be the distribution over $Π (B_{R})$ with $Π_{π} (B) \propto w_{Π} (α (B))$ . Let $B \in Π (B_{R})$ , and let $Π_{σ}^{t} = P^{t} (B, \cdot)$ be the distribution over $Π (B_{R})$ after t steps of the base-exchange Markov chain $M = (Π (B_{R}), P)$ . Let σ^t be the induced distribution over $B_{R}$ given by $σ^{t} (α) = \sum_{B : α (B) = α} Π_{σ}^{t} (B) .$ If $d_{T V} (Π_{σ}^{t}, Π_{π}) \leq ϵ$ , then $d_{T V} (σ^{t}, π) \leq ϵ$ .

Proof.

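Assume, without loss of generality, that w is normalized so that $π (α) = w (α)$ for $α \in B_{R}$ ; by (9) and (10), it then also holds that $Π_{π} (B) = w_{Π} (α (B))$ for $B \in Π (B_{R})$ . Writing $σ = σ^{t}$ and $σ^{'} = Π_{σ}^{t}$ , we have

\begin{array}{rl} 2 d_{T V} (σ, π) & = \sum_{α \in B_{R}} | \sum_{B : α (B) = α} σ^{'} (B) - w (α) | \\ & = \sum_{α \in B_{R}} | \sum_{B : α (B) = α} [σ^{'} (B) - {\binom{d}{α (B)}}^{- 1} w (α)] | \quad \text{(using (9))} \\ & \leq \sum_{B \in Π (B_{R})} | σ^{'} (B) - w_{Π} (α (B)) | \quad \text{(triangle inequality)} \\ & = 2 d_{T V} (σ^{'}, Π_{π}) \leq 2 ϵ . \end{array}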

This gives the desired result. □

Remark 4.

It is possible to define a more direct Markov chain on the set of all bases of a given discrete polymatroid and prove that this chain is rapidly mixing (also based on Theorem 3), but this is not needed for our results.

4. Extension Parallel Congestion Games

An extension parallel congestion game is a symmetric congestion game in which the common strategy set $P$ of the players consists of the o, d-paths in a (directed) extension parallel network $G = (V, A)$ with source o and target d. For two given networks $G_{i} = (V_{i}, A_{i})$ with source o_i and target d_i for i = 1, 2, let $G^{'} = (V_{1} \cup V_{2}, A_{1} \cup A_{2})$ be the union of G₁ and G₂. The parallel composition of G₁ and G₂ is the network obtained by identifying o₁ with o₂ and d₁ with d₂. These nodes are the source and target of $G^{'}$ , respectively. The series composition of G₁ and G₂ is obtained by identifying d₁ with o₂. The node o₁ becomes the source of $G^{'}$ and d₂ its target. An extension parallel network consists of (i) a single arc (o, d), (ii) two extension parallel networks in parallel, or (iii) a single arc in series with an extension parallel network. An example is given in Figure 2. For a given extension parallel graph G, we use $P = {p_{1}, \dots, p_{q}}$ to denote all o, d-paths in G. Note that, for an extension parallel network, we have $q \leq | A | = m$ .

**Figure 2. Example of an extension parallel network.**

In this section we are always working with strategy load profiles (and so these are sometimes simply referred to as load profiles). The set of all possible strategy load profiles is denoted by $L = {α \in {1, \dots, q}^{n} : | α | = n}$ . We consider the potential $Φ : L \to ℚ$ defined by $Φ (α) = Φ (s)$ for some $s \in S (α)$ . This is well-defined as the potential value is the same for any choice of $s \in S (α)$ . The main result that we need in this section is the M-convexity of Rosenthal’s potential for EP congestion games.

Proposition 5

(Fujishige et al. [30]). Let Γ be an extension parallel congestion game. Then, the potential $Φ : L \to ℚ$ defined by $Φ (α) = Φ (s)$ for $s \in S (α)$ is M-convex.

The M-convexity of $Φ$ follows, in a nutshell, from the fact that the collection of sets $\{Q_{a} : a \in A\}$ , where $Q_{a} = \{p \in P : a \in p\}$ is the set of all paths containing arc a, forms a laminar family in the case of extension parallel networks. This implies that the potential $Φ$ is laminar convex, which yields the M-convexity property of the potential. We refer the reader to Fujishige et al. [30] for more details.

Before giving our main result as sketched in Section 1.1, we first give a more direct approach for sampling from the Gibbs distribution in EP congestion games.

4.1. Sampling from the Gibbs Distribution

As mentioned in Section 3 and sketched in Algorithm 1, the high-level algorithmic idea for sampling a strategy profile according to the Gibbs distribution consists of first sampling a load profile α with the correct probability and then a strategy profile from $S (α)$ uniformly at random. The main result of this section based on this approach is stated in Theorem 4.

Theorem 4.

Let $ϵ > 0$ and $T \geq 0$ , and let Γ be an extension parallel congestion game with n players. There is a randomized algorithm $A$ with output distribution $\bar{π}$ over $P^{n}$ that is ϵ-close to the Gibbs distribution π at rationality level T and runs in (expected) time $O (C \cdot n \log (n / ϵ))$ with C the complexity of implementing one step of a base-exchange Markov chain with suitable probabilities (see Section 2.6).

Proof.

Note that, for an extension parallel congestion game, the number of strategy profiles corresponding to a given load profile α is $| S (α) | = n! / α!$ . (This is the number of ways in which we can assign n labeled balls to bins $b_{1}, \dots, b_{q}$ , where b_i contains α_i balls.)
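The count $| S (α) | = n! / α!$ can be confirmed by brute force for small parameters. The sketch below enumerates all assignments of n players to q strategies and groups them by strategy load profile; the function names are hypothetical, and load profiles are encoded as vectors of length q.

```python
from collections import Counter
from itertools import product
from math import factorial

def count_profiles_by_load(n, q):
    """Map each strategy load profile to the number of profiles inducing it."""
    counts = Counter()
    for s in product(range(q), repeat=n):
        alpha = tuple(s.count(p) for p in range(q))
        counts[alpha] += 1
    return counts

def multinomial(alpha):
    """n! / (alpha_1! * ... * alpha_q!) with n = sum(alpha)."""
    out = factorial(sum(alpha))
    for a_p in alpha:
        out //= factorial(a_p)
    return out
```

For n = 4 players and q = 3 strategies, every entry of `count_profiles_by_load(4, 3)` agrees with `multinomial`.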

Lemma 1.

The n-homogeneous generating polynomial

g (x_{1}, \dots, x_{n}) = \sum_{α \in {[q]}^{n} : | α | = n} | S (α) | e^{- T Φ (α)} x^{α} = \sum_{α \in {[q]}^{n} : | α | = n} \frac{n!}{α!} e^{- T Φ (α)} x^{α}

(11)

is strongly log-concave. Hence, the distribution $π^{'}$ over $L$ given by $π^{'} (α) \propto \frac{n!}{α!} e^{- T Φ (α)}$ for $α \in L$ is strongly log-concave.

Proof.

Strong log-concavity is preserved under scalar multiplication by Proposition 2, so it suffices to show that

\frac{1}{n!} g (x) = \sum_{α \in {[q]}^{n} : | α | = n} \frac{e^{- T Φ (α)}}{α!} x^{α}

is strongly log-concave. In turn, by Proposition 4, it is sufficient to show that $\log (e^{- T Φ (α)}) = - T Φ (α)$ is an M-concave function on its effective domain. As $T \geq 0$ , this is equivalent to showing that $Φ (α)$ is M-convex on its effective domain $L = \{α \in {[q]}^{n} : | α | = n\}$ . This follows from Proposition 5. □

Because of Lemma 1, the polarization $Π (g)$ of g in (11) is also strongly log-concave. The support of $Π (g)$ can be seen as the bases of the n-uniform matroid $N$ on ground set ${(i, j) : 1 \leq i, j \leq n}$ . Our algorithm now consists of first running the base-exchange Markov chain for $O (n \log (n / ϵ))$ steps, starting from any initial base. We output $α (B)$ , where B is the state we are in after the $O (n \log (n / ϵ))$ steps that were carried out. The resulting distribution $σ^{'} (α)$ over $L$ satisfies $d_{T V} (σ^{'}, π^{'}) \leq ϵ$ by Corollary 1. We then uniformly at random choose a strategy profile from $S (α)$ . Let $\bar{π}$ be the resulting output distribution over $S$ . It remains to show that $d_{T V} (π, \bar{π}) \leq ϵ$ , which the following calculation shows. Note that

\bar{π} (s) = \frac{α!}{n!} σ^{'} (α),

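where $α = ℓ (s)$ is the load profile corresponding to strategy profile s. Then, with Z denoting the normalizing constant of the Gibbs distribution (as in Section 2.2), so that $π (s) = e^{- T Φ (s)} / Z$ and $π^{'} (α) = \frac{n!}{α!} e^{- T Φ (α)} / Z$ , we have

\begin{aligned} \sum_{s \in S} | \bar{π} (s) - π (s) | & = \sum_{α} \sum_{s \in S : ℓ (s) = α} | \bar{π} (s) - π (s) | \\ & = \sum_{α} \sum_{s \in S : ℓ (s) = α} \left| \frac{α!}{n!} σ^{'} (α) - \frac{e^{- T Φ (α)}}{Z} \right| \\ & = \sum_{α} \frac{n!}{α!} \left| \frac{α!}{n!} σ^{'} (α) - \frac{e^{- T Φ (α)}}{Z} \right| \\ & = \sum_{α} \left| σ^{'} (α) - \frac{n!}{α!} \frac{e^{- T Φ (α)}}{Z} \right| = \sum_{α} | σ^{'} (α) - π^{'} (α) | \\ & \leq 2 ϵ . \end{aligned}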

We conclude by analyzing the running time of the algorithm. One step of the base-exchange Markov chain can be implemented in time O(C) by definition. Generating an $s \in S (α)$ uniformly at random can be done by generating a uniform random permutation μ of $\{1, \dots, n\}$ . We set $s_{i} = p_{1}$ for players $i = μ (1), \dots, μ (α_{1})$ , $s_{i} = p_{2}$ for players $i = μ (α_{1} + 1), \dots, μ (α_{1} + α_{2})$ , and so on. Generating a uniform random permutation can be done in time $O (n \log (n))$ . □
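The second stage of the algorithm, drawing a strategy profile uniformly at random from $S (α)$ via a random permutation, can be sketched as follows. This is an illustration only: it assumes players and paths are indexed from 0, and `sample_profile` is a hypothetical name, not part of the paper.

```python
import random


def sample_profile(alpha, rng=random):
    """Draw s uniformly from S(alpha) using a uniform random permutation.

    alpha[j] is the number of players that must use path j. The first
    alpha[0] players in the shuffled order get path 0, the next alpha[1]
    players get path 1, and so on.
    """
    n = sum(alpha)
    mu = list(range(n))
    rng.shuffle(mu)  # uniform random permutation of the players
    s = [None] * n
    start = 0
    for path, load in enumerate(alpha):
        for player in mu[start:start + load]:
            s[player] = path
        start += load
    return s
```

Every profile in $S (α)$ arises from exactly $α!$ permutations, which is why the output is uniform over $S (α)$.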

4.2. Relaxed Logit Dynamics

In this section, we give the proof of Theorem 1, reformulated in Theorem 5. Formally, the relaxed logit dynamics Markov chain with current state $s \in P^{n}$ proceeds by

With probability 1/2: Select two players $i, j \in N$ uniformly at random and transition to $s^{'}$ given by
$s_{k}^{'} = \begin{cases} s_{i} & \text{if } k = j \\ s_{j} & \text{if } k = i \\ s_{k} & \text{otherwise.} \end{cases}$
With probability 1/2: Perform a transition according to the logit dynamics (as in Section 2.2).

We note that, for any symmetric congestion game, this is a well-defined ergodic, time-reversible Markov chain with the Gibbs distribution as stationary distribution.
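One transition of the relaxed logit dynamics can be sketched in code for the special case of a singleton (parallel-link) congestion game. Several choices here are assumptions made for the illustration and are not taken from the text: resources are indexed from 0, `costs[e]` is a function giving $c_{e} (x)$, the two players in the swap step are drawn independently (so they may coincide), and the logit step picks a player uniformly and then a new resource with probability proportional to $\exp (- T Φ (p^{'}, s_{- i}))$.

```python
import math
import random


def potential(s, costs):
    """Rosenthal's potential of a singleton game: sum_e sum_{k<=x_e} c_e(k)."""
    loads = [0] * len(costs)
    for e in s:
        loads[e] += 1
    return sum(costs[e](k) for e, x in enumerate(loads) for k in range(1, x + 1))


def relaxed_logit_step(s, costs, T, rng=random):
    """Return the next state of the relaxed logit dynamics from state s."""
    s = list(s)
    n, q = len(s), len(costs)
    if rng.random() < 0.5:
        # Swap step: interchange the strategies of two random players.
        i, j = rng.randrange(n), rng.randrange(n)
        s[i], s[j] = s[j], s[i]
    else:
        # Logit step: one random player resamples a resource.
        i = rng.randrange(n)
        pots = []
        for e in range(q):
            s[i] = e
            pots.append(potential(s, costs))
        base = min(pots)  # shift exponents to avoid underflow for large T
        weights = [math.exp(-T * (p - base)) for p in pots]
        s[i] = rng.choices(range(q), weights=weights)[0]
    return s
```

The swap step never changes the load profile, so it only moves the chain within a set $S (α)$; the logit step is what moves it between adjacent load profiles.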

Theorem 5.

For an extension parallel congestion game Γ with common strategy set $P$ and initial state $s \in P^{n}$ , the mixing time of the relaxed logit dynamics Markov chain at rationality level $T \geq 0$ satisfies

τ_{s} (ϵ) \leq n^{3} (\log n + \log \log | P | + \log (\frac{2 T Φ_{\max}}{ϵ^{2}})),

where $Φ_{\max} = \max_{r \in S} Φ (r)$ is the maximum value attained by Rosenthal’s potential over $P^{n}$ .

Compared with the mixing time of the (nonrelaxed) logit dynamics for general games (Auletta et al. [12]), we get a doubly exponential improvement in terms of the dependence on $T Φ_{\max}$ (at the cost of a small polynomial increase in the dependence on n).

The proof of Theorem 5 relies on a Markov chain decomposition argument. The idea underlying Markov chain decomposition is to divide the state space into a disjoint union of smaller sets. If one can show that the Markov chain is rapidly mixing when restricted to every smaller set and that it is easy to transition between the different smaller sets, then the original chain is rapidly mixing as well. In our setting, every load profile is identified with a smaller set, which is formed by all strategy profiles inducing the given load profile. To prove rapid mixing of the smaller sets, we compare the restricted chains to the so-called random transposition Markov chain, which is known to be rapidly mixing; see, for example, Goel [32]. To prove rapid mixing between the smaller sets, that is, between load profiles, we use again the connection to the base-exchange Markov chain but in a different way than in the proof of Theorem 4. In order to prove rapid mixing of all the Markov chains involved, we show that their modified log-Sobolev constant can be bounded appropriately. Recall that this is a quantity that can be used to upper bound the mixing time of a Markov chain (see Appendix A).

Proof of Theorem 5.

We use a Markov chain decomposition argument based on the two operations that define the relaxed logit dynamics Markov chain. We first partition the state space $S = P^{n}$ naturally based on load profiles by setting $Ω_{α} = S (α)$ for $α \in L$ , where, as before, $L = \{α \in {[q]}^{n} : | α | = n\}$ . Our proof approach is to apply the Markov chain decomposition theorem of Hermon and Salez [36] as given in Theorem A.1. In particular, for this, we need to bound the modified log-Sobolev constants of the projection and restriction chains. We start with the modified log-Sobolev constant $\bar{ρ}$ of the projection chain.

The projection chain $\bar{P}$ has state space $L$ and stationary distribution $\bar{π} (α) \propto | S (α) | e^{- T Φ (α)}$ for $α \in L$ . Let $α, β \in L$ be such that $\sum_{e} | α_{e} - β_{e} | = 2$ ; that is, there exist paths p and $p^{'}$ such that

α_{e} = \begin{cases} β_{e} + 1 & \text{if } e = p \\ β_{e} - 1 & \text{if } e = p^{'} \\ β_{e} & \text{if } e \in E \setminus \{p, p^{'}\} . \end{cases}

In this case, we say that α and β are adjacent load profiles differing on paths p and $p^{'}$ . Note that, if $s \in S (α)$ and $s^{'} \in S (β)$ are such that they differ by a deviation of some player i from path p to path $p^{'}$ , then $P (s, s^{'}) = \frac{1}{2 n} \frac{\exp (- T Φ (p^{'}, s_{- i}))}{Z^{'}}$ , and this expression is the same for every such player i with s_i = p. Moreover, note that $π (x) = π (y)$ for any two strategy profiles $x, y \in S (α)$ .

For some fixed choice of strategy profile $x \in Ω_{α}$ and player i using path p, that is, $x_{i} = p$ , the transition probabilities for adjacent load profiles can then be seen to equal (where Z and $Z^{'}$ are the normalizing constants as in Section 2.2)

\begin{aligned} 2 \bar{P} (α, β) = \frac{2}{\bar{π} (α)} \sum_{x \in Ω_{α}, y \in Ω_{β}} π (x) P (x, y) & = \frac{π (x)}{\bar{π} (α)} \frac{α_{p}}{n} | S (α) | \frac{\exp (- T Φ (β))}{Z^{'}} \\ & = \frac{\exp (- T Φ (α)) / Z}{| S (α) | \exp (- T Φ (α)) / Z} \frac{α_{p}}{n} | S (α) | \frac{\exp (- T Φ (β))}{Z^{'}} \\ & = \frac{α_{p}}{n} \frac{\exp (- T Φ (β))}{Z^{'}} \\ & = 2 α_{p} P (s, s^{'}), \end{aligned}

(12)

where the last equality is true for any choice of $s \in S (α)$ and $s^{'} \in S (β)$ . Note that this implies that, for any $α, β \in L$ , $s \in S (α)$ and $s^{'} \in S (β)$ , we have

\frac{P (s, s^{'})}{\bar{P} (α, β)} = \frac{1}{α_{p}} \geq \frac{1}{n} .

(13)

The lower bound of $1 / n$ serves as our lower bound on χ as defined in Appendix A.1. In order to bound the modified log-Sobolev constant of the projection chain, one can use a comparison argument (as defined in Appendix A.2) with the base-exchange Markov chain on the support of the polarization $Π (g_{Γ})$ of $g_{Γ}$ as in (11). In this section, the support corresponds to the set of bases of an n-uniform matroid. In particular, it holds that

\bar{ρ} \geq \frac{1}{n} \cdot ρ (Π (L)) \geq \frac{1}{n^{2}},

(14)

where $ρ (Π (L))$ is the modified log-Sobolev constant of the base-exchange Markov chain on the support of $Π (g)$ with g as in (11). The second inequality comes from the fact that $ρ (Π (L)) \geq 1 / n$ as is shown by Cryan et al. [20]. The first inequality is somewhat tedious to prove as it requires a Markov chain comparison argument between two Markov chains on different state spaces and is, therefore, deferred to Appendix B.

We continue with bounding the modified log-Sobolev constant of the restriction chains. In order to do this, we use a comparison argument with the random transposition Markov chain on the set S_k of all permutations of ${1, \dots, k}$ . Given a permutation σ, this chain proceeds by selecting two positions a and b uniformly at random and interchanging the positions of the elements $σ (a)$ and $σ (b)$ . With ρ_rt denoting the modified log-Sobolev constant of this chain, it follows that, for every $α \in L$ , we have

ρ_{α} \geq ρ_{r t} \geq \frac{1}{n - 1}

(15)

using the fact that $ρ_{r t} \geq 1 / (n - 1)$ as shown by Goel [32]. This comparison argument is also deferred to Appendix B. Now, applying Theorem A.1 with $χ \geq 1 / n$ together with the bounds (14) and (15), it follows that the modified log-Sobolev constant ρ of the relaxed logit dynamics Markov chain satisfies $ρ \geq 1 / n^{3}$ . Plugging this into (7), it then follows that

τ_{s} (ϵ) \leq n^{3} (\log n + \log \log | P | + \log (\frac{2 T Φ_{\max}}{ϵ^{2}}))

using that $π {(s)}^{- 1} \leq | P |^{n} e^{T Φ_{\max}}$ for every $s \in S$ because of the nonnegativity of the cost functions. □
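As a worked check of how the three bounds combine, the following sketch spells out the decomposition step. It assumes that Theorem A.1 is applied with $χ \geq 1 / n$ from (13), $\bar{ρ} \geq 1 / n^{2}$ from (14), and $ρ_{α} \geq 1 / (n - 1)$ for every restriction chain from (15), and that ρ denotes the modified log-Sobolev constant of the relaxed logit dynamics chain.

```latex
\rho \;\geq\; \min\{\chi \bar{\rho},\; \rho_{\min}\}
     \;\geq\; \min\left\{ \frac{1}{n} \cdot \frac{1}{n^{2}},\; \frac{1}{n-1} \right\}
     \;=\; \frac{1}{n^{3}}
     \qquad \text{for } n \geq 2 .
```

The projection term is the bottleneck: the restriction chains contribute $1 / (n - 1)$ , which is never smaller than $1 / n^{3}$ .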

4.3. Uniform Sampling of Pure Nash Equilibria

In Theorem 6, we show that the result in Theorem 4 also implies that, for an extension parallel congestion game Γ, we can sample a pure Nash equilibrium (approximately) uniformly at random from the set $NE (Γ)$ of all pure Nash equilibria of Γ in pseudo-polynomial time. That is, we sample every $s \in NE (Γ)$ with probability $\approx 1 / | NE (Γ) |$ . The (approximate) uniform sampling of combinatorial objects has received a lot of attention in the last 30 years, in particular within the area of theoretical computer science. However, to the best of our knowledge, no nontrivial results for (pure) Nash equilibria are known despite the fact that the problem of computing Nash equilibria has received much attention.

For the proof of Theorem 6, we use the fact that Nash equilibria are precisely the strategy profiles minimizing Rosenthal’s potential in EP congestion games. Furthermore, we exploit the fact that Theorem 4 holds for any rationality level $T \geq 0$ . In particular, if we set T large enough, then most weight in the stationary distribution is assigned to profiles minimizing Rosenthal’s potential (under the assumption that the cost functions are integer-valued). This means that, with high probability, Algorithm 1 outputs a strategy profile minimizing Rosenthal’s potential with domain $S$ . (Whenever we refer to Algorithm 1, we mean the implementation of the high-level approach as given in the previous section.) We use the Gibbs distribution with base two instead of e to avoid having to work with real numbers.

Theorem 6.

Let $ϵ > 0$ , and let Γ be an extension parallel congestion game with integer-valued cost functions and n players. There is a randomized algorithm $A$ with output distribution $\bar{π}$ over $NE (Γ)$ that is ϵ-close to the uniform distribution over $NE (Γ)$ and runs in (expected) time polynomial in $n, m, Φ_{\max}$ and $\log (1 / ϵ)$ , where $Φ_{\max}$ is the maximum value attained by Rosenthal’s potential.

For the proof of Theorem 6, we use the following correspondence between Nash equilibria and strategy profiles minimizing Rosenthal’s potential.

Proposition 6

(Fotakis [29], Holzman and Law-Yone [37]). The set of strategy profiles $NE (Γ)$ of an extension parallel congestion game Γ coincides with the set of strategy profiles that minimize Rosenthal’s potential as in (3).

Proof of Theorem 6.

We first show that, for T sufficiently large in the algorithm used to prove Theorem 4, most weight is assigned to strategy profiles minimizing Rosenthal’s potential. We apply the idea in Algorithm 1 used to prove Theorem 4 with base two instead of base e. Remember that q is the number of (o, d)-paths in the extension parallel network of the game Γ. Let $ϕ = Φ (s)$ be the common potential value of all strategy profiles $s \in NE (Γ)$ . For any other strategy profile $s^{'} \in S \setminus NE (Γ)$ , we have

2^{- T Φ (s^{'})} \leq 2^{- T (ϕ + 1)} = 2^{- T} 2^{- T ϕ}

by assumption that all cost functions are integer-valued. As there are q strategies to choose from for every player, we have $| S | = q^{n} = 2^{n \log_{2} (q)}$ . This implies that the Gibbs distribution π over $S$ with rationality level $T = ⌈ n \log_{2} (q) + \log_{2} (2 / ϵ) ⌉$ and normalizing constant Z satisfies

π (S \setminus NE (Γ)) = \frac{1}{Z} \sum_{s^{'} \in S \setminus NE (Γ)} 2^{- T Φ (s^{'})} \leq \frac{1}{Z} 2^{n \log_{2} (q)} 2^{- T} 2^{- T ϕ} \leq \frac{ϵ}{2} \cdot π (NE (Γ)) .

(16)

The algorithm for sampling an (almost) uniform sample from $NE (Γ)$ now works as follows. First, compute a strategy profile minimizing Rosenthal’s potential in order to determine $ϕ$ . This can be done efficiently; see, for example, Fotakis [29]. Then, run Algorithm 1 with $T = ⌈ n \log_{2} (q) + \log_{2} (2 / ϵ) ⌉$ and $ϵ^{'} = ϵ / 2$ . If the resulting strategy profile has potential value $ϕ$ , output this strategy profile, and otherwise, rerun Algorithm 1 until it does. Note that, with probability at least $(1 - ϵ / 2)$ , Algorithm 1 outputs a strategy profile with potential value $ϕ$ in one run. A simple argument then shows that the output distribution is ϵ-close to the uniform distribution over $NE (Γ)$ as desired. □
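The choice of T and the rejection loop can be illustrated on a toy instance. The sketch below makes several assumptions that are not in the text: it uses a singleton game with integer-valued cost functions `costs[e]`, it computes the base-two Gibbs weights exactly by full enumeration (feasible only for tiny n and q), and `sampler` is a hypothetical stand-in for one run of Algorithm 1.

```python
import math
from fractions import Fraction
from itertools import product


def rationality_level(n, q, eps):
    """T = ceil(n * log2(q) + log2(2 / eps)), as used in the proof."""
    return math.ceil(n * math.log2(q) + math.log2(2 / eps))


def potential(s, costs):
    """Rosenthal's potential of a singleton game with integer costs."""
    loads = [0] * len(costs)
    for e in s:
        loads[e] += 1
    return sum(costs[e](k) for e, x in enumerate(loads) for k in range(1, x + 1))


def non_minimizer_ratio(n, costs, eps):
    """Exact ratio pi(S minus NE) / pi(NE) for the base-two Gibbs distribution."""
    q = len(costs)
    T = rationality_level(n, q, eps)
    pots = [potential(s, costs) for s in product(range(q), repeat=n)]
    phi = min(pots)
    w_min = sum(Fraction(1, 2 ** (T * p)) for p in pots if p == phi)
    w_rest = sum((Fraction(1, 2 ** (T * p)) for p in pots if p != phi), Fraction(0))
    return w_rest / w_min


def sample_equilibrium(sampler, potential_of, phi):
    """Rerun the sampler until it returns a profile with potential phi."""
    while True:
        s = sampler()
        if potential_of(s) == phi:
            return s
```

The normalizing constant cancels in the ratio, so the check against $ϵ / 2$ in (16) needs only the unnormalized weights.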

Remark 5.

The pseudo-polynomial dependence coming from the polynomial dependence on $Φ_{\max}$ rather than $\log_{2} (Φ_{\max})$ arises from the fact that we have to compute transition probabilities of the form $2^{a_{i}} / \sum_{j} 2^{a_{j}}$ , where the $a_{j}$ are integers, which requires $Ω (\sum_{j} a_{j})$ random 0/1 bits (following the notion of suitable probabilities in Section 2.6). However, there is no a priori reason that the problem of (approximately) sampling pure Nash equilibria according to the uniform distribution requires pseudo-polynomial (in the input size of the original congestion game) time as opposed to sampling from the Gibbs distribution. We leave open the question of finding a (truly) polynomial time algorithm.

5. Max-k-Cut Games

In this section, we give an application of Theorem 5 to max-k-cut games. We first repeat their definition and show how they can be modeled as a special case of extension-parallel congestion games in case the graph G of the game is complete.

A max-k-cut game is given by an undirected graph $G = (V, E)$ whose nodes V are the players. Every $v \in V$ has strategy set $[k] = \{1, \dots, k\}$ whose elements are referred to as colors. With $N (i) = \{j \in V : \{i, j\} \in E\}$ the set of neighbors of node $i \in V$ , the utility of player i is the number $U_{i} (s) = | \{j \in N (i) : s_{i} \neq s_{j}\} |$ of neighbors that choose a different color (hence, the name max-k-cut game, referring to the well-known max-cut problem (Garey and Johnson [31])). Equivalently, the cost of a player is given by the number of neighbors that choose the same color; that is, for $s \in {[k]}^{n}$ , we have $C_{i} (s) = | \{j \in N (i) : s_{i} = s_{j}\} |$ . In terms of (relaxed) logit dynamics, considering the cost of a player is equivalent to considering its utility because $C_{i} (s) + U_{i} (s) = | N (i) |$ is independent of the chosen color in s.

When $G = (V, E)$ is the complete graph, a max-k-cut game can naturally be modeled as a so-called symmetric singleton congestion game in which $E = [k]$ is the set of resources and every resource is equipped with the cost function $c_{e} (x) = x - 1$ for $e \in [k]$ . Indeed, if $ℓ$ players are using resource/color e, then the cost for every player choosing color e is $ℓ - 1$ , which is precisely the number of neighbors choosing that color for every such node (because G is the complete graph). Such games are easily seen to be special cases of extension parallel congestion games.
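The equivalence between the two cost models on the complete graph can be checked directly. The sketch below is only an illustration under the assumption that colors are indexed $0, \dots, k - 1$ and a coloring is a list with one color per node; all function names are hypothetical. It also computes Rosenthal's potential for the cost functions $c_{e} (x) = x - 1$ , which equals the number of monochromatic edges.

```python
from itertools import combinations


def neighbor_cost(i, s):
    """Max-k-cut cost on the complete graph: neighbors sharing i's color."""
    return sum(1 for j in range(len(s)) if j != i and s[j] == s[i])


def congestion_cost(i, s):
    """Singleton congestion cost c_e(x) = x - 1 at the load of i's color."""
    load = sum(1 for c in s if c == s[i])
    return load - 1


def rosenthal_potential(s, k):
    """Sum over colors e of c_e(1) + ... + c_e(x_e) with c_e(x) = x - 1."""
    loads = [sum(1 for c in s if c == e) for e in range(k)]
    return sum(x - 1 for load in loads for x in range(1, load + 1))


def monochromatic_edges(s):
    """Number of edges of the complete graph whose endpoints share a color."""
    return sum(1 for i, j in combinations(range(len(s)), 2) if s[i] == s[j])
```

For any coloring, `neighbor_cost` and `congestion_cost` agree for every player, and `rosenthal_potential` agrees with `monochromatic_edges`.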

The relaxed logit dynamics may then be interpreted as the process by which either a player changes its color (according to the transitions as specified by the logit dynamics) or the colors of two players are interchanged. As $Φ_{\max} \leq n (n - 1) / 2$ , which is the total number of edges of the graph G, we obtain the following corollary of Theorem 5.

Corollary 2.

For a max-k-cut game played on the complete graph $G = (V, E)$ with initial state $s \in {[k]}^{n}$ , where $n = | V |$ , the mixing time of the relaxed logit dynamics Markov chain at rationality level $T \geq 0$ satisfies

τ_{s} (ϵ) \leq n^{3} (\log n + \log \log k + \log (\frac{T n (n - 1)}{ϵ^{2}})) .

Levin et al. [44] show that the mixing time of the (nonrelaxed) logit dynamics on the complete graph depends on the parameter $T \geq 0$ in the case k = 2. They show there is a critical value T_c so that, when $T \leq T_{c}$ , the logit dynamics converge quickly to the Gibbs distribution, whereas when $T > T_{c}$ , the dynamics converge slowly. What Corollary 2 shows is that, if one in addition is allowed to randomly interchange the colors of two nodes, the slow mixing for $T > T_{c}$ can be resolved. We believe this to be of independent interest.

6. Capacitated Uniform Congestion Games

In this section, we consider u-capacitated k-uniform congestion games for vectors $u = (u_{1}, \dots, u_{m})$ and $k = (k_{1}, \dots, k_{n})$ . We write $K = | k | = \sum_{i} k_{i}$ and $U = | u | = \sum_{e} u_{e}$ . The vector u models the capacities of the resources $e \in E$ , that is, the variables ${(u_{e})}_{e \in E}$ as defined in Section 2.1. The strategy set of player $i \in N$ is given by all subsets $S \subseteq E$ of cardinality $| S | = k_{i}$ , that is, the bases of the k_i-uniform matroid on E. We write $Γ (u, k)$ for the collection of all u-capacitated k-uniform congestion games. We remark that, in this section, load profiles refer to resource load profiles as defined in Section 2.1 and no longer to path load profiles as considered in Section 4.

Note that we can naturally model a feasible strategy profile $s = (s_{1}, \dots, s_{n}) \in S$ of a capacitated uniform congestion game as a (simple) bipartite graph $G = (N \cup E, F) \in G (u, k)$ on $N \cup E$ : there is an edge $\{i, e\} \in F$ if and only if player $i \in N$ uses resource $e \in E$ in its strategy $s_{i}$ . The main result needed in this section is stated in Proposition 7. We use the notation ${[x]}_{b} = x (x - 1) \dots (x - b + 1)$ for $x, b \in ℤ_{\geq 1}$ . For a bipartite degree sequence $(k, α)$ , we then write $K_{b} = \sum_{i = 1}^{n} {[k_{i}]}_{b}$ and $A_{b} = \sum_{j = 1}^{m} {[α_{j}]}_{b}$ . Note that $K = A = K_{1} = A_{1}$ .
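The bipartite-graph view and the quantities $K_{b}$ and $A_{b}$ can be made concrete with a short sketch. It assumes resources are indexed $0, \dots, m - 1$ and each strategy $s_{i}$ is given as a set of resource indices; the function names are hypothetical.

```python
def falling_factorial(x, b):
    """[x]_b = x (x - 1) ... (x - b + 1)."""
    result = 1
    for t in range(b):
        result *= x - t
    return result


def degree_sequences(s, m):
    """Degrees (k, alpha) of the bipartite graph of a strategy profile.

    k[i] is the number of resources used by player i and alpha[e] is the
    number of players using resource e (the resource load profile).
    """
    k = [len(strategy) for strategy in s]
    alpha = [0] * m
    for strategy in s:
        for e in strategy:
            alpha[e] += 1
    return k, alpha


def moment(degrees, b):
    """K_b or A_b: the sum of [d]_b over a degree sequence."""
    return sum(falling_factorial(d, b) for d in degrees)
```

For example, the profile `[{0, 1}, {1, 2}, {1}]` on three resources has `k = [2, 2, 1]` and `alpha = [1, 3, 1]`, so both first moments equal the number of edges, 5.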

Proposition 7

(McKay [49]). Let $D$ be the collection of all bipartite degree sequences $(k, α)$ for which $1 \leq k_{\max} α_{\max} = o (K^{1 / 4})$ . Then,

| G (k, α) | = \frac{K!}{\prod_{i} k_{i}! \prod_{j} α_{j}!} \exp \left( - \frac{K_{2}}{K^{2}} \cdot A_{2} + O \left( \max \{k_{\max}, α_{\max}\}^{4} / K \right) \right)

as $K \to \infty$ .

6.1. Sampling from the Gibbs Distribution

In this section, we give an (almost) polynomial time sampling algorithm that samples from a distribution that is close to the Gibbs distribution provided the game is sufficiently large. That is, we show that, for a large class of pairs (u, k), we can sample from a distribution close to the Gibbs distribution.

We follow again the high-level approach in Algorithm 1. The set of all feasible load profiles is now given by $L (k, u) = \{α : 0 \leq α \leq u \text{ and } | α | = \sum_{i} k_{i}\}$ and $S (α) = G (k, α)$ for any feasible load profile $α \in L (k, u)$ . Recall that we want to sample an $α \in L (k, u)$ with probability (approximately) proportional to $| S (α) | e^{- T Φ (α)}$ and then sample a strategy profile $s \in S (α)$ with probability $\approx 1 / | S (α) |$ .

A couple of problems arise here compared with the case of extension parallel congestion games. First of all, there is no polynomial time algorithm known to compute the numbers $w_{α} = | S (α) |$ . (In fact, it is still an open question whether this problem is #P-complete; see, e.g., Jerrum et al. [40].) Instead, we rely on a fully polynomial randomized approximation scheme for computing approximations ${\hat{w}}_{α}$ to the numbers $w_{α}$ up to arbitrary precision (Bezáková et al. [13], Jerrum et al. [40]). Second, in this case, the polynomial

g (x_{1}, \dots, x_{n}) = \sum_{α \in L (k, u)} | S (α) | e^{- T Φ (α)} x^{α}

(17)

is in general not strongly log-concave. We overcome this problem by showing that we can restore strong log-concavity “approximately” when the game becomes large and when there are, in addition, suitable capacity constraints. We do this by using asymptotic enumeration formulas for the number of bipartite graphs with a given degree sequence, an area that has received considerable attention in combinatorics. It turns out that replacing $| S (α) |$ by an asymptotic approximation $ϕ (α)$ in (17) gives rise to a strongly log-concave polynomial.

Finally, the problem of sampling a strategy profile $s \in S (α) = G (k, α)$ now corresponds to that of sampling a bipartite graph with degree sequence $(k, α)$ for which many algorithms are known. The main result of this section is given in Theorem 7.

Theorem 7.

Let $ϵ > 0$ , let π be the Gibbs distribution at rationality level $T \geq 0$ , and let $D$ be the class of all congestion games $Γ (k, u)$ satisfying

1 \leq k_{\max} u_{\max} = o (K^{1 / 4}) .

(18)

There is a randomized algorithm $A$ for the class $D$ and a constant $K_{0} \geq 0$ such that the output distribution $\bar{σ}$ over $S$ has the property that $d_{T V} (\bar{σ}, π) \leq ϵ$ whenever $K \geq K_{0}$ . The algorithm runs in (expected) time

C \cdot n (\log n + \log \log | P | + \log (\frac{2 T Φ_{\max}}{ϵ^{2}}))

with $C (n, m, ϵ, Φ_{\max}) = poly (1 / ϵ, n, m, Φ_{\max})$ .

Proof.

Setting

ϕ (α) = \frac{K!}{k! α!} \exp (- \frac{K_{2}}{K^{2}} \cdot A_{2}),

it follows from Proposition 7, assuming (18) holds, that, for any $0 \leq α \leq u$ , we have $ϕ (α) = (1 + o (1)) | S (α) |$ , where o(1) is with respect to $K \to \infty$ . In particular, if $K \geq K_{0}$ for $K_{0}$ large enough, it follows that

\frac{1}{2} | S (α) | \leq ϕ (α) \leq \frac{3}{2} | S (α) | .

(19)

The next step is now to show that replacing $| S (α) |$ by $ϕ (α)$ in (17) gives rise to a strongly log-concave polynomial. The crucial observation here is to see that $A_{2} = \sum_{j} α_{j} (α_{j} - 1)$ is a separable convex function.

Lemma 2.

The K-homogeneous generating polynomial

g (x_{1}, \dots, x_{n}) = \sum_{α \in L (k, u)} \frac{K!}{k! α!} \cdot \exp (- \frac{K_{2}}{K^{2}} \cdot \sum_{j = 1}^{m} α_{j} (α_{j} - 1)) \exp (- T Φ (α)) \cdot x^{α}

(20)

is strongly log-concave.

Proof.

Following the proof of Lemma 1, first observe that, by Proposition 2, it suffices to show that (note that k and all quantities involving k are considered fixed)

\frac{k!}{K!} g (x_{1}, \dots, x_{n}) = \sum_{α \in L (k, u)} \frac{1}{α!} \exp (- \frac{K_{2}}{K^{2}} \cdot \sum_{j = 1}^{m} α_{j} (α_{j} - 1)) \exp (- T Φ (α)) x^{α}

is strongly log-concave. Then, in order to apply Proposition 4, it suffices to show that

- (\frac{K_{2}}{K^{2}} \cdot \sum_{j = 1}^{m} α_{j} (α_{j} - 1) + T Φ (α))

is M-concave over its domain L(k, u). (Formally speaking, we define it to be $- \infty$ outside of L(k, u).) This follows directly from the fact that both $\sum_{j = 1}^{m} α_{j} (α_{j} - 1)$ and $Φ (α)$ are separable convex functions in $(α_{1}, \dots, α_{m})$ over the (effective) domain L(k, u), and the fact that $K_{2}, K, T \geq 0$ . Separable convex functions (over effective domain L(k, u)) are known to be M-convex (Murota [52]). □

One can now carry out similar steps as in the proof of Theorem 4, albeit with some modifications. Again, the polarization $Π (g)$ of g as in (20) is strongly log-concave as well. The support of $Π (g)$ is now the set of bases of the n-truncation of a partition matroid $N$ on ground set $E = \cup_{j} E_{j}$ , where $E_{j} = \{(j, i) : 1 \leq i \leq n\}$ and where $A \subseteq E$ is independent if and only if $| A \cap E_{j} | \leq u_{j}$ for $j = 1, \dots, m$ . It follows that the modified log-Sobolev constant of the base-exchange Markov chain on $N$ satisfies $ρ_{N} \geq 1 / n$ . Using a Markov chain comparison argument as described in Remark A.1 in Appendix A.2 with $δ = 1 / 2$ because of (19) then yields that the modified log-Sobolev constant ρ of the Markov chain in which we use the original quantities $| S (α) |$ , instead of the approximation $ϕ (α)$ , satisfies $ρ \geq 1 / (3 n)$ .

The algorithmic problem is now that we cannot compute the quantities $w_{α} = | S (α) |$ exactly in polynomial time, so one step of this base-exchange Markov chain cannot be implemented efficiently. Nevertheless we can use the approximation scheme of Bezáková et al. [13] to compute approximations ${\hat{w}}_{α}$ up to arbitrary precision in time polynomial in n, m and $1 / ϵ$ (this gives the dependence of $1 / ϵ$ in C). A Markov chain comparison argument (again as in Remark A.1) then implies that the base-exchange Markov chain on $N$ using these approximations is also rapidly mixing, and in particular, it is sufficient to run the chain

3 n (\log n + \log \log | P | + \log (\frac{2 T Φ_{\max}}{ϵ^{2}}))

steps and then output $α (B)$ , where B is the current base after having run the chain for the aforementioned number of steps.

We conclude with the sampling of a strategy profile from the set $S (α)$ uniformly at random, which now requires sampling a bipartite graph from the set $G (k, α)$ uniformly at random. One algorithm to do this for degree sequences satisfying the condition in Proposition 7 is that of Arman et al. [7] that runs in expected polynomial time. For an overview of algorithms that can be used to (approximately) sample a bipartite graph with a given degree sequence, see, for example, Dyer et al. [23]. □

Remark 6.

We remark here that the algorithm of Theorem 7 does not run in polynomial time as described in Section 2.6 because of the dependence of C on $1 / ϵ$ (as opposed to the required $\log (1 / ϵ)$ ). This dependence arises because of the algorithmic approximations of the numbers $| S (α) |$ used in the proof of Theorem 7. Alternatively, we could just use the approximations $ϕ (α)$ straight away. However, these approximations only become accurate when $K \to \infty$ , so this gives a weaker result in terms of closeness to the Gibbs distribution.

Acknowledgments

The author is grateful to Prasad Tetali for pointing him to Hermon and Salez [36] and to the anonymous reviewers of the 22nd ACM Conference on Economics and Computation and Mathematics of Operations Research for their useful comments. A large part of this work has been carried out while the author was a postdoctoral fellow at the Max Planck Institute for Informatics in Saarbrücken, Germany. An abstract of this work appears in the Proceedings of the 22nd ACM Conference on Economics and Computation (Kleer [42]).

Appendix A. Markov Chains and Functional Inequalities

Three well-known quantities that can be used to upper bound the mixing time of a Markov chain are the Poincaré constant, the log-Sobolev constant, and the modified log-Sobolev constant. In this section, we define the modified log-Sobolev constant.

Let $M = (Ω, P)$ be a time-reversible Markov chain with stationary distribution π and $f, g : Ω \to R_{\geq 0}$ . Let $E_{π} (f) = \sum_{x \in Ω} π (x) f (x)$ . Furthermore, define the entropy-like quantity

{Ent}_{π} (f) = E_{π} [f \log (f) - f \log (E_{π} (f))]

and the Dirichlet form

E_{P} (f, g) = \frac{1}{2} \sum_{x \in Ω} \sum_{y \in Ω} π (x) P (x, y) [f (x) - f (y)] [g (x) - g (y)] .

The modified log-Sobolev constant of the Markov chain $M$ is defined by

ρ (P) = \inf {\frac{E_{P} (f, \log (f))}{{Ent}_{π} (f)} | f : Ω \to R_{\geq 0}, {Ent}_{π} (f) \neq 0} .

As also mentioned in Section 2.5, it holds that

τ_{x} (ϵ) \leq \frac{1}{ρ (P)} (\log \log π {(x)}^{- 1} + \log (\frac{1}{2 ϵ^{2}})) .
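The quantities defined in this appendix can be evaluated numerically on a small chain. The sketch below is only an illustration: it assumes the chain is given by a list `pi` and a row-stochastic matrix `P` as nested lists, that f is strictly positive, and all function names are hypothetical. Because $ρ (P)$ is an infimum over f, the ratio computed for any single f is an upper bound on $ρ (P)$ , not its value.

```python
import math


def expectation(pi, f):
    """E_pi(f) = sum_x pi(x) f(x)."""
    return sum(p * v for p, v in zip(pi, f))


def entropy(pi, f):
    """Ent_pi(f) = E_pi[f log f - f log E_pi(f)] for strictly positive f."""
    mean = expectation(pi, f)
    return sum(p * (v * math.log(v) - v * math.log(mean)) for p, v in zip(pi, f))


def dirichlet_form(pi, P, f, g):
    """E_P(f, g) = 1/2 sum_{x,y} pi(x) P(x,y) (f(x)-f(y)) (g(x)-g(y))."""
    n = len(pi)
    return 0.5 * sum(pi[x] * P[x][y] * (f[x] - f[y]) * (g[x] - g[y])
                     for x in range(n) for y in range(n))


def mlsi_ratio(pi, P, f):
    """E_P(f, log f) / Ent_pi(f); an upper bound on rho(P) for this f."""
    log_f = [math.log(v) for v in f]
    return dirichlet_form(pi, P, f, log_f) / entropy(pi, f)


def mixing_time_bound(rho, pi_x, eps):
    """(1 / rho) * (log log (1 / pi(x)) + log (1 / (2 eps^2)))."""
    return (math.log(math.log(1.0 / pi_x)) + math.log(1.0 / (2.0 * eps ** 2))) / rho
```

Given a lower bound on $ρ (P)$ , such as the $1 / n^{3}$ obtained in the proof of Theorem 5, `mixing_time_bound` evaluates the right-hand side of the displayed inequality.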

A.1. Markov Chain Decomposition

Let $M = (Ω, P)$ be a time-reversible Markov chain with stationary distribution π, and let $Ω = Ω_{1} \cup \dots \cup Ω_{m}$ be a disjoint partition of the state space. We write ρ for the modified log-Sobolev constant of this chain, $\bar{ρ}$ for the modified log-Sobolev constant of the projection chain, and ρ_i for that of the restriction chain P_i (and define $ρ_{\min} = \min_{i} ρ_{i}$ ).

Hermon and Salez [36] recently gave a Markov chain decomposition theorem that applies to the Poincaré constant, the log-Sobolev constant, and the modified log-Sobolev constant. For other Markov chain decomposition theorems, see, for example, the work of Jerrum et al. [41], who, in particular, give stronger theorems for the Poincaré and log-Sobolev constants. We next describe the necessary objects to formulate the result of Hermon and Salez [36].

Assume that, for each $i, j \in [m]$ with $i \neq j$ and $\bar{P} (i, j) > 0$ , we are given a coupling $κ_{i j} : Ω_{i} \times Ω_{j} \to [0, 1]$ of the probability distributions π_i and π_j. That is, κ_ij is such that

\begin{array}{l} \forall x \in Ω_{i}, \sum_{y \in Ω_{j}} κ_{i j} (x, y) = π_{i} (x), \\ \forall y \in Ω_{j}, \sum_{x \in Ω_{i}} κ_{i j} (x, y) = π_{j} (y) . \end{array}

Based on the couplings κ_ij, we define

χ = \min_{x \in Ω_{i}, y \in Ω_{j}, i, j \in [m]} \left\{ \frac{π (x) P (x, y)}{\bar{π} (i) \bar{P} (i, j) κ_{i j} (x, y)} \right\},

with the minimum taken over all combinations for which the denominator in the fraction is strictly positive. We state (a small variation of) the theorem of Hermon and Salez [36] for the modified log-Sobolev constant (for the other constants the statements are similar).

Theorem A.1

(Hermon and Salez [36]). With the preceding notation, it holds that $ρ \geq \min {χ \bar{ρ}, ρ_{\min}} .$ Furthermore, the parameter χ satisfies

χ \geq \max_{x \in Ω_{i}, y \in Ω_{j}, i, j \in [m] : \bar{P} (i, j) > 0} {\frac{P (x, y)}{\bar{P} (i, j)}, \frac{P (y, x)}{\bar{P} (j, i)}} .

A.2. Markov Chain Comparison

Another useful property of proving mixing time bounds through Poincaré and (modified) log-Sobolev constants is that it is easy to see that small perturbations in the transition probabilities and the stationary distribution only result in mild variations in these constants by means of a Markov chain comparison argument. Goel [32] shows the following for the modified log-Sobolev constant based on similar results for the other constants by Diaconis and Saloff-Coste [22].

Theorem A.2

(based on Goel [32, Lemma 4.1]). Let $M = (Ω, P)$ and $M^{'} = (Ω^{'}, P^{'})$ be two finite, reversible Markov chains with stationary distributions π and $π^{'}$ , respectively, and modified log-Sobolev constant ρ and $ρ^{'}$ , respectively. Assume there is a mapping $ϕ$ mapping any (arbitrary) $f : Ω \to R_{\geq 0}$ to $f^{'} : Ω^{'} \to R_{\geq 0}$ and constants $C, c > 0$ and $B \geq 0$ such that, for all f, we have

E_{P^{'}} (f^{'}, \log f^{'}) \leq C \cdot E_{P} (f, \log f) \quad \text{and} \quad c \cdot {Ent}_{π} (f) \leq {Ent}_{π^{'}} (f^{'}) + B \cdot E_{P} (f, \log f) .

Then,

\frac{c ρ^{'}}{C + B ρ^{'}} \leq ρ .

Remark A.1.

In particular, if $Ω = Ω^{'}$ and there exists a $δ > 0$ such that $(1 - δ) P (x, y) \leq P^{'} (x, y) \leq (1 + δ) P (x, y)$ for all $x, y \in Ω$ and $(1 - δ) π (x) \leq π^{'} (x) \leq (1 + δ) π (x)$ for $x \in Ω$ , it directly follows that

\frac{1}{ρ} \leq \frac{1 + δ}{1 - δ} \cdot \frac{1}{ρ^{'}} .

Appendix B. Comparison Arguments Omitted in Proof of Theorem 5

B.1. First Inequality in (14)

We start with showing the first inequality in (14), which is

\bar{ρ} \geq \frac{1}{n} \cdot ρ (Π (L)),

by using Theorem A.2. We heavily abuse notation and write $Π (\cdot)$ for many different objects in order not to overload the notation. Remember that $\bar{ρ}$ is the modified log-Sobolev constant of the projection chain $M = (L, \bar{P})$ , where $L = \{α \in {[q]}^{n} : | α | = n\}$ , with stationary distribution given by

\bar{π} (α) \propto | S (α) | e^{- T Φ (α)} .

Furthermore, $ρ (Π (L))$ is the modified log-Sobolev constant of the base-exchange Markov chain $M_{Π} = (Π (L), P_{Π})$ on $Π (L)$ with stationary distribution $π_{Π}$. Similar to what is explained in Section 3.1, we may write $Π (L) = \cup F_{i}$, where $F_{i} = \{(i, j) : 1 \leq j \leq n\}$ for $i = 1, \dots, n$, and $Π (L)$ is then the set of bases of the n-uniform matroid on $\cup F_{i}$. Roughly speaking, for every path $p \in P$, we introduce n auxiliary elements (corresponding to the n auxiliary variables introduced when polarizing).

For every $α \in L$, there are $\binom{n}{α}$ bases $B \in Π (L)$ corresponding to it (as in Section 3.1). We denote this set of bases by

Π (α) = \{A \in Π (L) : α (A) = α\},

where $α (A)$ is the vector given by $α_{i} = | A \cap F_{i} |$ for $i = 1, \dots, n$.
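The number of bases per load profile can be checked by brute force on a small instance. The sketch below assumes q blocks $F_{i}$ of n elements each, and it reads $\binom{n}{α}$ as the product $\prod_{i} \binom{n}{α_{i}}$, which is the number of n-element sets A with $| A \cap F_{i} | = α_{i}$. This reading, the block count q, and all function names are assumptions made for illustration only.

```python
from collections import Counter
from itertools import combinations
from math import comb, prod


def count_bases_by_profile(n, q):
    """Group all n-subsets of the ground set by their profile (|A ∩ F_0|, ..., |A ∩ F_{q-1}|)."""
    # Ground set: q blocks F_i = {(i, 0), ..., (i, n - 1)} (hypothetical indexing).
    ground = [(i, j) for i in range(q) for j in range(n)]
    counts = Counter()
    for base in combinations(ground, n):
        alpha = [0] * q
        for block, _ in base:
            alpha[block] += 1
        counts[tuple(alpha)] += 1
    return counts


n, q = 3, 2
for alpha, count in sorted(count_bases_by_profile(n, q).items()):
    # Compare the brute-force count with the product of binomial coefficients.
    print(alpha, count, prod(comb(n, a) for a in alpha))
```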

We start with defining the required mapping $ϕ$ needed in Theorem A.2. For $f : L \to \mathbb{R}_{\geq 0}$, we define $f^{'} : Π (L) \to \mathbb{R}_{\geq 0}$ simply by setting $f^{'} (A) = f (α (A))$ for $A \in Π (L)$. It can then easily be checked that $\mathrm{Ent}_{\bar{π}} (f) = \mathrm{Ent}_{π_{Π}} (f^{'})$ as $\bar{π} (α) = \sum_{A \in Π (α)} π_{Π} (A)$. This means that we can take c = 1 and B = 0 in Theorem A.2. In order to show the desired Dirichlet form inequality in the statement of Theorem A.2, it suffices to prove that, for any adjacent $α, β \in L$, it holds that

\sum_{A \in Π (α)} π_{Π} (A) \sum_{B \in Π (β)} P_{Π} (A, B) \leq C \cdot \bar{π} (α) \bar{P} (α, β)

(B.1)

with C = n. The fact that this is sufficient follows from the observation that summing up (B.1) for all ordered pairs $(α, β)$ for $α, β \in L$ gives the desired result (in combination with the definition of $f^{'}$).

Now, fix $α, β \in L$ and assume that they are adjacent (the case $α = β$ can be dealt with similarly). Remember that $γ \in L$ is adjacent to α if $\sum_{e} | α_{e} - γ_{e} | = 2$ , that is, there exist paths p and $p^{'}$ such that

α_{e} = \begin{cases} γ_{e} + 1 & \text{if } e = p, \\ γ_{e} - 1 & \text{if } e = p^{'}, \\ γ_{e} & \text{if } e \in E \setminus \{p, p^{'}\} . \end{cases}

Let r be the path for which $α_{r} = β_{r} + 1$ and write $N_{r} (α)$ for all load profiles γ adjacent to α for which $α_{r} = γ_{r} + 1$ (including β). Following the definition of the base-exchange Markov chain, it then holds that, for any $A \in Π (α)$ and $B \in Π (β)$ , we have

2 \cdot \sum_{B \in Π (β)} P_{Π} (A, B) = \frac{α_{p}}{n} \cdot \frac{(n - β_{p_{β}^{'}} + 1) {\binom{n}{β}}^{- 1} e^{- T Φ (β)}}{\sum_{γ \in N_{r} (α) \cup \{α\}} (n - γ_{p_{γ}^{'}} + 1) {\binom{n}{γ}}^{- 1} e^{- T Φ (γ)}} =: Q,

(B.2)

where $p_{γ}^{'}$ is used to indicate the path $p^{'} = p_{γ}^{'}$ for which $α_{p^{'}} = γ_{p^{'}} - 1$ for $γ \in N_{r} (α)$, and $p_{α}^{'} = r$. With some care, it can be shown that, for $γ \in N_{r} (α) \cup \{α\}$, it holds that

\frac{(n - γ_{p_{γ}^{'}} + 1) {\binom{n}{γ}}^{- 1}}{(n - β_{p_{β}^{'}} + 1) {\binom{n}{β}}^{- 1}} = \frac{γ_{p_{γ}^{'}}}{β_{p_{β}^{'}}} = \frac{β_{p_{γ}^{'}} + 1}{β_{p_{β}^{'}}} \geq \frac{1}{n} .
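The first equality and the lower bound of $1 / n$ in the preceding display can be verified exhaustively for small instances. The sketch below is only a numerical sanity check under stated assumptions: it reads $\binom{n}{γ}$ as $\prod_{e} \binom{n}{γ_{e}}$, it indexes the q paths by $0, \dots, q - 1$, and it models every $γ \in N_{r} (α) \cup \{α\}$ as the profile obtained from α by moving one player from path r to a path g (with g = r giving α itself). All function names are hypothetical.

```python
from fractions import Fraction
from itertools import product
from math import comb, prod


def load_profiles(n, q):
    """All load profiles of n players over q paths."""
    return [a for a in product(range(n + 1), repeat=q) if sum(a) == n]


def move(alpha, src, dst):
    """Profile obtained from alpha by moving one player from path src to path dst."""
    gamma = list(alpha)
    gamma[src] -= 1
    gamma[dst] += 1
    return tuple(gamma)


def weight(n, gamma, p_prime):
    """The factor (n - gamma_{p'} + 1) * binom(n, gamma)^{-1}, as an exact fraction."""
    return Fraction(n - gamma[p_prime] + 1, prod(comb(n, g) for g in gamma))


def check_ratio_identity(n, q):
    """Assert ratio == gamma_{p'_gamma} / beta_{p'_beta} >= 1/n; return the number of checks."""
    checked = 0
    for alpha in load_profiles(n, q):
        for r in range(q):
            if alpha[r] == 0:
                continue
            # Pairs (gamma, p'_gamma); the choice g == r reproduces alpha with p'_alpha = r.
            candidates = [(move(alpha, r, g), g) for g in range(q)]
            for beta, b in candidates:
                if b == r:
                    continue  # beta must be adjacent to alpha, hence different from it
                for gamma, g in candidates:
                    ratio = weight(n, gamma, g) / weight(n, beta, b)
                    assert ratio == Fraction(gamma[g], beta[b])
                    assert ratio >= Fraction(1, n)
                    checked += 1
    return checked


print(check_ratio_identity(n=4, q=3))
```

The check uses exact fractions, so an assertion failure would indicate a genuine mismatch rather than a floating-point artifact.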

Continuing the estimate in (B.2), we then get

Q \leq n \cdot \frac{α_{p}}{n} \cdot \frac{e^{- T Φ (β)}}{\sum_{γ \in N_{r} (α) \cup \{α\}} e^{- T Φ (γ)}} = 2 n \bar{P} (α, β),

using (12) for the final equality. This gives the desired result in (B.1). Applying Theorem A.2 with $c = 1$, $C = n$, and $B = 0$ then gives the desired first inequality in (14).

B.2. First Inequality in (15)

In order to show the inequality in (15), we again use a Markov chain comparison between two chains on different state spaces. We want to show that

ρ_{α} \geq ρ_{r t},

where $ρ_{α}$ is the modified log-Sobolev constant of the restriction chain on $S (α)$ for $α \in L$, in which we randomly interchange the strategies of two players, and $ρ_{r t}$ is the modified log-Sobolev constant of the so-called random transposition walk. From now on, we fix some $α \in L$.

It is convenient to study these chains in terms of bipartite graphs with given degrees on node partition $A \cup B$. For $α \in L$, we consider the degree sequence $x = (x_{1}, \dots, x_{n})$ with $x_{i} = 1$ for every $i \in B$, and the sequence $y = (y_{1}, \dots, y_{q})$ with $y_{p} = α_{p}$ for $p \in A$, where one should remember that q is the number of strategies, that is, paths, available in the common strategy set (denoted by A here) of all players. It follows directly that there is a one-to-one correspondence between $S (α)$ and $G (x, y)$, where, for a given strategy profile $s \in S (α)$, there is an edge $\{i, p\}$ if and only if $s_{i} = p$. (This is similar to the setting we consider in Section 6.) Given $s \in S (α)$, our restriction chain can be interpreted as randomly selecting two edges from the bipartite graph $G_{s}$ corresponding to the profile s and switching them if possible. That is, if we select $\{i, p\}$ and $\{i^{'}, p^{'}\}$ with $p \neq p^{'}$, we delete the edges $\{i, p\}$ and $\{i^{'}, p^{'}\}$ and add the edges $\{i, p^{'}\}$ and $\{i^{'}, p\}$ (note that $i \neq i^{'}$ always holds as the nodes in B have degree one).
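One step of the restriction chain, viewed as a switch of two edges in the bipartite graph, can be sketched as follows. A strategy profile is represented as a tuple s with s[i] the path of player i, so that the edge $\{i, p\}$ is present exactly when s[i] == p. The holding probability of one half (the `lazy` flag), the path indexing $0, \dots, q - 1$, and the function names are assumptions made for this illustration and are not taken from the definition of the chain in the main text.

```python
import random
from collections import Counter


def restriction_chain_step(s, rng, lazy=True):
    """Select two players (edges) at random and interchange their strategies.

    If both players use the same path, the switch leaves the profile unchanged.
    The lazy holding step is an assumption of this sketch.
    """
    s = list(s)
    if lazy and rng.random() < 0.5:
        return tuple(s)
    i, k = rng.sample(range(len(s)), 2)  # two distinct players
    s[i], s[k] = s[k], s[i]              # switch edges {i, p}, {k, p'} to {i, p'}, {k, p}
    return tuple(s)


def load_profile(s, q):
    """Number of players on each of the q paths."""
    counts = Counter(s)
    return tuple(counts[p] for p in range(q))


rng = random.Random(0)
s = (0, 0, 1, 2, 2, 2)  # six players on three paths
for _ in range(5):
    s = restriction_chain_step(s, rng)
    print(s, load_profile(s, q=3))
```

Every step preserves the load profile, which is why the chain stays inside $S (α)$.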

In order to introduce the random transposition Markov chain, we split up every node $p_{j} \in A$ into nodes $p_{j 1}, \dots, p_{j α_{j}}$ and consider bipartite graphs on two sets of n nodes $B = \{1, \dots, n\}$ and $A^{*} = \cup_{j} A_{j}^{*}$ with $A_{j}^{*} = \{p_{j 1}, \dots, p_{j α_{j}}\}$, where every node has degree one. That is, every such graph is a perfect matching between $A^{*}$ and B. Note that there are precisely

\prod_{j = 1}^{q} α_{j}! = α!

perfect matchings corresponding to the graph $G_{s}$ for $s \in S (α)$ under the natural transformation in which, for a given perfect matching, we consider the graph that we get by merging all the nodes $p_{j 1}, \dots, p_{j α_{j}}$ back into one node $p_{j}$ for every $j = 1, \dots, q$. We denote this set of perfect matchings by $H (s)$ for $s \in S (α)$. The random transposition Markov chain $M = (H, P)$, with $H = \cup_{s} H (s)$ denoting the set of all perfect matchings on the bipartition $A^{*} \cup B$, proceeds by selecting two edges (of the current perfect matching) uniformly at random and switching them. Note that this is always possible here, as opposed to the case of our restriction chains on $S (α)$.
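The count of $α!$ perfect matchings per strategy profile can be confirmed by enumeration on a small example. In the sketch below, a perfect matching is represented as a tuple whose i-th entry is the split node matched to player i, and merging the copies of a path recovers the profile s. The representation of split nodes as pairs (j, k) and the function names are hypothetical.

```python
from collections import Counter
from itertools import permutations
from math import factorial, prod


def split_nodes(alpha):
    """Split path j into alpha[j] copies (j, 0), ..., (j, alpha[j] - 1)."""
    return [(j, k) for j, a in enumerate(alpha) for k in range(a)]


def matchings_per_profile(alpha):
    """Count perfect matchings between players and split nodes, grouped by merged profile."""
    counts = Counter()
    for matching in permutations(split_nodes(alpha)):  # matching[i] = node of player i
        s = tuple(j for j, _ in matching)              # merge the copies back into path j
        counts[s] += 1
    return counts


alpha = (2, 1, 1)
counts = matchings_per_profile(alpha)
expected = prod(factorial(a) for a in alpha)
print(len(counts), set(counts.values()), expected)
```

Here the number of distinct profiles equals $| S (α) |$, and every profile is hit by the same number of matchings.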

We can now use a similar type of comparison argument as for the first inequality in (14) given earlier. We define the mapping $ϕ$ for a given function $f : S (α) \to \mathbb{R}_{\geq 0}$ by setting $f^{'} (M) = f (s)$ whenever $M \in H (s)$ for $s \in S (α)$. Let σ be the uniform distribution over $S (α)$ and let $σ_{H}$ be the uniform distribution over $H$. Note that $σ (s) = \sum_{M \in H (s)} σ_{H} (M)$. It then follows that, for every $s, s^{'} \in S (α)$, we have

\sum_{M \in H (s)} σ_{H} (M) \sum_{M^{'} \in H (s^{'})} P (M, M^{'}) = σ (s) P_{α} (s, s^{'})

(B.3)

because

P_{α} (s, s^{'}) = \frac{1}{n (n - 1)} = \sum_{M^{'} \in H (s^{'})} P (M, M^{'})

for all $M \in H (s)$ whenever $s \neq s^{'}$ and $P_{α} (s, s^{'}) > 0$. Note that there is only one matching $M^{'} \in H (s^{'})$ such that $P (M, M^{'}) > 0$. When $s = s^{'}$, we also have

\sum_{M^{'} \in H (s^{'})} P (M, M^{'}) = P_{α} (s, s^{'}) .

This implies that we can take C = 1. As before, we can take c = 1 and B = 0, so that Theorem A.2 gives the desired inequality $ρ_{α} \geq ρ_{r t}$.
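The row-wise identity behind (B.3) can also be checked by enumeration: for every matching $M \in H (s)$, projecting one random transposition step onto strategy profiles should give the same distribution as one step of the restriction chain from s. The sketch below assumes that both chains pick an unordered pair of players uniformly at random without a holding step; the identity being tested does not depend on this normalization, but the normalization itself and all names are assumptions of the example.

```python
from fractions import Fraction
from itertools import combinations, permutations


def swap(state, i, k):
    """Interchange the entries of players i and k."""
    t = list(state)
    t[i], t[k] = t[k], t[i]
    return tuple(t)


def step_distribution(state):
    """Distribution after swapping the entries of a uniformly random pair of players."""
    pairs = list(combinations(range(len(state)), 2))
    dist = {}
    for i, k in pairs:
        nxt = swap(state, i, k)
        dist[nxt] = dist.get(nxt, 0) + Fraction(1, len(pairs))
    return dist


def merge(matching):
    """Map a perfect matching on split nodes (j, k) to its strategy profile."""
    return tuple(j for j, _ in matching)


def check_projection(alpha):
    """True if projecting a transposition step always matches a restriction chain step."""
    nodes = [(j, k) for j, a in enumerate(alpha) for k in range(a)]
    for matching in permutations(nodes):
        projected = {}
        for nxt, prob in step_distribution(matching).items():
            s_next = merge(nxt)
            projected[s_next] = projected.get(s_next, 0) + prob
        if projected != step_distribution(merge(matching)):
            return False
    return True


print(check_projection((2, 1, 1)))
```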

Endnotes

¹ If one drops the assumption that the cost functions are nonnegative (see Section 2), the parameter $Φ_{\max}$ can be replaced by $Δ Φ ≔ Φ_{\max} - Φ_{\min}$ , where $Φ_{\min}$ is the minimum value attained by Rosenthal’s potential. This also holds for all subsequent results.

² In fact, the result generalizes directly to “symmetric congestion games for which Rosenthal’s potential is M-convex,” but we are not aware of any other interesting class of congestion games for which this is true (and, therefore, choose to formulate our results in terms of EP congestion games). M-convexity of Rosenthal’s potential already fails to hold for the smallest nonextension parallel network congestion game, which is played on the graph consisting of two subgraphs in series, each consisting of two parallel edges.

³ In fact, no efficient algorithm was known for the approximate sampling and counting of the bases of a matroid.

References

[1] Ackermann H, Röglin H (2008) On the convergence time of the best response dynamics in player-specific congestion games. Preprint, submitted May 8, https://arxiv.org/abs/0805.1130.
[2] Ackermann H, Röglin H, Vöcking B (2008) On the impact of combinatorial structure on congestion games. J. ACM 55(6):1–22.
[3] Anari N, Gharan SO, Vinzant C (2018) Log-concave polynomials, entropy, and a deterministic approximation algorithm for counting bases of matroids. Proc. 59th Annual Sympos. Foundations Comput. Sci., 35–46.
[4] Anari N, Liu K, Gharan SO, Vinzant C (2018) Log-concave polynomials III: Mason’s ultra-log-concavity conjecture for independent sets of matroids. Preprint, submitted July 2, https://arxiv.org/abs/1807.00929.
[5] Anari N, Liu K, Gharan SO, Vinzant C (2019) Log-concave polynomials II: High-dimensional walks and an FPRAS for counting bases of a matroid. Proc. 51st Annual ACM SIGACT Sympos. Theory Comput., 1–12.
[6] Anari N, Liu K, Gharan SO, Vinzant C, Vuong TD (2021) Log-concave polynomials IV: Approximate exchange, tight mixing times, and near-optimal sampling of forests. Proc. 53rd Annual ACM SIGACT Sympos. Theory Comput., 408–420.
[7] Arman A, Gao P, Wormald N (2019) Fast uniform generation of random graphs with given degree sequences. Proc. 60th Annual Sympos. Foundations Comput. Sci., 1371–1379.
[8] Asadpour A, Saberi A (2009) On the inefficiency ratio of stable equilibria in congestion games. Internet and Network Economics, 545–552.
[9] Auletta V, Ferraioli D, Pasquale F, Persiano G (2013) Mixing time and stationary expected social welfare of logit dynamics. Theory Comput. Systems 53(1):3–40.
[10] Auletta V, Ferraioli D, Pasquale F, Persiano G (2018) Metastability of logit dynamics for coordination games. Algorithmica 80(11):3078–3131.
[11] Auletta V, Ferraioli D, Pasquale F, Penna P, Persiano G (2015) Logit dynamics with concurrent updates for local interaction potential games. Algorithmica 73(3):511–546.
[12] Auletta V, Ferraioli D, Pasquale F, Penna P, Persiano G (2016) Convergence to equilibrium of logit dynamics for strategic games. Algorithmica 76(1):110–142.
[13] Bezáková I, Bhatnagar N, Vigoda E (2007) Sampling binary contingency tables with a greedy start. Random Structures Algorithms 30(1–2):168–205.
[14] Blume LE (1993) The statistical mechanics of strategic interaction. Games Econom. Behav. 5(3):387–424.
[15] Bobkov SG, Tetali P (2006) Modified logarithmic Sobolev inequalities in discrete settings. J. Theoretical Probab. 19(2):289–336.
[16] Brändén P, Huh J (2018) Hodge-Riemann relations for Potts model partition functions. Preprint, submitted November 5, https://arxiv.org/abs/1811.01696.
[17] Brändén P, Huh J (2020) Lorentzian polynomials. Ann. Math. 192(3):821–891.
[18] Camerer CF (2010) Behavioural game theory. Behavioural and Experimental Economics, 42–50.
[19] Chien S, Sinclair A (2011) Convergence to approximate Nash equilibria in congestion games. Games Econom. Behav. 71(2):315–327.
[20] Cryan M, Guo H, Mousa G (2019) Modified log-Sobolev inequalities for strongly log-concave distributions. Proc. 60th Annual Sympos. Foundations Comput. Sci., 1358–1370.
[21] Del Pia A, Ferris M, Michini C (2017) Totally unimodular congestion games. Proc. 28th Annual ACM-SIAM Sympos. Discrete Algorithms, 577–588.
[22] Diaconis P, Saloff-Coste L (1996) Logarithmic Sobolev inequalities for finite Markov chains. Ann. Appl. Probab. 6(3):695–750.
[23] Dyer M, Greenhill C, Kleer P, Ross J, Stougie L (2021) Sampling hypergraphs with given degrees. Discrete Math. 344(11):112566.
[24] Even-Dar E, Kesselman A, Mansour Y (2007) Convergence time to Nash equilibrium in load balancing. ACM Trans. Algorithms 3(3):32.
[25] Fabrikant A, Papadimitriou C, Talwar K (2004) The complexity of pure Nash equilibria. Proc. 36th Annual ACM Sympos. Theory Comput., 604–612.
[26] Fanelli A, Moscardelli L (2011) On best response dynamics in weighted congestion games with polynomial delays. Distributed Comput. 24(5):245–254.
[27] Feder T, Mihail M (1992) Balanced matroids. Proc. 24th Annual ACM Sympos. Theory Comput., 26–38.
[28] Ferraioli D (2013) Logit dynamics: A model for bounded rationality. ACM SIGecom Exchanges 12(1):34–37.
[29] Fotakis D (2010) Congestion games with linearly independent paths: Convergence time and price of anarchy. Theory Comput. Systems 47(1):113–136.
[30] Fujishige S, Goemans M, Harks T, Peis B, Zenklusen R (2015) Congestion games viewed from m-convexity. Oper. Res. Lett. 43(3):329–333.
[31] Garey MR, Johnson DS (1990) Computers and Intractability: A Guide to the Theory of NP-Completeness (W.H. Freeman & Co., New York).
[32] Goel S (2004) Modified logarithmic Sobolev inequalities for some models of random walk. Stochastic Processes Appl. 114(1):51–79.
[33] Gourvès L, Monnot J (2009) On strong equilibria in the max cut game. Proc. Fifth Internat. Workshop Internet Network Econom., 608–615.
[34] Gurvits L (2010) On multivariate Newton-like inequalities. Advances in Combinatorial Mathematics, 61–78.
[35] Helgason T (1974) Aspects of the theory of hypermatroids. Hypergraph Seminar (Springer, Berlin/Heidelberg), 191–213.
[36] Hermon J, Salez J (2019) Modified log-Sobolev inequalities for strong-Rayleigh measures. Preprint, submitted February 7, https://arxiv.org/abs/1902.02775.
[37] Holzman R, Law-Yone N (1997) Strong equilibrium in congestion games. Games Econom. Behav. 21(1–2):85–101.
[38] Ieong S, McGrew R, Nudelman E, Shoham Y, Sun Q (2005) Fast and compact: A simple class of congestion games. Proc. 20th Natl. Conf. Artificial Intelligence, 489–494.
[39] Jerrum M, Sinclair A (1993) Polynomial-time approximation algorithms for the Ising model. SIAM J. Comput. 22(5):1087–1116.
[40] Jerrum M, Sinclair A, Vigoda E (2004) A polynomial-time approximation algorithm for the permanent of a matrix with nonnegative entries. J. ACM 51(4):671–697.
[41] Jerrum M, Son JB, Tetali P, Vigoda E (2004) Elementary bounds on Poincaré and log-Sobolev constants for decomposable Markov chains. Ann. Appl. Probab. 14(4):1741–1765.
[42] Kleer P (2021) Sampling from the Gibbs distribution in congestion games. Proc. 22nd ACM Conf. Econom. Comput., 679–680.
[43] Kleer P, Schäfer G (2017) Potential function minimizers of combinatorial congestion games: Efficiency and computation. Proc. 18th ACM Conf. Econom. Comput., 223–240.
[44] Levin DA, Luczak MJ, Peres Y (2010) Glauber dynamics for the mean-field Ising model: Cut-off, critical power law, and metastability. Probab. Theory Related Fields 146(1–2):223–265.
[45] Mamageishvili A, Penna P (2016) Tighter bounds on the inefficiency ratio of stable equilibria in load balancing games. Oper. Res. Lett. 44(5):645–648.
[46] Martin RA, Randall D (2000) Sampling adsorbing staircase walks using a new Markov chain decomposition method. Proc. 41st Annual Sympos. Foundations Comput. Sci., 492–502.
[47] Mäs M, Nax HH (2016) A behavioral study of “noise” in coordination games. J. Econom. Theory 162:195–208.
[48] McFadden D (1973) Conditional logit analysis of qualitative choice behavior. Frontiers in Econometrics, 105–142.
[49] McKay BD (1984) Asymptotics for 0-1 matrices with prescribed line sums. Enumeration and Design (Academic Press, Cambridge, MA), 225–238.
[50] Mihail M, Vazirani U (1989) On the expansion of 0-1 polytopes. J. Combin. Theory Ser. B.
[51] Murota K (1998) Discrete convex analysis. Math. Programming 83(1–3):313–371.
[52] Murota K (2003) Discrete convex analysis. SIAM Monograph on Discrete Mathematics and Applications (Society for Industrial and Applied Mathematics).
[53] Murota K (2009) Recent developments in discrete convex analysis. Research Trends in Combinatorial Optimization, 219–260.
[54] Penna P (2018) The price of anarchy and stability in general noisy best-response dynamics. Internat. J. Game Theory 47(3):839–855.
[55] Rosenthal RW (1973) A class of games possessing pure-strategy Nash equilibria. Internat. J. Game Theory 2:65–67.
[56] Sandholm WH (2010) Population Games and Evolutionary Dynamics (MIT Press, Cambridge, MA).
[57] Schrijver A (2003) Combinatorial optimization: Polyhedra and efficiency. Matroids, Trees, Stable Sets, vol. B, Algorithms and Combinatorics (Springer-Verlag, Berlin).
[58] Sinclair A (1992) Improved bounds for mixing rates of Markov chains and multicommodity flow. Combin. Probab. Comput. 1(4):351–370.
[59] Skopalik A, Vöcking B (2008) Inapproximability of pure Nash equilibria. Proc. 40th Annual ACM Sympos. Theory Comput., 355–364.

Mathematics of Operations Research

Volume 48, Issue 4

November 2023

Pages 1811-2382, C2

Received:August 25, 2021
Accepted:September 10, 2022
Published Online:April 04, 2023

Cite as

Pieter Kleer (2023) Sampling from the Gibbs Distribution in Congestion Games. Mathematics of Operations Research 48(4):1846-1870.

https://doi.org/10.1287/moor.2022.1322

Acknowledgments

The author is grateful to Prasad Tetali for pointing him to Hermon and Salez [36] and to the anonymous reviewers of the 22nd ACM Conference on Economics and Computation and Mathematics of Operations Research for their useful comments. A large part of this work has been carried out while the author was a postdoctoral fellow at the Max Planck Institute for Informatics in Saarbrücken, Germany. An abstract of this work appears in the Proceedings of the 22nd ACM Conference on Economics and Computation (Kleer [42]).
