Open Access

A Particle System with Mean-Field Interaction: Large-Scale Limit of Stationary Distributions

Alexander L. Stolyar
Alexander L. Stolyar
[email protected]
https://orcid.org/0000-0002-1496-9803
Industrial and Enterprise Systems Engineering Department and Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801
Search for more papers by this author

Industrial and Enterprise Systems Engineering Department and Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801

Search for more papers by this author

Published Online:20 Apr 2023https://doi.org/10.1287/stsy.2023.0108

Abstract

We consider a system consisting of n particles, moving forward in jumps on the real line. System state is the empirical distribution of particle locations. Each particle “jumps forward” at some time points, with the instantaneous rate of jumps given by a decreasing function of the particle’s location quantile within the current state (empirical distribution). Previous work on this model established, under certain conditions, the convergence, as $n \to \infty$ , of the system random dynamics to that of a deterministic mean-field model (MFM), which is a solution to an integro-differential equation. Another line of previous work established the existence of MFMs that are traveling waves, as well as the attraction of MFM trajectories to traveling waves. The main results of this paper are: (a) We prove that, as $n \to \infty$ , the stationary distributions of (recentered) states concentrate on a (recentered) traveling wave; (b) we obtain a uniform across n moment bound on the stationary distributions of (recentered) states; and (c) we prove a convergence-to-MFM result, which is substantially more general than that in previous work. Results (b) and (c) serve as “ingredients” of the proof of (a), but also are of independent interest.

1. Introduction

We consider a system consisting of n particles, moving forward on the real line. The particles move in jumps. The system state at a given time is the current empirical distribution of particle locations. Each particle gets “urges to jump” as an independent Poisson process of constant rate. However, a particle getting a jump urge actually jumps with the probability given by a decreasing function of the particle’s location quantile within the current state (i.e., empirical distribution); hence, this is a mean-field type of particles’ interaction with each other. When a particle does jump, the jump size is independent, distributed as a random variable Z > 0. We are interested in the system behavior when n is large.

This model was introduced in Greenberg et al. (1995, 1996) as an idealized model of distributed parallel simulation. In this case, n particles represent n processors (“sites”) simulating different part of some large system, and a particle location is the current “local simulation time” of the corresponding processor. The following types of questions are of interest, as n becomes large: how the local times of the processors progress over time; whether local times “stay closely together”; whether the evolution of the empirical distribution of local times becomes that of a traveling wave; etc. This model, and similar models, are motivated by other applications as well (including recent applications, such as blockchains), where, roughly speaking, a distributed synchronization of a large number of sites is of interest; cf. Manita and Shcherbakov (2005), Malyshev and Manita (2006), Manita (2006, 2009, 2014), Malyshkin (2006), Balazs et al. (2014), and references therein for examples of synchronization models.

There are two lines of work on the particle system described above. The first one is in Greenberg et al. (1995), where it is shown that, under certain additional conditions, as $n \to \infty$ , the system random dynamics converges to that of a deterministic mean-field model (MFM), which is a solution to an integro-differential equation. There are several additional assumptions made in Greenberg et al. (1995), one of which is especially restrictive, in that the proof technique crucially relies on it—a particle jump probability depends on the current locations of K other particles chosen uniformly at random, where K > 0 is a fixed parameter, same for all n. We will refer to this additional assumption as the finite-dependence assumption, and it substantially restricts the more general model of this paper.

The second line of work is represented by Greenberg et al. (1996) and Stolyar (2023), where formally defined mean-field models (solutions to an integro-differential equation) are studied. The results of Greenberg et al. (1996) prove, in particular, that if an MFM that is a traveling wave exist, then “typically,” the traveling wave is unique, and MFM trajectories are attracted to it as time increases; the question of a traveling-wave existence under general assumptions was left open in Greenberg et al. (1996). Stolyar (2023) proves the existence of a traveling wave, under very general assumptions. (We also note that, for some MFMs that are different from the one in this paper, the existence and explicit forms of the traveling waves are obtained in Balazs et al. 2014, Hongler 2015, and Hongler and Filliger 2019, in some special cases of the jump-size distribution.)

The combination of the results of Greenberg et al. (1995, 1996) and Stolyar (2023) strongly suggests the following asymptotic property of the stationary distributions: As $n \to \infty$ , the stationary distributions of (recentered) states concentrate on a (recentered) traveling wave. This property is stated as conjecture 7.1 in Stolyar (2023, section 7).

Our main results are:

(a) We prove (Theorem 1) that, as $n \to \infty$ , the stationary distributions of (recentered) states concentrate on a (recentered) traveling wave. (This proves conjecture 7.1 in Stolyar (2023, section 7), under a slightly stronger assumption on the jump size, namely, $E Z^{2 + χ} < \infty, χ > 0$ , as opposed to $E Z^{2} < \infty$ .)
(b) As a key “ingredient” of the proof of (a), we obtain (Theorem 2) a uniform across n moment bound on the stationary distributions of (recentered) states. This result is also of independent interest.
(c) We prove (Theorem 3) the convergence-to-MFM result in our general setting, without the additional finite-dependence assumption. (This substantially generalizes the result of Greenberg et al. 1995. The proof largely follows the approach used in Balazs et al. 2014, for a different model. The approach is more generic than that in Greenberg et al. 1995; in particular, it does not rely on the finite-dependence assumption.) This result is another ingredient of the proof of (a), but is also of independent interest.

The proof of (a) relies on (b) and (c) and on the results in Greenberg et al. (1996) and Stolyar (2023) on the existence/uniqueness of—and attraction to—traveling waves.

1.1. Outline of the Rest of the Paper

In Section 2, we formally define the model and informally state the main results. Section 3 gives some basic notation, definitions, and conventions used throughout the paper. In Section 4, we state our main results, Theorems 1, 2, and 3, formally. Sections 5, 6, and 7 contain the proofs of Theorems 2, 3, and 1, respectively. A brief discussion of our results is in Section 8.

2. The Model and Main Results

The particle system model is as follows. There are n particles, moving in the positive direction (“right”) on the real axis $R$ . Each particle moves in jumps, as follows. For each particle, there is an independent Poisson process of rate $μ > 0$ of “jump urges.” When a particle gets an urge to jump, it actually jumps, to the right, with probability $η_{n} (ν)$ , where ν is its quantile in the current empirical distribution of the particles’ locations; that is, $ν = ℓ / n$ when the particle location is $ℓ$ -th from the left. With complementary probability $1 - η_{n} (ν)$ , the particle does not jump. To have the model well-defined, assume that quantile-ties between colocated particles are broken uniformly at random. We adopt the convention that function $η_{n} (ν)$ is defined for a continuous argument $ν \in [0, 1]$ , by assuming that it is constant in each interval $((ℓ - 1) / n, ℓ / n]$ for $ℓ = 1, \dots, n$ and $η_{n} (0) = 1$ . Assume that, for each n, function $η_{n} (ν), 0 \leq ν \leq 1,$ is nonincreasing and that, as $n \to \infty$ , it uniformly converges to a continuous, strictly decreasing function $η (ν), 0 \leq ν \leq 1,$ with $η (0) = 1, η (1) = 0$ . The jump sizes, when a particle does jump, are given by independent and identically distributed (i.i.d.) nonnegative random variables (r.v.) with cumulative distribution function (CDF) $J (y), y \geq 0$ ; we denote by $\bar{J} (y) = 1 - J (y)$ the complementary CDF; a generic jump size is given by the r.v. $Z \geq 0$ . Without loss of generality, we can and will assume that Z > 0—that is, $J (0) = 0$ . We will use notation

m^{(ℓ)} ≐ \int_{0}^{\infty} y^{ℓ} d J (y) = E Z^{ℓ}, ℓ \geq 0,

(1)

for the

ℓ

-th moment of a jump size. (So,

m^{(1)} = E Z

is the mean jump size.)

For some (not all) of our results, we will need the following two additional conditions on the jump-size distribution:

m^{(2 + χ)} < \infty for some χ > 0,

(2)

and

J (\cdot) is absolutely continuous with density J' (y), bounded away from 0 on compact subsets of R_{+} .

(3)

Note that, without loss of generality (WLOG), we can and do assume μ = 1; otherwise, we can achieve this condition by rescaling time. Also, if $m^{(1)} = E Z < \infty$ , we can and do assume, WLOG, that $m^{(1)} = 1$ ; otherwise, $m^{(1)} = 1$ is achieved by rescaling space.

Let $f^{n} (t) = (f_{x}^{n} (t), x \in R)$ be the (random) empirical distribution of the particle locations at time t; namely, $f_{x}^{n} (t)$ is the fraction of particles located in $(- \infty, x]$ at time t.

As $n \to \infty$ , it is very intuitive that $f_{x}^{n} (t)$ converges (in appropriate sense, under appropriate conditions) to a deterministic function $f_{x} (t)$ , such that $f (t) = (f_{x} (t), x \in R)$ is a distribution function for each t, and the following equation holds:

\frac{\partial}{\partial t} f_{x} (t) = - \int_{- \infty}^{x} d_{y} f (y, t) η (f_{y} (t)) \bar{J} (x - y),

(4)

where d_y means the differential in y (and recall that μ = 1, WLOG). We call a function

f_{x} (t)

satisfying (4) a mean-field model. (The formal meaning of (4) and the definition of a mean-field model will be given in Definition 1 in Section 6.3.) The intuition for (4) is as follows. For each t, the distribution

f (t) = (f_{x} (t), x \in R)

approximates the distribution of particles

f^{n} (t)

when n is large. Because particles move right,

f_{x} (t)

is nonincreasing in t for each x. So, the partial derivative

(\partial / \partial t) f_{x} (t)

is nonpositive, and it should be equal to the right-hand side (RHS) of (4), which gives the instantaneous rate (scaled by

1 / n

and taken with minus sign) at which particles jump over point x at time t.

It is known (Greenberg et al. 1996, Stolyar 2023) that, as long as $m^{(1)} < \infty$ (or, $m^{(1)} = 1$ , WLOG), for any mean-field model, the speed at which the mean $\int x d_{x} f_{x} (t)$ of the distribution f(t) moves right, is equal to $v = m^{(1)} μ \int_{0}^{1} η (ν) d ν = \int_{0}^{1} η (ν) d ν$ . A distribution function $ϕ = (ϕ_{x}, x \in R)$ is called a traveling wave shape (TWS) if $f_{x} (t) = ϕ_{x - v t}$ is a mean-field model. By substituting into (4), we see that any TWS $ϕ$ must satisfy equation

v ϕ_{x}^{'} = \int_{- \infty}^{x} ϕ_{y}^{'} η (ϕ_{y}) \bar{J} (x - y) d y .

(5)

It is known (Stolyar 2023) that a TWS $ϕ$ exists as long as $m^{(2)} < \infty$ (no other assumption on the jump-size distribution is required), and it is unique (up to a shift) if, in addition, (3) holds.

We now informally state the main results of this paper. (The formal results will be given later, in Section 4, after introducing more notation.) Let $\overset{°}{f}^{n} (t)$ denote the distribution $f^{n} (t)$ , recentered so that the distribution mean is at 0.

Informal Statement of Theorem 1

(in Section 4). Assume (2) and (3). Then, as $n \to \infty$ , the stationary distribution of the process $\overset{°}{f}^{n} (\cdot)$ converges to the (Dirac) distribution concentrated on the unique TWS $ϕ$ , centered so that its mean is at zero. Moreover, $ϕ$ has finite absolute $(1 + χ)$ -th moment: $\int_{- \infty}^{\infty} | y |^{1 + χ} d ϕ_{y} < \infty$ .

Informal Statement of Theorem 2

(in Section 4). Assume (2) and (3). Then, for all sufficiently large n, the Markov process $\overset{°}{f}^{n} (\cdot)$ is stable (positive Harris recurrent), and its stationary distribution is such that the expected absolute $(1 + χ)$ -th moment of $\overset{°}{f}^{n} (\cdot)$ is bounded uniformly in n:

E \int_{- \infty}^{\infty} | y |^{1 + χ} d_{y} \overset{°}{f_{y}^{n}} (t) \leq \bar{C} .

Informal Statement of Theorem 3

(in Section 4). Assume $m^{(1)} < \infty$ (or, $m^{(1)} = 1$ , WLOG) and (3). Suppose that, as $n \to \infty$ , the initial conditions $f^{n} (0)$ converge to a deterministic proper distribution f(0). (Nothing else about f(0) is assumed, not even the existence of a finite mean.) Then, the process $f^{n} (\cdot)$ converges to the unique mean-field model $f (\cdot)$ with initial condition f(0).

3. Basic Notation

The set of real numbers is denoted by $R$ and is viewed as the usual Euclidean space. As a measurable space, $R$ is endowed with Borel σ-algebra. For scalar functions h(x) of a real x: ${‖ h ‖}_{1} = \int_{x} | h (x) | d x$ is L₁-norm; h(x) is called c-Lipschitz if it is Lipschitz with constant $c \geq 0$ . Let $C_{b}$ be the set of continuous bounded functions on $R$ , which are constant outside a closed interval (one constant value to the “left” of it and possibly another constant value to the right of it.)

For functions h(x) of a real x: $h (x +)$ and $h (x -)$ are the right and left limits; a function h(x) is RCLL if it is right-continuous and has left limits at each x.

A function h of x may be written as either h(x) or h_x. Notation $d_{x} h (x, t)$ for a multivariate function h(x, t), where $x \in R$ , means the differential in x.

Denote by $M$ the set of scalar RCLL nondecreasing functions $f = (f (x), x \in R)$ , which are (proper) probability distribution functions—that is, such that $f (x) \in [0, 1], \lim_{x ↓ - \infty} f (x) = 0$ and $\lim_{x ↑ \infty} f (x) = 1$ . For elements $f \in M$ , we use the terms distribution function and distribution interchangeably. Space $M$ is endowed with the Levy-Prohorov metric (cf. Ethier and Kurtz 1986) and the corresponding topology of weak convergence (which is equivalent to the convergence at every point of continuity of the limit); the weak convergence in $M$ is denoted $\overset{w}{\to}$ . Note that, for $f, ϕ \in M$ , the L₁-norm of their difference, ${‖ f - ϕ ‖}_{1}$ , is equal to the Wasserstein W₁-distance between the corresponding two distributions. The inverse (ν-th quantile) of $f \in M$ is $f^{- 1} (ν) ≐ \inf {y | f (y) \geq ν}, ν \in [0, 1]$ ; $γ^{- 1} (1) = \infty$ when f(y) < 1 for all y.

Unless explicitly specified otherwise, we use the following conventions regarding random elements and random processes. A measurable space is considered equipped with a Borel σ-algebra, induced by the metric, which is clear from the context. A random process $Y (t), t \geq 0,$ always takes values in a complete separable metric space (clear from the context) and has RCLL sample paths. For a random process $Y (t), t \geq 0,$ we denote by $Y (\infty)$ the random value of Y(t) in a stationary regime (which will be clear from the context). Symbol $\Rightarrow$ signifies convergence of random elements in distribution; $\overset{P}{\to}$ means convergence in probability. W.p.1 or a.s. means with probability one. For a condition/event A, $I {A} = 1$ if A holds, and $I {A} = 0$ otherwise.

Space $D ([0, \infty), R)$ (respectively (resp.) $D ([0, \infty), M)$ ) is the Skorohod space of RCLL functions on $[0, \infty)$ taking values in $R$ (resp. $M$ ), with the corresponding Skorohod (J₁) metric and topology (cf. Ethier and Kurtz 1986); $\overset{J_{1}}{\to}$ denotes the convergence in these spaces.

For a distribution $f \in M$ and scalar function $h (x), x \in R$ , $f h ≐ \int_{R} h (x) d f_{x}$ .

For scalar functions $h (x), x \in X$ , with some domain $X, ‖ h ‖ = \sup_{x \in X} | h (x) |$ is the sup-norm. When $G_{k}, G$ are operators mapping the space of such functions into itself, $\lim G_{k} h = G h$ and $G_{k} h \to G h$ mean the uniform convergence: $‖ G_{k} h - G h ‖ \to 0$ .

Suppose we have a Markov process with state space $X$ and transition function $P^{t} (x, H),$ $t \geq 0$ . A measurable set $X \subseteq X$ is called small (cf. Bramson 2008) if there exist constants T > 0 and $δ > 0$ and probability distribution $α (\cdot)$ on $X$ , such that for any $x \in X$ and any measurable $H \subseteq X, P^{T} (x, H) \geq δ α (H)$ . P^t, as an operator, is $P^{t} h (x) ≐ \int_{y} P^{t} (x, d y) h (y)$ , where h is a scalar function with domain $X$ ; $I = P^{0}$ is the identity operator. The process (infinitesimal) generator B is

B h ≐ \lim_{t ↓ 0} (1 / t) [P^{t} - I] h .

Function h is within the domain of the generator B if Bh is well-defined. We say that a Markov process is stable if it is positive Harris recurrent (cf. Bramson 2008); if it is, it has unique stationary distribution.

For real numbers a and b, we use notations: $sign (a) ≐ I (a \geq 0) - I (a < 0)$ , $a \land b ≐ \min {a, b}$ . RHS and LHS mean right-hand side and left-hand side, respectively. Abbreviation w.r.t. means with respect to; a.e. means almost everywhere w.r.t. Lebesgue measure.

4. Formal Statements of Main Results

4.1. Uniform Moment Bound for Stationary Distributions. Limit of Stationary Distributions

For an element $f \in M$ , denote by $\bar{f}$ the mean of the corresponding distribution,

\bar{f} = \int_{- \infty}^{\infty} x d f_{x} = \int_{0}^{\infty} (1 - f_{x}) d x - \int_{- \infty}^{0} f_{x} d x,

with the usual convention that the mean is well-defined and finite when both integrals in the RHS are finite. Denote

\overset{°}{M} = {f \in M | \bar{f} = 0} .

When $\bar{f}$ is finite, denote by $\overset{°}{f} = (\overset{°}{f_{x}}, x \in R) \in \overset{°}{M}$ the centered version of f, namely,

\overset{°}{f_{x}} = f_{x + \bar{f}}, x \in R .

If we denote by $M^{(n)} \subset M$ the state space of the process $f^{n} (\cdot)$ , then ${\overset{°}{M}}^{(n)} = M^{(n)} \cap \overset{°}{M}$ is the state space of $\overset{°}{f}^{n} (\cdot)$ . If we use notation

Φ_{ℓ} (f) = \int_{- \infty}^{\infty} | x |^{ℓ} d f_{x},

for the

ℓ

-th absolute moment of f, then we obviously have

Φ_{ℓ} (f) < \infty, \forall f \in M^{(n)}, \forall n, \forall ℓ \geq 0 .

Theorem 1.

Suppose Conditions (2) and (3) hold. Then, as $n \to \infty$ ,

\overset{°}{f}^{n} (\infty) \Rightarrow ϕ,

(6)

where

ϕ

is the unique TWS with

\bar{ϕ} = 0

. Moreover,

ϕ

is such that

Φ_{1 + χ} (ϕ) < \infty,

(7)

and a stronger form of convergence (6) holds:

{‖ \overset{°}{f}^{n} (\infty) - ϕ ‖}_{1} \Rightarrow 0 .

(8)

The proof of Theorem 1 is in Section 7.

Theorem 2.

Suppose Conditions (2) and (3) hold. Then, there exist $\bar{C} > 0$ and $\bar{n}$ such that, for all $n \geq \bar{n}$ , the Markov process $\overset{°}{f}^{n} (\cdot)$ is stable, and we have

E Φ_{1 + χ} (\overset{°}{f}^{n} (\infty)) \leq \bar{C} .

(9)

The proof of Theorem 2 is in Section 5.

4.2. Transient Behavior: Convergence to a Mean-Field Model

Essentially all our asymptotic results on the transient behavior of the processes are for the noncentered processes $f^{n} (\cdot)$ . The result for the centered processes (Theorem 3(ii)) is obtained essentially as a corollary.

Denote by $L^{(n)}$ the generator of the process $f^{n} (\cdot)$ . For any $h \in C_{b}$ , function $f^{n} h$ of fⁿ is within the domain of $L^{(n)}$ (where we use the fact each function in $C_{b}$ is constant outside a closed interval), and

L^{(n)} [f^{n} h] = \int_{0}^{1} d ν {[f^{n}]}^{- 1} (ν) η_{n} ({[f^{n}]}^{- 1} (ν)) E [h ({[f^{n}]}^{- 1} (ν) + Z) - h ({[f^{n}]}^{- 1} (ν))],

where the expectation is over the distribution of the random jump size Z. We also formally define the “limit” of

L^{(n)}

L [f h] = \int_{0}^{1} d ν f^{- 1} (ν) η (f^{- 1} (ν)) E [h (f^{- 1} (ν) + Z) - h (f^{- 1} (ν))] .

Theorem 3.

Suppose $f^{n} (0) \overset{w}{\to} f (0)$ , where ${f^{n} (0)}$ is the deterministic sequence of $f^{n} (0) \in M^{(n)}$ , and $f (0) \in M$ . (Note that we do not assume that f(0) has well-defined mean $\bar{f} (0)$ .) Assume that conditions $m^{(1)} = E Z < \infty$ (or $m^{(1)} = E Z = 1$ , WLOG) and (3) hold. Then, we have:

(i) $f^{n} (\cdot) \Rightarrow f (\cdot)$ in $D ([0, \infty), M)$ , where $f (\cdot) \in D ([0, \infty), M)$ is deterministic, uniquely determined by f(0). Moreover, $f (\cdot)$ is a continuous element of $D ([0, \infty), M)$ , which satisfies
$f (t) h - f (0) h - \int_{0}^{t} L f (s) hds = 0, \forall h \in C_{b}, \forall t \geq 0 .$ (10)
The dependence of $f (\cdot)$ (as an element of space $D ([0, \infty), M)$ with J₁-convergence topology) on f(0) (as an element of $M$ with weak convergence topology) is continuous.
(ii) If, in addition, $\bar{f} (0) = 0$ , then $\bar{f} (t) = v t$ , and, consequently, $\overset{°}{f} (\cdot)$ is a continuous element of $D ([0, \infty), M)$ , uniquely determined by $\overset{°}{f} (0) = f (0) \in \overset{°}{M}$ , with $\overset{°}{f} (t) \in \overset{°}{M}$ for all $t \geq 0$ . The dependence of $\overset{°}{f} (\cdot)$ on $\overset{°}{f} (0)$ is continuous. And if, in addition, ${\bar{f}}^{n} (0) = 0$ for all n, then $\overset{°}{f}^{n} (\cdot) \Rightarrow \overset{°}{f} (\cdot)$ in $D ([0, \infty), M)$ .

The proof of Theorem 3 is in Section 6. Also in Section 6, we show (Theorem 8) that solutions to (10) are exactly the mean-field models (Definition 1). We note that many of the supplementary results in Section 6, which may be of independent interest, require assumptions on the jump-size distribution that are much weaker than conditions $m^{(1)} = E Z < \infty$ and (3). In particular, some of those results assume nothing about the jump-size distribution besides it being a proper distribution.

5. Proof of Theorem 2

5.1. Equivalent View of Process $\overset{°}{f}^{n} (\cdot)$

State $\overset{°}{f}^{n} (t)$ can be equivalently described as $w^{n} (t) = (w_{1} (t), w_{2} (t), \dots, w_{n} (t)) \in R^{n}$ , where $w_{1} (t), w_{2} (t), \dots, w_{n} (t)$ are the locations of the n particles w.r.t. the mean ${\bar{f}}^{n} (t)$ , listed in a nondecreasing order. (So, the average $(1 / n) \sum_{i} w_{i} (t) = 0$ at all times.) From now on, for each $\overset{°}{f}^{n} \in {\overset{°}{M}}^{(n)}$ , we will consider the corresponding vector $w^{n} = (w_{1}, w_{2}, \dots, w_{n})$ , and vice versa. Any function of $\overset{°}{f}^{n} \in {\overset{°}{M}}^{(n)}$ may be expressed via the corresponding wⁿ, and vice versa. In particular,

Φ_{ℓ} (\overset{°}{f}^{n} (t)) = \frac{1}{n} \sum_{i = 1}^{n} | w_{i} (t) |^{ℓ} .

Note that the topology on ${\overset{°}{M}}^{(n)}$ , induced by the (weak convergence) topology on $M$ , is equivalent to the usual topology of component-wise convergence of the corresponding vectors wⁿ.

The evolution of $w^{n} (t)$ is as follows. Between the times of the jump urges, $w^{n} (t)$ remains constant. At a time t of a jump urge, the following occurs. Let $κ_{i} (t)$ be the actual jumps size of particle i, in the system without recentering, upon this urge; $κ_{i} (t) \geq 0$ and can be nonzero for at most one particle. Then, in the recentered system, particle i jump size (i.e., the increment of $w_{i} (t)$ ) at t is $ζ_{i} = κ_{i} (t) - \sum_{s} κ_{s} (t) / n$ (which may be positive or negative). After the jumps (if any) at t occur, the particles indices i are changed, if necessary, to keep $w_{i} (t)$ nondecreasing in i.

5.2. Informal Intuition for the Proof

The proof of stability, in Subsection 5.3, uses fluid limit technique and is fairly straightforward. Let us discuss the intuition for the proof of the bound (9) in Subsection 5.4.

At a very high level, the bound (9) is due to the fundamental property of the system, which can be called the “egalitarian trend”: In a recentered system, the particles at high quantiles (large i) will have a negative drift, whereas particles at low quantiles (small i) will have a positive drift, thus preventing the centered empirical distribution $\overset{°}{f}^{n}$ from “spreading out.”

To obtain the bound on the expected $(1 + χ)$ -th moment of $\overset{°}{f}^{n}$ , we need the finite $(2 + χ)$ -th moment on a jump size. Informally speaking, we use $Φ_{2 + χ} (\overset{°}{f}^{n})$ as a Lyapunov function. If $B^{(n)}$ is the generator of process $\overset{°}{f}^{n} (\cdot)$ , then, “generally speaking,” we have $E B^{(n)} Φ_{2 + χ} (\overset{°}{f}^{n} (\infty)) = 0$ ; in other words, the expected drift of $Φ_{2 + χ} (\overset{°}{f}^{n} (t))$ in steady-state is zero. Function

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) = (2 + χ) \sum_{i} sign (w_{i}) | w_{i} |^{1 + χ} E ζ_{i}^{(\overset{°}{f}^{n})}, \overset{°}{f}^{n} \in {\overset{°}{M}}^{(n)},

where

ζ_{i}^{(\overset{°}{f}^{n})}

is the random jump size of particle i (in recentered system) upon a jump urge when the state is

\overset{°}{f}^{n}

, can be thought of as the “first-order approximation of the generator

B^{(n)}

, applied to function

Φ_{2 + χ} (\overset{°}{f}^{n})

”; note that the derivative

| w_{i}^{2 + χ} |' = (2 + χ) sign (w_{i}) w_{i}^{1 + χ}

. We can show that, when

Φ_{1 + χ} (\overset{°}{f}^{n})

is large,

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) \leq - ϵ Φ_{1 + χ} (\overset{°}{f}^{n}),

for some constant

ϵ > 0

; this is where we use the egalitarian trend property, which ensures that

E ζ_{i}^{(\overset{°}{f}^{n})}

is negative (resp., positive) for particles at high (resp., low) quantiles. From here, we can obtain, informally speaking,

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) \leq - ϵ Φ_{1 + χ} (\overset{°}{f}^{n}) + K,

which holds for some K > 0 and all

\overset{°}{f}^{n}

. Taking into account the fact that

G_{1 + χ}^{(n)} (\overset{°}{f}^{n})

is not

B^{(n)}

, but only its first-order approximation, and doing the corresponding estimates, we obtain, informally speaking,

B^{(n)} Φ_{2 + χ} (\overset{°}{f}^{n}) \leq - (ϵ / 2) Φ_{1 + χ} (\overset{°}{f}^{n}) + K .

Taking expectation w.r.t. $\overset{°}{f}^{n} (\infty)$ , we obtain, informally speaking,

0 \leq - (ϵ / 2) E Φ_{1 + χ} (\overset{°}{f}^{n} (\infty)) + K,

which yields (9).

In the actual proof, instead of $Φ_{2 + χ} (\overset{°}{f}^{n})$ , we use its truncated version $Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n}) = Φ_{2 + χ} (\overset{°}{f}^{n}) \land C_{1}$ as the Lyapunov function because the latter is certainly within the domain of generator $B^{(n)}$ . And then let $C_{1} ↑ \infty$ .

We note that this “program” for proving a property of the type of (9) is likely applicable to other models having the egalitarian trend property, though the technical details may differ.

5.3. Stability

The stability (positive Harris recurrence) is easily established using the fluid-limit technique (Rybko and Stolyar 1992, Dai 1995, Stolyar 1995, Bramson 2008). We note that the stability proof only uses Conditions (3) and $m^{(1)} < \infty$ (as opposed to stronger Condition (2)).

To prove stability, we will use the equivalent representation $w^{n} (\cdot)$ of the process $\overset{°}{f}^{n} (\cdot)$ , given in Subsection 5.1. The state space of $w^{n} (\cdot)$ is

{\overset{°}{W}}^{(n)} = {w^{n} = (w_{1}, \dots, w_{n}) \in R^{n} | (1 / n) \sum_{i} w_{i} = 0},

that is, the set of those vectors in

R^{n}

, corresponding to

\overset{°}{f}^{n} \in {\overset{°}{M}}^{(n)}

. The norm of a state

w^{n} = (w_{1}, \dots, w_{n}) \in {\overset{°}{W}}^{(n)}

‖ w^{n} ‖ = \max_{i} | w_{i} |

. Using (3), it is straightforward to see that, for any fixed a > 0, the closed set

{\overset{°}{W}}^{(n)} (a) = {w^{n} \in {\overset{°}{W}}^{(n)} ‖ | w^{n} ‖ \leq a}

is small (see the definition in Section 3).

Pick any two numbers, $0 < ν_{1} < ν_{2} < 1$ , and choose $\bar{n}$ large enough so that for any $n \geq \bar{n}$ , $η_{n} (ν_{1}) - η_{n} (ν_{2}) > (η (ν_{1}) - η (ν_{2})) / 2$ . (All we need here is that the jump probabilities of the “left-most” and “right-most” particle are separated by a positive constant.) Consider any fixed $n \geq \bar{n}$ .

Consider a sequence of versions of the process $w^{n} (\cdot)$ , namely, processes $w^{n, k} (\cdot)$ with increasing norm of the initial state, $‖ w^{n, k} (0) ‖ = c_{k} ↑ \infty, k \to \infty$ , with $w^{n, k} (0) / c_{k} \to w (0) = (w_{1} (0), \dots, w_{n} (0))$ . Given that ${\overset{°}{W}}^{(n)} (a)$ is a closed small set for any a > 0, to establish stability, it suffices to show that, for some fixed T > 0, $‖ w^{n, k} (T) ‖ / c_{k} \Rightarrow 0$ . This, in turn, follows from the fact that the limit (in appropriate sense) $w (\cdot) = (w_{1} (\cdot), \dots, w_{n} (\cdot))$ of the sequence of processes $w^{n, k} (c_{k} t) / c_{k}, t \geq 0,$ has trajectories such that $w (t) \in {\overset{°}{W}}^{(n)}$ for all $t \geq 0$ , $‖ w (0) ‖ = 1$ , and $(d / d t) \max w_{i} (t) \leq - ϵ < 0$ , as long as $\max w_{i} (t) > 0$ ; therefore, $‖ w (t) ‖ = 0$ for all $t \geq 1 / ϵ$ . We omit further details, which are straightforward. □

5.4. Proof of (9)

At some level, this proof is similar to the proof of an analogous result in Stolyar (2022) for a different particle system. However, the difference of our model from that in Stolyar (2022) is substantial, so we give full details of the proof for our model.

Consider the following function

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) = (2 + χ) \sum_{i} sign (w_{i}) | w_{i} |^{1 + χ} E ζ_{i}^{(\overset{°}{f}^{n})}, \overset{°}{f}^{n} \in {\overset{°}{M}}^{(n)},

where

ζ_{i}^{(\overset{°}{f}^{n})}

is the random jump size (which can have any sign) of particle i upon a jump urge when the state is

\overset{°}{f}^{n}

. (The sizes

ζ_{i}^{(\overset{°}{f}^{n})}

are dependent across i, of course.)

As we will see, function $G_{1 + χ}^{(n)} (\overset{°}{f}^{n})$ can be thought of as the “first-order approximation of the generator $B^{(n)}$ of process $\overset{°}{f}^{n} (\cdot)$ , applied to function $Φ_{2 + χ} (\overset{°}{f}^{n})$ ”; but we do not even claim that $Φ_{2 + χ} (\overset{°}{f}^{n})$ is within the generator $B^{(n)}$ domain. Note that, for each n, $G_{1 + χ}^{(n)} (\overset{°}{f}^{n})$ is continuous in $\overset{°}{f}^{n} \in {\overset{°}{M}}^{(n)}$ .

Note that each $E ζ_{i}^{(\overset{°}{f}^{n})}$ is the quantity of the order $O (1 / n)$ , which motivates the definition

{\bar{ζ}}_{i}^{(\overset{°}{f}^{n})} ≐ n E ζ_{i}^{(\overset{°}{f}^{n})} .

(11)

We observe that, if particle i location w_i is the $ℓ$ -th from the left, and it is not colocated with any other particle, then

{\bar{ζ}}_{i}^{(\overset{°}{f}^{n})} = η_{n} (ℓ / n) - v_{n},

where

v_{n} ≐ \sum_{ℓ} η_{n} (ℓ / n) \equiv \int_{0}^{1} η_{n} (ν) d ν,

(12)

is the average drift of the mean of the noncentered particle system. (Clearly,

\lim_{n} v_{n} = v

.) In the more general case, when exactly k particles are colocated particles—namely,

ℓ

-th,

(ℓ + 1)

-th, …,

(ℓ + k - 1)

-th left-most particles are colocated—and particle i is one of them, we have

{\bar{ζ}}_{i}^{(\overset{°}{f}^{n})} = \frac{η_{n} (ℓ / n) + \dots + η_{n} ((ℓ + k - 1) / n)}{k} - v_{n} .

(13)

We define the function ${\bar{ζ}}^{(\overset{°}{f}^{n})} (x), x \in R,$ as follows: ${\bar{ζ}}^{(\overset{°}{f}^{n})} (x) = {\bar{ζ}}_{i}^{(\overset{°}{f}^{n})}$ , where i is the particle whose location w_i is the closest to x on the left; we also adopt a convention that, if w_i is the location of the left-most particle, then ${\bar{ζ}}^{(\overset{°}{f}^{n})} (x) = {\bar{ζ}}_{i}^{(\overset{°}{f}^{n})}$ for all $x < w_{i}$ . Clearly, function ${\bar{ζ}}^{(\overset{°}{f}^{n})} (x)$ is a piece-wise constant nonincreasing function.

We can write:

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) = (2 + χ) \int_{- \infty}^{\infty} {\bar{ζ}}^{(\overset{°}{f}^{n})} (x) sign (x) | x |^{1 + χ} d \overset{°}{f_{x}^{n}},

(14)

or, by “integrating over the values ν of

\overset{°}{f_{x}^{n}}

” and using (13),

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) = (2 + χ) \int_{0}^{1} d ν [η_{n} (ν) - v_{n}] sign ({[\overset{°}{f}^{n}]}^{- 1} (ν)) | {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ} \leq 0 .

(15)

The inequality in (15) follows by the following argument. Denote

ν_{0}^{n} ≐ \inf {ν | η_{n} (ν) < v_{n}},

and observe that, as

n \to \infty

\lim_{n \to \infty} ν_{0}^{n} = ν_{0}, where η (ν_{0}) = v .

To be specific, consider the case when ${[\overset{°}{f}^{n}]}^{- 1} (ν_{0}^{n}) \leq 0$ . (The case ${[\overset{°}{f}^{n}]}^{- 1} (ν_{0}^{n}) \geq 0$ is treated analogously.) We have $ν_{*}^{n} ≐ \overset{°}{f_{0}^{n}} \geq ν_{0}^{n}$ . Note that

\int_{0}^{ν_{0}^{n}} [η_{n} (ν) - v_{n}] d ν = - \int_{ν_{0}^{n}}^{1} [η_{n} (ν) - v_{n}] d ν .

Then, we have

\begin{array}{l} \int_{ν_{*}^{n}}^{1} d ν [η_{n} (ν) - v_{n}] sign ({[\overset{°}{f}^{n}]}^{- 1} (ν)) | {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ} = \\ \int_{ν_{*}^{n}}^{1} d ν [η_{n} (ν) - v_{n}] | {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ} \leq 0, \end{array}

and

\begin{array}{l} \int_{0}^{ν_{*}^{n}} d ν [η_{n} (ν) - v_{n}] sign ({[\overset{°}{f}^{n}]}^{- 1} (ν)) | {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ} = \\ - \int_{0}^{ν_{*}^{n}} d ν [η_{n} (ν) - v_{n}] | {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ} = \\ - \int_{0}^{ν_{0}^{n}} d ν [η_{n} (ν) - v_{n}] | {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ} - \int_{ν_{0}^{n}}^{ν_{*}^{n}} d ν [η_{n} (ν) - v_{n}] | {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ} \leq 0, \end{array}

where the last inequality holds because

| {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ}

is nonincreasing in

[0, ν_{*}^{n}]

. Thus, (15) is proved.

Next, we claim the following property: There exists a sufficiently large C > 0 and some $ϵ > 0$ , such that, uniformly in all sufficiently large n and all $\overset{°}{f}^{n} \in {\overset{°}{M}}^{(n)}$ with $Φ_{1 + χ} (\overset{°}{f}^{n}) \geq C$ ,

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) \leq - ϵ Φ_{1 + χ} (\overset{°}{f}^{n}) .

(16)

The proof of (16) is given in Section 5.5.

From (16) and (15), we obtain that, uniformly in all sufficiently large n,

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) \leq - ϵ Φ_{1 + χ} (\overset{°}{f}^{n}) + ϵ C .

(17)

Denote by $Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n}) = Φ_{2 + χ} (\overset{°}{f}^{n}) \land C_{1}$ the function $Φ_{2 + χ}$ truncated at level $C_{1} > 0$ . Given that this is a continuous bounded function of $\overset{°}{f}^{n} \in {\overset{°}{M}}^{(n)}$ , and the process of jump urges is Poisson, it is not hard to see that $Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n})$ is within the domain of the generator $B^{(n)}$ of process $\overset{°}{f}^{n} (\cdot)$ .

Next, we claim the following fact: There exists $C_{2} > 0$ such that for any fixed $C_{1} > 0$ , uniformly in all large n and $\overset{°}{f}^{n}$ such that $Φ_{2 + χ} (\overset{°}{f}^{n}) \leq C_{1}$ , we have

B^{(n)} Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n}) \leq G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) + C_{2} Φ_{χ} (\overset{°}{f}^{n}) + C_{2},

(18)

and then

B^{(n)} Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n}) \leq - ϵ Φ_{1 + χ} (\overset{°}{f}^{n}) + C_{2} Φ_{χ} (\overset{°}{f}^{n}) + C_{3},

(19)

with

C_{3} = ϵ C + C_{2}

. The proof of (18) is given in Section 5.6.

From (19) and inequality $Φ_{χ} (\overset{°}{f}^{n}) \leq {[Φ_{1 + χ} (\overset{°}{f}^{n})]}^{χ / (1 + χ)} \leq C_{4} + (ϵ / (2 C_{2})) Φ_{1 + χ} (\overset{°}{f}^{n})$ , which holds for a sufficiently large fixed $C_{4} > 0$ , we obtain

B^{(n)} Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n}) \leq - (ϵ / 2) Φ_{1 + χ} (\overset{°}{f}^{n}) + C_{5},

(20)

where

C_{5} = C_{4} C_{2} + C_{3}

Bound (20), in turn, implies that for any fixed $C_{1} > 0$ ,

B^{(n)} Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n}) \leq [- (ϵ / 2) Φ_{1 + χ} (\overset{°}{f}^{n}) + C_{5}] I {Φ_{2 + χ} (\overset{°}{f}^{n}) \leq C_{1}},

(21)

because, obviously,

B^{(n)} Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n}) \leq 0

when

Φ_{2 + χ} (\overset{°}{f}^{n}) > C_{1}

Recalling that $\overset{°}{f}^{n} (\infty)$ is the random value of $\overset{°}{f}^{n} (t)$ in the stationary regime, we have for all large n:

\begin{array}{l} 0 = E B^{(n)} Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n} (\infty)) \leq E [(- (ϵ / 2) Φ_{1 + χ} (\overset{°}{f}^{n} (\infty)) + C_{5}) I {Φ_{2 + χ} (\overset{°}{f}^{n} (\infty)) \leq C_{1}}] \\ \leq - (ϵ / 2) E [Φ_{1 + χ} (\overset{°}{f}^{n} (\infty)) I {Φ_{2 + χ} (\overset{°}{f}^{n} (\infty)) \leq C_{1}}] + C_{5}, \end{array}

and then

E [Φ_{1 + χ} (\overset{°}{f}^{n} (\infty)) I {Φ_{2 + χ} (\overset{°}{f}^{n} (\infty)) \leq C_{1}}] \leq 2 C_{5} / ϵ .

Letting $C_{1} ↑ \infty$ , we finally obtain that

E Φ_{1 + χ} (\overset{°}{f}^{n} (\infty)) \leq 2 C_{5} / ϵ,

for all sufficiently large n, and then

E Φ_{1 + χ} (\overset{°}{f}^{n} (\infty)) \leq \bar{C},

holds for all n for some large

\bar{C} > 0

. □

5.5. Proof of (16)

The definition of ${\bar{ζ}}_{i} = {\bar{ζ}}_{i}^{(\overset{°}{f}^{n})}$ in (11) can be interpreted as follows: ${\bar{ζ}}_{i}$ is the expected jump size of particle i, conditioned on this particle receiving the jump urge, and then centered by the expected jump size of any particle upon a jump urge in the system.

The proof is by contradiction. Suppose Property (16) does not hold. Then, we can and do choose a subsequence of $n \to \infty$ , and corresponding $\overset{°}{f}^{n}$ , so that along this subsequence $Φ_{1 + χ} (\overset{°}{f}^{n}) ↑ \infty$ and

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) / Φ_{1 + χ} (\overset{°}{f}^{n}) \to 0 .

(22)

Consider separately two cases: (a) ${\lim \inf}_{n} Φ_{1} (\overset{°}{f}^{n}) = c < \infty$ and (b) $\lim_{n} Φ_{1} (\overset{°}{f}^{n}) = \infty$ .

Case (a).

Consider a subsequence of $\overset{°}{f}^{n}$ such that $\lim_{n} Φ_{1} (\overset{°}{f}^{n}) = c < \infty$ and, moreover, $\overset{°}{f}^{n} \overset{w}{\to} f$ , where f is a proper distribution.

For a fixed $0 < δ < 1 / 2$ , denote

Φ_{1 + χ}^{(δ)} (\overset{°}{f}^{n}) = \int_{[0, δ] \cup [1 - δ, 1]} sign ({[\overset{°}{f}^{n}]}^{- 1} (ν)) | {[\overset{°}{f}^{n}]}^{- 1} (ν) |^{1 + χ} d ν .

Using the facts that f is proper and $Φ_{1 + χ} (f^{n}) \to \infty$ , we easily see that, for any fixed $δ > 0$ ,

\lim Φ_{1 + χ}^{(δ)} (\overset{°}{f}^{n}) / Φ_{1 + χ} (\overset{°}{f}^{n}) = 1 .

Pick a sufficiently small $δ > 0$ such that, for some $δ_{1} > 0$ and all large n, $η_{n} (δ) - v_{n} > δ_{1}$ and $η_{n} (1 - δ) - v_{n} < - δ_{1}$ . Then, we have

\lim \inf | G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) | / Φ_{1 + χ} (\overset{°}{f}^{n}) \geq (2 + χ) δ_{1},

which contradicts (22).

Case (b).

Denote $c_{n} = Φ_{1} (\overset{°}{f}^{n}) ↑ \infty$ , and consider the sequence of rescaled versions of $\overset{°}{f}^{n}$ , namely,

{\tilde{f}}_{x}^{n} = {\overset{°}{f}}_{c_{n} x}^{n}, w \in R .

Note that

G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) / Φ_{1 + χ} (\overset{°}{f}^{n}) = G_{1 + χ}^{(n)} ({\tilde{f}}^{n}) / Φ_{1 + χ} ({\tilde{f}}^{n}) .

(23)

Consider two subcases: (b.1) $\lim_{n} Φ_{1 + χ} ({\tilde{f}}^{n}) = \infty$ and (b.2) ${\lim \inf}_{n} Φ_{1 + χ} ({\tilde{f}}^{n}) = c < \infty$ . In the subcase (b.1), on account of (23), we can obtain a contradiction in the same way as in the Case (a).

So, the remaining case to consider is (b.2).

For an $f \in M$ , let us formally define a “limiting version” of the functional $G_{1 + χ}^{(n)} (\overset{°}{f}^{n})$ defined in the LHS of the inequality in (15):

G_{1 + χ} (f) = (2 + χ) \int_{0}^{1} d ν [η (ν) - v] sign (f^{- 1} (ν)) | f^{- 1} (ν)) |^{1 + χ} .

(24)

Note that $Φ_{1 + χ} ({\tilde{f}}^{n}) \geq 1$ , so that $0 < c < \infty$ . Consider a subsequence of ${\tilde{f}}^{n}$ such that $\lim_{n} Φ_{1 + χ} ({\tilde{f}}^{n}) = c$ and ${\tilde{f}}^{n} \overset{w}{\to} f \in M$ . Distribution f cannot be concentrated at a single point. (Otherwise, because $Φ_{1} ({\tilde{f}}^{n}) = 1$ for all n, $Φ_{1 + χ} ({\tilde{f}}^{n})$ could not remain bounded.) Therefore, $| G_{1 + χ} (f) | > 0$ , and then

\lim \inf | G_{1 + χ}^{(n)} ({\tilde{f}}^{n}) | \geq | G_{1 + χ} (f) | > 0,

and then

\lim \inf | G_{1 + χ}^{(n)} ({\tilde{f}}^{n}) | / Φ_{1 + χ} ({\tilde{f}}^{n}) > 0

, which contradicts (22). □

5.6. Proof of (18)

We will use the following inequality, which holds, for some constant $C_{6} > 0$ , for any numbers y and δ:

| y + δ |^{2 + χ} - | y |^{2 + χ} \leq (2 + χ) sign (y) | y |^{1 + χ} δ + C_{6} | y |^{χ} δ^{2} + C_{6} | δ |^{2 + χ} .

(25)

Indeed,

| y + δ |^{2 + χ} - | y |^{2 + χ} \leq (2 + χ) sign (y) | y |^{1 + χ} δ + (1 / 2) (2 + χ) (1 + χ) | \tilde{y} |^{χ} δ^{2},

for some

\tilde{y} \in [y - | δ |, y + | δ |]

. Using

| \tilde{y} |^{χ} \leq {(| y | + | δ |)}^{χ} \leq {(2 | y |)}^{χ} + {(2 | δ |)}^{χ}

, we obtain (25).

Consider a fixed state $\overset{°}{f}^{n}$ and consider the expected increment Δ of $Φ_{2 + χ} (\overset{°}{f}^{n})$ upon a jump urge, which occurs in this state. Then, using (25),

Δ = E \frac{1}{n} \sum_{i} | w_{i} + ζ_{i} |^{2 + χ} - \frac{1}{n} \sum_{i} | w_{i} |^{2 + χ} \leq E \frac{1}{n} \sum_{i} [(2 + χ) sign (w_{i}) | w_{i} |^{1 + χ} ζ_{i} + C_{6} | w_{i} |^{χ} ζ_{i}^{2} + C_{6} | ζ_{i} |^{2 + χ}],

where expectation

E

is with respect to the uniform selection of the particle receiving the jump urge, the random event of it actually jumping, and the randomness of the jump size (if it occurs).

For ζ_i, we have:

ζ_{i} = κ_{i} - \frac{1}{n} \sum_{s} κ_{s},

where

κ_{i} = κ_{i} (\overset{°}{f}^{n})

the (random) jump size of particle i. Note that

E ζ_{i}

is the quantity of the order

O (1 / n)

, because

\sum_{s} κ_{s}

is of order O(1) and

E κ_{i}

is of order

O (1 / n)

(because

1 / n

is the probability of particle i receiving the jump urge). Therefore,

{\bar{ζ}}_{i} = n E ζ_{i} = O (1)

, and we can write:

E \sum_{i} sign (w_{i}) | w_{i} |^{1 + χ} ζ_{i} = \frac{1}{n} \sum_{i} sign (w_{i}) | w_{i} |^{1 + χ} {\bar{ζ}}_{i} .

Next,

E {(ζ_{i} / 2)}^{2 + χ} \leq (1 / 2) E κ_{i}^{2 + χ} + (1 / 2) \frac{1}{n^{2 + χ}} E {(\sum_{s} κ_{s})}^{2 + χ},

and, therefore,

E ζ_{i}^{2 + χ} \leq C_{7} / n,

(26)

for some

C_{7} > 0

because

E {(\sum_{s} κ_{s})}^{2 + χ}

is upper bounded by the

(2 + χ)

-th moment of a jump size and

E κ_{i}^{2 + χ}

is upper bounded by

1 / n

times the

(2 + χ)

-th moment of a jump size. Note that (26) holds for χ = 0 as well, with possibly different C₇. Therefore, by choosing C₇ sufficiently large, we have both (26) and

E ζ_{i}^{2} \leq C_{7} / n .

(27)

Assembling these bounds, we obtain

n Δ \leq (2 + χ) \frac{1}{n} \sum_{i} sign (w_{i}) | w_{i} |^{1 + χ} {\bar{ζ}}_{i} + C_{6} C_{7} \frac{1}{n} \sum_{i} | w_{i} |^{χ} + C_{6} C_{7} = G_{1 + χ}^{(n)} (\overset{°}{f}^{n}) + C_{2} Φ_{χ} (\overset{°}{f}^{n}) + C_{2},

(28)

where, recall, Δ is the expected increment of

Φ_{2 + χ}

upon a jump urge occurring in a fixed state

\overset{°}{f}^{n}

, and

C_{2} = C_{6} C_{7}

Now, consider the value of the generator $B^{(n)} Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n})$ at point $\overset{°}{f}^{n}$ such that $Φ_{2 + χ} (\overset{°}{f}^{n}) \leq C_{1}$ . For that, consider the expected increment of $Φ_{2 + χ}^{(C_{1})} (\overset{°}{f}^{n} (t))$ over a small interval $[0, t / n]$ , with $\overset{°}{f}^{n} (0) = \overset{°}{f}^{n}$ . First, note that, as $t ↓ 0$ , the contribution into this expected increment of the event that more than one jump urge occurs, is o(t) (because jump urges follow a Poisson process of rate n, and $Φ_{2 + χ}^{(C_{1})}$ is bounded). With probability $t + o (t)$ , there will be exactly one jump urge in $[0, t / n]$ , which, therefore, occurs into the state $\overset{°}{f}^{n}$ (such that $Φ_{2 + χ} (\overset{°}{f}^{n}) \leq C_{1}$ ); then, the expected increment of $Φ_{2 + χ}^{(C_{1})}$ will not exceed that of $Φ_{2 + χ}$ . Using these observations and the Estimate (28), we obtain (18). We omit the remaining straightforward $ϵ / δ$ formalities. □

6. Proof of Theorem 3

This proof largely follows the approach used in Balazs et al. (2014), for a different model. Unlike in Balazs et al. (2014), we work with the Levy-Prohorov metric (inducing the weak convergence topology) on $M$ , as opposed to the stronger Wasserstein W₁-metric. This, in fact, simplifies some parts of the proof in our case; we will point out those parts as they appear. However, some other parts of our proof of Theorem 3 are completely different from (or not present in) the development in Balazs et al. (2014). They are Section 6.3 and Theorem 8, which establish the equivalence between solutions to (10) and mean-field models; and Theorem 9, which establishes the uniqueness of a mean-field model and its continuous dependence on the initial state.

We note that many of the supplementary results in this section, which may be of independent interest, require assumptions on the jump-size distribution that are much weaker than conditions $m^{(1)} = E Z < \infty$ and (3). In particular, some of the results assume nothing about the jump-size distribution besides it being a proper distribution. We will emphasize such weaker assumptions, where applicable, in the results’ statements.

6.1. C-relative Compactness of the Processes

A sequence of random processes with sample path in the Skorohod space $D ([0, \infty), R)$ (resp., $D ([0, \infty), M)$ ) is called C-relatively compact (see Perkins 2002 and Balazs et al. 2014) if it is: (a) relatively compact—that is, its any subsequence has a further subsequence converging in distribution to some limiting process; and (b) any such limiting process has continuous sample paths, a.s.

Theorem 4.

Suppose $f^{n} (0) \overset{w}{\to} f (0)$ , where ${f^{n} (0)}$ is the deterministic sequence of $f^{n} (0) \in M^{(n)}$ , and $f (0) \in M$ . Then, for any $h \in C_{b}$ , the sequence of processes ${f^{n} (\cdot) h}$ is C-relatively compact in the Skorohod space $D ([0, \infty), R)$ . (Note that we do not assume that f(0) has well-defined mean $\bar{f} (0)$ , or (3), or (2), or even $m^{(1)} = E Z < \infty$ . Jump-size distribution only needs to be proper.)

Proof.

This result is analogous to theorem 6.9 in Balazs et al. (2014). Note that, although the model in Balazs et al. (2014) is different from ours, the only property that is used in the proof of theorem 6.9 in Balazs et al. (2014) is that the rate of jumps of each particle is upper bounded by a finite constant a at all times. The latter property, obviously, holds for our model as well, with $a = μ = 1$ . Therefore, the proof of theorem 6.9 in Balazs et al. (2014) applies essentially verbatim, with the following adjustments.

What in the proof of theorem 6.9 in Balazs et al. (2014) are $f, μ_{n}, a$ , in our notation, are $h, f^{n}, μ = 1$ , respectively. In the proof, $x_{i} (t)$ denote the locations of the particles, uniquely determined by the process state at time t, and vice versa. The notation ${\tilde{x}}_{i} (t)$ is used for the locations of particles in an artificial system, with the same initial particle locations ${\tilde{x}}_{i} (0) = x_{i} (0)$ , but such that each particle jumps every time it gets a jump urge; the artificial system is coupled to the original one in the natural way, so that the corresponding particles have common processes of jump urges and common jump sizes (if the particles in the original system happen to jump at a jump urge). Clearly, with this coupling, $x_{i} (t) \leq {\tilde{x}}_{i} (t)$ and $x_{i} (t) - x_{i} (s) \leq {\tilde{x}}_{i} (t) - {\tilde{x}}_{i} (s)$ at all times $s \leq t$ . Finally, in our case, $I_{n} (t) = f^{n} (t) h = \int h (x) d_{x} f_{x}^{n} (t) = (1 / n) \sum_{i} h (x_{i} (t))$ . □

Theorem 5.

Suppose $f^{n} (0) \overset{w}{\to} f (0)$ , where ${f^{n} (0)}$ is the deterministic sequence of $f^{n} (0) \in M^{(n)}$ , and $f (0) \in M$ . Then, the sequence of processes ${f^{n} (\cdot)}$ is C-relatively compact in the Skorohod space $D ([0, \infty), M)$ . (Note that we do not assume that f(0) has well-defined mean $\bar{f} (0)$ , or (3), or (2), or even $m^{(1)} = E Z < \infty$ . Jump-size distribution only needs to be proper.)

Proof.

This result is analogous to corollary 6.10 in Balazs et al. (2014), with essentially the same proof. In fact, in our case, the proof is simpler. We need to verify conditions (i) and (ii) of theorem II.4.1 in Perkins (2002), which in our case take the following form.

(i) For any T > 0 and $ϵ > 0$ , there exists K > 0 such that
$\sup_{n} P (\sup_{t \leq T} [f_{- K}^{n} (t) + 1 - f_{K}^{n} (t)] > ϵ) < ϵ .$
(ii) For any $h \in C_{b}$ , the sequence of processes ${f^{n} (\cdot) h}$ is C-relatively compact in the Skorohod space $D ([0, \infty), R)$ . (Note that the class of functions $h \in C_{b}$ is separating, which means that a probability measure g is uniquely determined by the values of $g h$ for $h \in C_{b}$ .)

Condition (ii) is verified by Theorem 4. The verification of condition (i) repeats the proof of corollary 6.10 in Balazs et al. (2014), essentially verbatim, up to and including the display where Markov inequality is used for the first time. At that point, it remains to observe that the probability in the RHS of the display can be made arbitrarily small by making K sufficiently large. The measure $μ_{n} (t, \cdot)$ in the proof of corollary 6.10 in Balazs et al. (2014) is in our notation the measure (distribution) $f^{n} (t)$ on $R$ ; the particle locations $x_{i} (t)$ and ${\tilde{x}}_{i} (t)$ in the coupled original and artificial systems, are as described above in the proof of our Theorem 5. □

6.2. Trajectories of a Limit Satisfy (10)

For trajectories $f^{n} (\cdot) \in D ([0, \infty), M)$ with $f^{n} (t) \in M^{(n)}$ for all $t \geq 0$ , let us define the following functional for each $h \in C_{b}$ and $t \geq 0$ :

A_{t, h}^{n} (f^{n} (\cdot)) ≐ f^{n} (t) h - f^{n} (0) h - \int_{0}^{t} L^{(n)} f^{n} (s) hds .

We will also formally define a “limit version” of $A_{t, h}^{n}$ for trajectories $f (\cdot) \in D ([0, \infty), M)$ , for each $h \in C_{b}$ and $t \geq 0$ :

A_{t, h} (f (\cdot)) ≐ f (t) h - f (0) h - \int_{0}^{t} L f (s) hds .

Theorem 6.

Suppose $f^{n} (0) \overset{w}{\to} f (0)$ , where ${f^{n} (0)}$ is deterministic sequence of $f^{n} (0) \in M^{(n)}$ , and $f (0) \in M$ . Then, the sequence of processes ${f^{n} (\cdot)}$ is such that, for every $t \geq 0$ and any $h \in C_{b}$ ,

\sup_{0 \leq s \leq t} | A_{s, h}^{n} (f^{n} (\cdot)) | \Rightarrow 0, as n \to \infty .

(Note that we do not assume that f(0) has well-defined mean

\bar{f} (0)

, or (3), or (2), or even

m^{(1)} = E Z < \infty

. Jump-size distribution only needs to be proper.)

Proof.

The proof repeats the proof of theorem 6.11 in Balazs et al. (2014), in which we replace: L by $L^{(n)}$ ; f by h; μ_n by f ⁿ; a by μ = 1. In particular, in our case, $I_{n} (t) = f^{n} (t) h = \int h (x) d_{x} f_{x}^{n} (t) = (1 / n) \sum_{i} h (x_{i} (t))$ , so that $L I_{n} (t)$ is replaced by

L^{(n)} I_{n} (t) = L^{(n)} f^{n} (t) h = \int_{0}^{1} E [h (f_{ν}^{- 1} (t) + Z) - h (f_{ν}^{- 1} (t))] η_{n} (ν) d ν,

where the expectation in the integrand is over a random jump size Z. The martingale

M_{n} (t), t \geq 0,

in our case is

M_{n} (t) = A_{t, h}^{n} (f^{n} (\cdot)),

so that

L M_{n}^{2} (t)

in our case is

L^{(n)} M_{n}^{2} (t)

. The last line of the last display of the proof in Balazs et al. (2014) can be removed, and the final estimate

L^{(n)} M_{n}^{2} (t) \leq 4 {‖ h ‖}^{2} / n

for

h \in C_{b}

can be observed without that line (because the jump urge rate of each particle is μ = 1).

Finally, note that in our theorem, we only need to consider $h \in C_{b}$ . We do not need to consider the identity test function h(x) = x, and that is why in Theorem 6, we do not need condition $E Z^{2} < \infty$ —it suffices that Z has a proper distribution. □

Theorem 7.

Suppose $f^{n} (0) \overset{w}{\to} f (0)$ , where ${f^{n} (0)}$ is deterministic sequence of $f^{n} (0) \in M^{(n)}$ , and $f (0) \in M$ . Suppose the sequence of processes ${f^{n} (\cdot)}$ is such that, $f^{n} (\cdot) \Rightarrow f (\cdot)$ as $n \to \infty$ , in $D ([0, \infty), M)$ . Then, for every $t \geq 0$ and any $h \in C_{b}$ ,

A_{t, h}^{n} (f^{n} (\cdot)) \Rightarrow A_{t, h} (f (\cdot)), a s n \to \infty .

(Note that we do not assume that f(0) has well-defined mean

\bar{f} (0)

, or (3), or (2), or even

m^{(1)} = E Z < \infty

. Jump-size distribution only needs to be proper.)

Proof.

The statement of this theorem is analogous to that of theorem 6.12 in Balazs et al. (2014). However, the proof in our case is simpler and is as follows. We know that, by Theorem 5, a.s. the limiting process $f (\cdot)$ has continuous trajectories in $D ([0, \infty), M)$ . We can use Skorohod representation to construct all processes on a common probability space so that, w.p.1, $f^{n} (\cdot) \overset{J_{1}}{\to} f (\cdot)$ as $n \to \infty$ ; moreover, by the continuity of $f (\cdot)$ , we see that $f^{n} (t) \to f (t)$ uniformly on compact sets of t; we also see that, w.p.1, $f_{x} (t)$ is nonincreasing in t (because so are $f_{x}^{n} (t)$ ). Then, it is easy to see (using, in particular, the facts that convergence $η_{n} (\cdot) \to η (\cdot)$ is uniform, and $η (\cdot)$ is strictly decreasing continuous) that $A_{t, h}^{n} (f^{n} (\cdot)) \to A_{t, h} (f (\cdot))$ w.p.1. □

Corollary 1.

Suppose $f^{n} (0) \overset{w}{\to} f (0)$ , where ${f^{n} (0)}$ is the deterministic sequence of $f^{n} (0) \in M^{(n)}$ , and $f (0) \in M$ . Then, the sequence of processes ${f^{n} (\cdot)}$ is such that its any distributional limit $f (\cdot)$ in $D ([0, \infty), M)$ is such that, a.s., $f (\cdot)$ is continuous, and it satisfies $A_{t, h} (f (\cdot)) = 0$ (i.e., (10)) for every $t \geq 0$ and any $h \in C_{b}$ . (Note that we do not assume that f(0) has well-defined mean $\bar{f} (0)$ , or (3), or (2), or even $m^{(1)} = E Z < \infty$ . Jump-size distribution only needs to be proper.)

Proof.

By Theorem 5, there exists a subsequence of ${f^{n} (\cdot)}$ , which converges in distribution to a process $f (\cdot)$ with a.s. continuous trajectories. By Theorems 6 and 7, this process must satisfy $A_{t, h} (f (\cdot)) = 0$ for every $t \geq 0$ and any $h \in C_{b}$ . □

6.3. Equivalent Characterization of Solutions to (10) as Mean-Field Models

Definition 1.

A function $f_{x} (t), x \in R, t \in R_{+},$ will be called a mean-field model if it satisfies the following conditions.

(a) For any t, $f (t) = (f_{x} (t), x \in R) \in M$ .
(b) For any x, $f_{x} (t)$ is nonincreasing c-Lipschitz in t, with constant c independent of x.
(c) For any x, for any t where the partial derivative $(\partial / \partial t) f_{x} (t)$ exists (which is almost all t w.r.t. Lebesgue measure, by the Lipschitz property), equation
$\frac{\partial}{\partial t} f_{x} (t) = - \int_{0}^{1} d ν f^{- 1} (ν) η (f^{- 1} (ν)) I {f^{- 1} (ν) \leq x} \bar{J} (x - f^{- 1} (ν)),$ (29)
holds.

Note that, by the change of variable $y = f^{- 1} (ν)$ in the integral in the RHS of (29), Equation (29) can be equivalently written as

\frac{\partial}{\partial t} f_{x} (t) = - \int_{- \infty}^{x} d_{y} f_{y} (t) \bar{η} (y, f (t)) \bar{J} (x - y),

(30)

where we use notations

\bar{η} (y, f (t)) ≐ {\begin{array}{l} η (f_{y} (t)), & when f_{u} (t) is continuous at point u = y \\ {(ν_{2} - ν_{1})}^{- 1} \int_{ν_{1}}^{ν_{2}} η (ν) d ν, & otherwise, \end{array}

where

ν_{2} = f_{y} (t), ν_{1} = f_{y -} (t)

. Equation (30) (or (29)) is a more general form of (4), allowing

f_{x} (t)

to be RCLL in x, rather than continuous. If

f_{u} (t)

is continuous at u = y, then

\bar{η} (y, f (t)) = η (f_{y} (t))

; if

f_{u} (t)

has a jump at u = y, then

\bar{η} (y, f (t))

η (ν)

averaged over

ν \in [f_{y -} (t), f_{y} (t)]

. In paper Stolyar (2023), the equation in Form (30) is used to define a mean-field model.

Theorem 8.

For any initial condition $f (0) \in M, f (\cdot)$ satisfies (10) if and only if it is a mean-field model. (Note that we do not assume that f(0) has well-defined mean $\bar{f} (0)$ , or (3), or (2), or even $m^{(1)} = E Z < \infty$ . Jump-size distribution only needs to be proper.)

Proof.

“Only if.” Let $h = (h (u), u \in R)$ be the left-continuous step-function $h (u) = I (u \leq x)$ , jumping from one to zero at point x. Let $h_{ϵ}, ϵ > 0$ be a continuous approximation of h, which is linearly decreasing from one to zero in $[x, x + ϵ]$ . We see that

L [f (t) h_{ϵ}] \to L [f (t) h], \forall t .

Indeed, $| L [f (t) h_{ϵ}] - L [f (t) h] |$ is upper bounded by the probability that a random jump of size Z of a particle randomly located according to f(t) is such that the jump either originates or lands in $(x, x + ϵ)$ ; this probability vanishes as $ϵ ↓ 0$ . Also, because both $L [f (t) h_{ϵ}]$ and $L [f (t) h_{ϵ}]$ are within $[- 1, 0]$ , for all t and ϵ, we have a universal bound $| L [f (t) h_{ϵ}] - L [f (t) h] | \leq 1$ . Then, for any fixed t, by taking the $ϵ ↓ 0$ limit in $f (t) h_{ϵ} - f (0) h_{ϵ} - \int_{0}^{t} L [f (s) h_{ϵ}] d s = 0$ , we obtain

f (t) h - f (0) h - \int_{0}^{t} L [f (s) h] d s = 0 .

This means that $f (t) h = f_{x} (t)$ is absolutely continuous in t, with the derivative equal to $(\partial / \partial t) f_{x} (t) = L [f (t) h]$ a.e. in t. This, in particular, implies that, for any fixed y, the $f_{y} (t) - f_{y -} (t)$ is continuous in t (in fact, Lipschitz); this, in turn, means that, possible discontinuity points y of $f_{y} (t)$ “cannot move” in time t. We can then conclude that, for any x, the derivative $(\partial / \partial t) f_{x} (t) = L [f (t) h]$ is, in fact, continuous in t. Therefore, $(\partial / \partial t) f_{x} (t) = L [f (t) h]$ at every t. It remains to observe that $L [f (t) h]$ is exactly the RHS of (29).

“If.” The definition of a mean-field model $f (\cdot)$ implies that (10) holds for the defined above step-function h for any x. Then, we have (10) for any h, which is piece-wise constant with finite number of pieces; and the set of such functions h is tight within the space of test functions $h \in C_{b}$ , equipped with uniform metric. Then, (10) holds for any $h \in C_{b}$ . □

6.4. Uniqueness of Solution to (10) and Continuity in Initial State. Proof of Theorem 3(i)

Theorem 9.

Assume that $m^{(1)} = E Z < \infty$ (or, $m^{(1)} = E Z = 1$ , WLOG) and Condition (3) holds. Then, for any initial condition $f (0) \in M$ , a solution $f (\cdot)$ of (10) (i.e., a mean-field model) is unique. (Note that we do not assume that f(0) has well-defined mean $\bar{f} (0)$ , or (2).)

Proof.

We know that solutions to (10) are mean-field models. Papers Greenberg et al. (1996) and Stolyar (2023) study the properties of mean-field models. It is easy to check that the proofs of all results in sections 4.4 and 4.5 of Greenberg et al. (1996), including Theorem 2, stating that the Wasserstein W₁-distance (i.e., the L₁-norm of the difference) between any two mean-field models $f^{(1)} (\cdot)$ and $f^{(2)} (\cdot)$ is nonincreasing, never use the fact that the means ${\bar{f}}^{(1)} (0)$ and ${\bar{f}}^{(2)} (0)$ are well-defined and equal to zero. Those proofs only use the fact that $f^{(1)} (0), f^{(2)} (0) \in M$ and

\int_{- \infty}^{\infty} [f_{w}^{(1)} (0) - f_{w}^{(2)} (0)] d w = 0 .

(The conditions

E Z < \infty

and (3) are used there. Technically, those proofs in Greenberg et al. (1996) assume that the jump-size distribution

J (\cdot)

is exponential—but only the Property (3) is actually used.)

For any $f (0) \in M$ , a mean-field model $f (\cdot)$ is such that (see Stolyar 2023)

\int_{- \infty}^{\infty} [f_{w} (0) - f_{w} (t)] d w = v t, \forall t \geq 0 .

(31)

Now, if $f^{(1)} (\cdot)$ and $f^{(2)} (\cdot)$ are two different solutions with the same initial condition, $f^{(1)} (0) = f^{(2)} (0) = f (0)$ , then, from (31),

\int_{- \infty}^{\infty} [f_{w}^{(1)} (t) - f_{w}^{(2)} (t)] d w = 0, \forall t \geq 0 .

Therefore, the Wasserstein W₁-distance between $f^{(1)} (t)$ and $f^{(2)} (t)$ cannot increase, which implies the uniqueness. □

Proof of Theorem 3(i).

We have established that the family of distributions of the processes is C-tight, any subsequential distributional limit is concentrated on solutions $f (\cdot)$ to (10) (i.e., mean-field models), and the solution $f (\cdot)$ with initial condition f(0) is unique. This implies the convergence $f^{n} (\cdot) \Rightarrow f (\cdot)$ .

The continuity of $f (\cdot)$ in f(0) is then a consequence of convergence and uniqueness. Indeed, consider a sequence of initial states $f^{(k)} (0) \in M$ , converging to some $f (0) \in M$ ; namely, $f^{(k)} (0) \overset{w}{\to} f (0)$ as $k \to \infty$ . Fix $ϵ > 0$ and any $δ > 0$ . We can choose an increasing sequence $n = n (k) \to \infty$ and corresponding $f^{(k), n} (0) \in M^{(n)}$ , such that $f^{(k), n} (0) \overset{w}{\to} f (0)$ as $k \to \infty$ , and for each k, the process $f^{(k), n} (\cdot)$ is within distance ϵ from the deterministic trajectory $f^{(k)} (\cdot)$ with probability at least $1 - δ$ . Then, for all large k, $f^{(k), n} (\cdot)$ is within distance ϵ from both $f^{(k)} (\cdot)$ and $f (\cdot)$ with probability at least $1 - 2 δ$ . Because this is true for any ϵ and δ, the only option is that $f^{(k)} (\cdot) \overset{J_{1}}{\to} f (\cdot)$ . □

6.5. Corollaries for the Centered Processes: Proof of Theorem 3(ii)

Suppose $\bar{f} (0) = 0$ . Then, from (31), we have $\bar{f} (t) = v t$ . Then, the continuity and uniqueness of $\overset{°}{f} (\cdot)$ , as well as the continuity of its dependence on $\overset{°}{f} (0)$ , follow from the corresponding properties of noncentered $f (\cdot)$ in Theorem 3(i).

Suppose, in addition, that ${\bar{f}}^{n} (0) = 0$ for all n. Because η_n converges to η, we easily see directly that ${\bar{f}}^{n} (t) \overset{P}{\to} v t$ for any t. Combining these observations with Theorem 3(i), we obtain the convergence $\overset{°}{f}^{n} (\cdot) \Rightarrow \overset{°}{f} (\cdot)$ ; for example, we can use Skorohod representation to obtain a.s. convergence to the limit. □

7. Proof of Theorem 1

7.1. Basic Properties of a Limit of Stationary Distributions

Lemma 1.

The sequence of stationary distributions—that is, the distributions of $\overset{°}{f}^{n} (\infty)$ —is tight. Any subsequential distributional limit $\overset{°}{f} (\infty)$ is such that:

$\overset{°}{f} (\infty)$ is concentrated on $\overset{°}{M}$ ;

E Φ_{1 + χ} (\overset{°}{f} (\infty)) \leq \bar{C},

(32)

where

\bar{C}

is the constant in Theorem 2;

E Φ_{1} (\overset{°}{f} (\infty)) \leq 1 + \bar{C} .

(33)

Proof.

By Theorem 2 and Markov inequality, for any $δ > 0$ , there exists a sufficiently large number $C (δ) > 0$ , such that for all large n, with probability at least $1 - δ, Φ_{1 + χ} (\overset{°}{f}^{n} (\infty)) \leq C (δ)$ . The set

S (δ) = {f \in \overset{°}{M} | Φ_{1 + χ} (f) \leq C (δ)},

is compact in

M

(equipped with the weak convergence topology). And we know that, for all large n,

P {\overset{°}{f}^{n} (\infty) \in S (δ)} \geq 1 - δ

. Therefore, the sequence of distributions of

\overset{°}{f}^{n} (\infty)

is tight. Consider any subsequential distributional limit

\overset{°}{f} (\infty)

—that is,

\overset{°}{f}^{n} (\infty) \Rightarrow \overset{°}{f} (\infty)

for a subsequence of n. Then,

P {\overset{°}{f} (\infty) \in S (δ)} \geq 1 - δ

, and then the mean

\bar{\overset{°}{f}} (\infty) = 0

w.p.1. Moreover, the limit

\overset{°}{f} (\infty)

must be such that (32) holds. (This is by the Fatou lemma because, using Skorohod representation, we can construct the sequence of

\overset{°}{f}^{n} (\infty)

and

\overset{°}{f} (\infty)

on a common probability space such that the convergence

\overset{°}{f}^{n} (\infty) \overset{w}{\to} \overset{°}{f} (\infty)

is w.p.1.) Finally, (33) is from

E Φ_{1} (\overset{°}{f} (\infty)) \leq E [{[Φ_{1 + χ} (\overset{°}{f} (\infty))]}^{1 / (1 + χ)}] \leq 1 + E Φ_{1 + χ} (\overset{°}{f} (\infty)) \leq 1 + \bar{C} . □

7.2. Characterization of a Limit of Stationary Distributions

Suppose $f (0) \in \overset{°}{M}$ —that is, $f (0) \in M$ and $\bar{f} (0) = 0$ . If $f (\cdot)$ is a mean-field model with initial state f(0) (i.e., the solution to (10)), then the corresponding centered trajectory $\overset{°}{f} (\cdot)$ will be called the centered mean-field model.

Lemma 2.

The distribution of any subsequential limit $\overset{°}{f} (\infty)$ is a stationary distribution of the deterministic process evolving along centered mean-field limits.

Proof.

By Theorem 3(ii), the dependence of the deterministic trajectory $\overset{°}{f} (\cdot)$ on the initial state $\overset{°}{f} (0)$ is continuous. Then, we can apply theorem 8.5.1 in Liptser and Shiryaev (1989), adapted to our setting. Or, the proof is also easy to obtain directly as follows. We need to show that for any test function $h \in C_{b}$ and any $t \geq 0$ , we have

E \overset{°}{f} (0) h = E \overset{°}{f} (t) h,

(34)

where f(0) is equal in distribution to

f (\infty)

. We obtain (34) by taking the limit of the equality

E \overset{°}{f}^{n} (0) h = E \overset{°}{f}^{n} (t) h,

(35)

where

\overset{°}{f}^{n} (0)

is equal in distribution to

\overset{°}{f}^{n} (\infty)

; (35) clearly holds for all n. Clearly,

E \overset{°}{f}^{n} (0) h \to E \overset{°}{f} (0) h

. To demonstrate

E \overset{°}{f}^{n} (t) h \to E \overset{°}{f} (t) h,

(36)

we can use Skorohod representation, so that the convergence

\overset{°}{f}^{n} (0) \overset{w}{\to} \overset{°}{f} (0)

is a.s. For any deterministic converging sequence

\overset{°}{f}^{n} (0) \overset{w}{\to} \overset{°}{f} (0)

, we have, by Theorem 3(ii),

\overset{°}{f}^{n} (t) \Rightarrow \overset{°}{f} (t)

(which is equivalent to

\overset{°}{f}^{n} (t) \overset{P}{\to} \overset{°}{f} (t)

), and then

E \overset{°}{f}^{n} (t) h \to E \overset{°}{f} (t) h

. Thus, we obtain (36), and then (34). □

7.3. Completion of the Proof of Theorem 1

Consider any subsequential distributional limit $\overset{°}{f} (\infty)$ . Its distribution is a stationary distribution of the deterministic process evolving along centered mean-field limits $\overset{°}{f} (\cdot)$ .

First Proof.

Consider any two different initial conditions $\overset{°}{f}^{(1)} (0) \in \overset{°}{M}$ and $\overset{°}{f}^{(2)} (0) \in \overset{°}{M}$ of the deterministic process $\overset{°}{f} (\cdot)$ . The Wasserstein W₁-distance ${‖ \overset{°}{f}^{(1)} (t) - \overset{°}{f}^{(2)} (t) ‖}_{1}$ must strictly decrease with t, by theorem 2 in Greenberg et al. (1996). Then, the only option is that $\overset{°}{f} (\infty)$ is concentrated on a single element $ϕ$ , which then must be a traveling wave shape, and then a unique traveling wave shape. Otherwise, if we consider two independent stationary versions of the process $\overset{°}{f} (\cdot)$ , say, $\overset{°}{f}^{(1)} (0)$ and $\overset{°}{f}^{(2)} (0)$ , then $E {‖ \overset{°}{f}^{(1)} (t) - \overset{°}{f}^{(2)} (t) ‖}_{1}$ is finite for t = 0 (which follows from $E Φ_{1 + χ} (\overset{°}{f} (\infty)) \leq \bar{C}$ ), and it is strictly decreasing in t; this contradicts the stationarity. Finally, we obtain (7)—that is, $Φ_{1 + χ} (ϕ) < \infty$ —because $E Φ_{1 + χ} (\overset{°}{f} (\infty)) \leq \bar{C}$ . □

Second Proof.

By theorem 3.2 in Stolyar (2023), we have the existence of the unique traveling wave shape $ϕ \in \overset{°}{M}$ . Then, by theorem 2 in Greenberg et al. (1996), for any $\overset{°}{f} (0) \in \overset{°}{M}$ , the Wasserstein W₁-distance ${‖ \overset{°}{f} (t) - ϕ ‖}_{1}$ is strictly decreasing when it is nonzero. Then, $\overset{°}{f} (\infty)$ must be concentrated at $ϕ$ . Property (7) follows from $E Φ_{1 + χ} (\overset{°}{f} (\infty)) \leq \bar{C}$ . □

Remark 1.

We have given two proofs that complete the proof of Theorem 1. They both require additional Assumptions (2) and (3). Note that, as a byproduct of the first proof, we also obtain the existence of the unique traveling wave shape, but only under these additional conditions. As far as the existence of the traveling wave shape is concerned, it is established in theorem 3.1 in Stolyar (2023) under weaker conditions, only requiring the finite second moment of jump size, $m^{(2)} = E Z^{2} < \infty$ .

8. Discussion

The main results of this paper, in a sense, complete the “program” represented by previous work by Greenberg et al. (1995, 1996) and Stolyar (2023) on the specific model in this paper. Greenberg et al. (1995), informally speaking, prove the convergence to a deterministic mean-field model as $n \to \infty .$ (In our paper, we generalize that result to a more general model, without the finite-dependence assumption.) Greenberg et al. (1996), informally speaking, prove that if a traveling wave exists, then each mean-field model trajectory is attracted to that traveling wave, as $t \to \infty$ ; Stolyar (2023) shows that a traveling wave does exist under very general assumptions. This paper proves that the convergence to the traveling wave also holds if we “interchange the limits”: first consider the stationary distribution (take limit in $t \to \infty$ ) and then consider the $n \to \infty$ limit of stationary distribution; if we take limits in this order, the limit is the same—a traveling wave. Thus, the results of this program answer essentially “all” questions about the behavior of the system when n is large—both about its transient behavior and about its stationary distribution.

There are many other well-motivated large-scale particle systems (cf. Manita and Shcherbakov 2005; Malyshev and Manita 2006; Manita 2006, 2009, 2014; Malyshkin 2006, and Balazs et al. 2014), for which implementing a similar program would be of interest.

References

Balazs M, Racz MZ, Toth B (2014) Modeling flocks and prices: Jumping particles with an attractive interaction. Ann. Institut Henri Poincare Probab. Statist. 50(2):425–454.Google Scholar
Bramson M (2008) Stability of Queueing Networks (Springer Berlin Heidelberg, Berlin).Google Scholar
Dai JG (1995) On the positive Harris recurrence for multiclass queueing networks: A unified approach via fluid models. Ann. Appl. Probab. 5:49–77.Google Scholar
Ethier S, Kurtz T (1986) Markov Processes: Characterization and Convergence (Wiley, Hoboken, NJ).Google Scholar
Greenberg A, Malyshev V, Popov S (1995) Stochastic model of massively parallel computation. Markov Processes Related Fields 1(4):473–490.Google Scholar
Greenberg A, Shenker S, Stolyar A (1996) Asynchronous updates in large parallel systems. Proc. ACM SIGMETRICS Internat. Conf. Measurement Modeling Comput. Systems (ACM, New York), 91–103.Google Scholar
Hongler MO (2015) Exact soliton-like probability measures for interacting jump processes. Math. Sci. 40(1):62–66.Google Scholar
Hongler MO, Filliger R (2019) When do redundant requests reduce latency? Methodology Comput. Appl. Probab. 21:753–764.Google Scholar
Liptser RS, Shiryaev AN (1989) Theory of Martingales (Kluwer Academic Publishers Dordrecht, Dordrecht, Netherlands).Google Scholar
Malyshev V, Manita A (2006) Phase transitions in the time synchronization model. Theory Probab. Appl. 50:134–141.Google Scholar
Malyshkin A (2006) Limit dynamics for stochastic models of data exchange in parallel computation networks. Problems Inform. Transmission 42:234–250.Google Scholar
Manita A (2006) Markov processes in the continuous model of stochastic synchronization. Russian Math. Surveys 61:993–995.Google Scholar
Manita A (2009) Stochastic synchronization in a large system of identical particles. Theory Probab. Appl. 53:155–161.Google Scholar
Manita A (2014) Clock synchronization in symmetric stochastic networks. Queueing Systems 76:149–180.Google Scholar
Manita A, Shcherbakov V (2005) Asymptotic analysis of a particle system with mean-field interaction. Markov Processes Related Fields 11:489–518.Google Scholar
Perkins EE (2002) Dawson-Watanabe superprocesses and measure-valued diffusions. Bernard P, ed. Lectures on Probability Theory and Statistics: Ecole d’Eté de Probabilités de Saint-Flour XXIX, 1999, Lecture Notes in Mathematics, vol. 1781 (Springer, Berlin), 125–324.Google Scholar
Rybko A, Stolyar A (1992) Ergodicity of stochastic processes describing the operation of open queueing networks. Problems Inform. Transmission 28:199–220.Google Scholar
Stolyar AL (1995) On the stability of multiclass queueing networks: A relaxed sufficient condition via limiting fluid processes. Markov Processes Related Fields 1(4):491–512.Google Scholar
Stolyar AL (2022) Parallel server systems with cancel-on-completion redundancy. Stochastic Systems 12(4):340–372.Link, Google Scholar
Stolyar AL (2023) Large-scale behavior of a particle system with mean-field interaction: Traveling wave solutions. Adv. Appl. Probab. 55(1):245–274.Google Scholar

Volume 13, Issue 3

September 2023

Pages 321-397

Article Information

Metrics

Information

Received:June 03, 2022
Accepted:February 27, 2023
Published Online:April 20, 2023

Cite as

Alexander L. Stolyar (2023) A Particle System with Mean-Field Interaction: Large-Scale Limit of Stationary Distributions. Stochastic Systems 13(3):343-359.

https://doi.org/10.1287/stsy.2023.0108

Keywords

PDF download

Available Issues

Available Issues

Available Issues

A Particle System with Mean-Field Interaction: Large-Scale Limit of Stationary Distributions

Abstract

1. Introduction

1.1. Outline of the Rest of the Paper

2. The Model and Main Results

3. Basic Notation

4. Formal Statements of Main Results

4.1. Uniform Moment Bound for Stationary Distributions. Limit of Stationary Distributions

4.2. Transient Behavior: Convergence to a Mean-Field Model

5. Proof of Theorem 2

5.1. Equivalent View of Process $\overset{°}{f}^{n} (\cdot)$

5.2. Informal Intuition for the Proof

5.3. Stability

5.4. Proof of (9)

5.5. Proof of (16)

5.6. Proof of (18)

6. Proof of Theorem 3

6.1. C-relative Compactness of the Processes

6.2. Trajectories of a Limit Satisfy (10)

6.3. Equivalent Characterization of Solutions to (10) as Mean-Field Models

6.4. Uniqueness of Solution to (10) and Continuity in Initial State. Proof of Theorem 3(i)

6.5. Corollaries for the Centered Processes: Proof of Theorem 3(ii)

7. Proof of Theorem 1

7.1. Basic Properties of a Limit of Stationary Distributions

7.2. Characterization of a Limit of Stationary Distributions

7.3. Completion of the Proof of Theorem 1

8. Discussion

References

Volume 13, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News

Available Issues

Available Issues

A Particle System with Mean-Field Interaction: Large-Scale Limit of Stationary Distributions

Abstract

1. Introduction

1.1. Outline of the Rest of the Paper

2. The Model and Main Results

3. Basic Notation

4. Formal Statements of Main Results

4.1. Uniform Moment Bound for Stationary Distributions. Limit of Stationary Distributions

4.2. Transient Behavior: Convergence to a Mean-Field Model

5. Proof of Theorem 2

5.1. Equivalent View of Process f°n(·)

5.2. Informal Intuition for the Proof

5.3. Stability

5.4. Proof of (9)

5.5. Proof of (16)

5.6. Proof of (18)

6. Proof of Theorem 3

6.1. C-relative Compactness of the Processes

6.2. Trajectories of a Limit Satisfy (10)

6.3. Equivalent Characterization of Solutions to (10) as Mean-Field Models

6.4. Uniqueness of Solution to (10) and Continuity in Initial State. Proof of Theorem 3(i)

6.5. Corollaries for the Centered Processes: Proof of Theorem 3(ii)

7. Proof of Theorem 1

7.1. Basic Properties of a Limit of Stationary Distributions

7.2. Characterization of a Limit of Stationary Distributions

7.3. Completion of the Proof of Theorem 1

8. Discussion

References

Volume 13, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

5.1. Equivalent View of Process $\overset{°}{f}^{n} (\cdot)$