Open Access

The Join-the-Shortest-Queue System in the Halfin-Whitt Regime: Rates of Convergence to the Diffusion Limit

Anton Braverman
Anton Braverman
[email protected]
https://orcid.org/0000-0003-4030-3172
Operations Department, Kellogg School of Management, Northwestern University, Evanston, Illinois 60208
Search for more papers by this author

Anton Braverman

[email protected]

https://orcid.org/0000-0003-4030-3172

Operations Department, Kellogg School of Management, Northwestern University, Evanston, Illinois 60208

Search for more papers by this author

Published Online:17 Nov 2022https://doi.org/10.1287/stsy.2022.0102

Abstract

We bound the rate at which the steady-state distribution of the join-the-shortest-queue (JSQ) system converges, in the Halfin-Whitt regime, to its diffusion limit. Our proof uses Stein’s method and, specifically, the recently proposed prelimit generator comparison approach. The JSQ system is nontrivial and high-dimensional and has a state-space collapse component; our analysis may serve as a helpful example to readers wishing to apply the approach to their own setting.

1. Introduction

Consider a queueing system with n identical servers, each with a finite buffer of length b. Customers arrive according to a Poisson process with rate $n λ$ , and service times are independent and identically distributed (i.i.d.), exponentially distributed with rate one. Customers cannot change servers after the initial routing decision, and a customer arriving to a system where all servers are busy and all buffers are full is blocked. This is known as a parallel-server system. A load-balancing policy specifies the manner in which arriving customers are assigned to the servers. In this paper, we consider the classical join-the-shortest-queue (JSQ) policy. Under JSQ, an arriving customer enters service immediately if at least one server is idle; if not, they get routed to the server with the smallest number of customers in its buffer. Ties are broken arbitrarily. We refer to this as the JSQ system.

Parallel-server systems have generated immense interest in recent years, and the JSQ policy is fundamental because it minimizes the expected customer delay and maximizes, with respect to stochastic order, the number of customers served in a given time interval; see, for instance, Winston (1977) and Weber (1978). For a sample of recent work on the JSQ policy, we refer readers to Eryilmaz and Srikant (2012), Mukherjee et al. (2016), Eschenfeldt and Gamarnik (2018), Banerjee and Mukherjee (2019), Gupta and Walton (2019), Liu and Ying (2019), Banerjee and Mukherjee (2020), Braverman (2020), Zhou and Shroff (2020a, b), Cao et al. (2021), Hurtado-Lange and Maguluri (2022), and Zhao et al. (2021). Other popular load-balancing policies include the join-the-idle-queue policy (Stolyar 2015, Mukherjee et al. 2016), the idle-one-first policy (Gupta and Walton 2019), and of course the power-of-d policy (Vvedenskaya et al. 1996, Mitzenmacher 2001); but in this paper, we focus on the JSQ policy. We make no attempt to give a comprehensive review of the literature on parallel-server systems, instead referring the reader to van der Boor et al. (2021) for a recent survey.

Understanding the exact performance of the system is known to be difficult, and much attention has been devoted over the past decade to “heavy-traffic” asymptotics. The term heavy-traffic refers to parameter regimes where the system utilization tends to one. “Conventional heavy traffic” assumes that the number of servers n is fixed and $λ ↑ 1$ , whereas “many-server heavy traffic” assumes that $n \to \infty$ and $λ ↑ 1$ jointly. For two examples of work in the conventional heavy-traffic setting, see Eryilmaz and Srikant (2012) and Zhou and Shroff (2020b). In this paper, we use the term heavy-traffic to refer to the many-server setting—the setting considered in most of the papers mentioned in the previous paragraph.

There are multiple many-server heavy-traffic regimes, depending on how n and λ jointly converge to their limit. For example, assuming that $λ = 1 - 1 / n$ yields entirely different asymptotic behavior compared to when $λ = 1 - 1 / \sqrt{n}$ . To capture all the possible heavy-traffic regimes, it is common practice to assume that the per-server load λ is related to the number of servers n through $λ = 1 - β / n^{α} \in (0, 1)$ for some $α \geq 0$ and $β > 0$ . In this paper, we focus on the case when $α = 1 / 2$ ; that is, $λ = 1 - β / \sqrt{n}$ . This regime is known as the Halfin-Whitt regime and is ubiquitous across the queueing theory literature. It derives from the work of Halfin and Whitt (1981) and is also known as the quality-and-efficiency-driven regime because it achieves reasonable customer wait times while maintaining high utilization of servers. The full list of parameter regimes is found in Figure 1.

Figure 1. The Various Many-Server Heavy-Traffic Regimes
*Notes*. Higher values of α represent heavier loads. Existing work across the different parameter regimes is reviewed in Section 1.1.

We now state and discuss our main results. Let $Q_{i} (t)$ be the number of servers with i or more customers at time $t \geq 0$ , noting that $Q_{i} (t) = 0$ for $i > b + 1$ . The process ${Q (t) = (Q_{1} (t), \dots, Q_{b + 1} (t))}$ is an irreducible continuous-time Markov chain (CTMC) on a finite state space and therefore possesses a unique stationary distribution. We let $Q = (Q_{1}, \dots, Q_{b + 1})$ be the random vector having the stationary distribution of the CTMC. To describe the asymptotic behavior of Q, we let $δ = 1 / \sqrt{n}$ and define the diffusion-scaled random vector $X = (X_{1}, \dots, X_{b + 1})$ by $X_{1} = δ (n - Q_{1})$ , and $X_{i} = δ Q_{i}$ for $2 \leq i \leq b + 1$ . The results of Eschenfeldt and Gamarnik (2018) and Braverman (2020) imply that X converges in distribution to some limiting $R_{+}^{b + 1}$ -valued random vector Y as $n \to \infty$ . In this paper, we establish an upper bound of order $1 / \sqrt{n}$ on the rate of convergence to Y.

The random variable Y is distributed according to the stationary distribution of the diffusion process ${Y (t) \in R_{+}^{b + 1}}$ , which satisfies

Y_{1} (t) = Y_{1} (0) + \sqrt{2} W (t) + β t - \int_{0}^{t} (Y_{1} (s) + Y_{2} (s)) d s + U (t),

Y_{2} (t) = Y_{2} (0) + U (t) - \int_{0}^{t} Y_{2} (s) d s, Y_{3} (t) = \dots = Y_{b + 1} (t) = 0,

(1)

where

{W (t)}

is standard Brownian motion and

{U (t)}

is the unique nondecreasing, nonnegative process in the space of càdlàg functions

D [0, \infty)

satisfying

\int_{0}^{\infty} 1 (Y_{1} (t) > 0) d U (t) = 0

. The diffusion

{Y (t)}

was shown to be positive recurrent; see Banerjee and Mukherjee (2019) or Braverman (2020). Furthermore, (1) implies that

Y_{3} = \dots = Y_{b + 1} = 0

Our main result is that there exists a constant $C (b, β)$ such that for all $n \geq 1$ , and any function $h : R_{+}^{b + 1} \to R$ whose first-order and second-order partial derivatives are bounded in magnitude by one,

| E h (X) - E h (Y) | \leq C (b, β) / \sqrt{n} .

(2)

The assumption that b is finite is used frequently in the proof of (2) and, specifically, in the proof of Proposition 1. We deem the finite buffer assumption to be acceptable because it was shown by Braverman (2020) that even with infinite-sized buffers, $E Q_{3} \leq C (β)$ for all $n \geq 1$ in the Halfin-Whitt regime, implying that $X_{3} \Rightarrow 0$ or that the mass concentrates on those states with at most one customer waiting. Moreover, Liu and Ying (2020) showed that assuming finite buffers, $E Q_{3} \to 0$ as $n \to \infty$ in the even busier super-Halfin-Whitt regime ( $1 / 2 < α < 1$ ).

In addition to the novelty of our result, this paper makes a methodological contribution. We prove (2) using Stein’s method, a framework introduced by Stein (1972) that allows one to study the rate of convergence of a sequence of random variables to its limit. Popularized in the area of queueing systems by Gurvich (2014), Ying (2017), Braverman and Dai (2017), and Gast (2017), the generator comparison approach of Stein’s method, attributed to Barbour (1988, 1990) and Götze (1991), is used to study convergence rates of steady-state Markov chain distributions to their diffusion, fluid, or mean-field limits. For a few recent applications of the generator comparison approach in queueing, we refer the reader to Gaunt and Walton (2020), Hurtado-Lange and Maguluri (2022), Lu (2021), and Liu et al. (2022); this list is by no means comprehensive. In this paper, we restrict our attention to the case when the limit is the stationary distribution of a diffusion process, referring the reader to Ying (2017) for a treatment of fluid and mean-field limits.

The generator approach requires bounds on various moments of the prelimit, known as moment bounds, and bounds on the derivatives of the solution to the Poisson equation for the limiting distribution. The latter are called gradient bounds in Braverman and Dai (2017). But in this paper, we stick with the original term “Stein factors” or “Stein factor bounds”; see, for example, Ross (2011). Although moment bounds can be difficult to obtain in some applications, Stein factor bounds are typically the bigger problem. When the limit is one dimensional, Stein factors are bounded using the explicit form of the solution to the Poisson equation—an ordinary differential equation. When the limit is multidimensional, the Poisson equation is a partial differential equation (PDE) that generally does not have an explicit solution, making Stein factor bounds harder to establish. Techniques proposed to obtain multidimensional Stein factor bounds include using a priori Schauder estimates from elliptic PDE theory as in Gurvich (2014), using couplings to analyze and bound the sensitivity of the diffusion to its initial condition as in Barbour (1988) and Mackey and Gorham (2016), and bounding the Stein factors using Malliavin calculus as in Fang et al. (2018) and Jin et al. (2022). A detailed description of these techniques can be found in section 1.1 of Braverman (2022). However, despite progress on multidimensional Stein factor bounds, the JSQ system is not covered by existing results because our limiting diffusion in (1) is constrained to the nonnegative orthant via reflecting boundary conditions.

To deal with the Stein factor bound problem, this paper promotes the use of the prelimit generator comparison approach, which was recently proposed by Braverman (2022) as an alternative to the generator comparison approach. The prelimit approach is the mirror image of the classical generator approach. Whereas the latter requires moment bounds on the prelimit X and Stein factor bounds for limit Y, the former needs moment bounds on Y and Stein factor bounds for the prelimit X. For the moment bounds used in this paper, the result that all moments of Y are finite, proved by Banerjee and Mukherjee (2019), is sufficient because our limit Y does not depend on n. The Stein factor bounds pose a bigger challenge, and we deal with them in Section 3. It was noted in Braverman (2022) that the prelimit and classical generator comparison approaches should be equivalent, in theory, in the sense that any bound on $| E h (X) - E h (Y) |$ obtained using one of them should be attainable using the other. However, in practice, one approach could be more tractable, or convenient, to work with; see, for instance, the example in section 4 of Braverman (2022). In the case of the JSQ system, we discuss in Remark 1 of Section 3.2.3 how the discrete state space simplifies the analysis of the couplings we use to establish Stein factor bounds because the initial spacing of the coupled systems is preserved until coupling.

The introduction of the prelimit approach in Braverman (2022) was intended to be gentle, with the only example used there being the $M / M / 1$ system. Our application of the approach to the JSQ system exposes all of its moving pieces and can be useful to those who want to apply the prelimit approach to their own setting. For example, some of the technical components of this paper that could be useful in other settings include the regenerative argument used to establish first-order Stein factor bounds in Section 3.1, the approach we use to bound $E | X |$ in Section 3.2.1, and our treatment of reflecting boundary conditions in Section A.2.2 in Appendix A.

It should be noted that Hurtado-Lange and Maguluri (2022) and Zhou and Shroff (2020a) used the classical generator comparison approach to obtain rates of convergence of the steady-state total customer count to an exponential random variable for $α > 2$ . The former paper was in the continuous-time setting, whereas the latter considered the discrete-time system; the results in both papers also hold for routing policies other than JSQ, such as the power-of-d policy. Because the limiting random variable in both papers is one dimensional, the Stein factors bounds do not pose a challenge there.

1.1. Literature Review

Let us first review the literature on the analysis of the JSQ system in the various many-server heavy-traffic regimes. Most of the work has been done in the setting with infinite buffer sizes, so, unless otherwise noted, we assume that $b = \infty$ . In Eschenfeldt and Gamarnik (2018), the authors established the process-level convergence of ${X (t)}_{n = 1}^{\infty}$ to its diffusion limit in the Halfin-Whitt regime ( $α = 1 / 2$ ). That paper triggered a wave of interest in the many-server heavy-traffic asymptotics of the JSQ system. Convergence of the stationary distributions was later established by Braverman (2020), and the behavior of the stationary distribution of the limiting diffusion was studied by Banerjee and Mukherjee (2019, 2020). Our work fits with this group of papers, elevating the steady-state convergence result to one with rates of convergence.

Outside the Halfin-Whitt regime, Mukherjee et al. (2016) studied the transient and steady-state behavior of the JSQ system’s fluid limit when $λ = 1 - β < 1$ is a fixed constant (α = 0); Gupta and Walton (2019) established process-level convergence to the diffusion limit when α = 1, known as the nondegenerate slowdown (NDS) regime and introduced by Atar (2012). In the sub-Halfin-Whitt regime when $α \in (0, 1 / 2)$ , Liu and Ying (2019) assumed finite buffers and obtained bounds on the steady-state total customer count in the system. A similar result was obtained for Coxian-2 service times by Liu et al. (2022) and by Liu and Ying (2020) for the super-Halfin-Whitt regime $α \in (1 / 2, 1)$ . Another recent work in the super-Halfin-Whitt regime was by Zhao et al. (2021), who worked with infinite buffers and established transient and steady-state diffusion limits for the normalized total queue length process. Their analysis exploited the regenerative structure of the JSQ system and contained several hitting-time estimates very close to our own estimates needed for the Stein factor bounds in Section 3. Lastly, both Hurtado-Lange and Maguluri (2022) and Zhou and Shroff (2020a) established rates of convergence to the exponential distribution for the steady-state normalized total customer count. Their results covered the case when $α > 2$ .

Other works have used Stein’s method in the setting of parallel-server systems beyond Hurtado-Lange and Maguluri (2022) and Zhou and Shroff (2020a). In Liu and Ying (2019, 2020) and Liu et al. (2022), the authors used Stein’s method for mean-field analysis to obtain bounds on steady-state performance metrics of interest, like $E Q_{2}$ for instance, for the power-of-d system. Another line of work on power-of-d systems was by Gast (2017), Gast and Van Houdt (2017), and Gast et al. (2019), where the authors showed how to derive refined mean-field models for improved steady-state approximations. More recently, Hairi and Ying (2021) provide calculable error bounds for the mean-field approximation of the power-of-two-choices model.

1.2. Notation

We use $ℤ$ to denote the set of integers and let $N = {0, 1, 2, \dots}$ . For any $k \in N$ and $B \subset R^{d}$ , we let $C^{k} (B)$ be the set of all k-times continuously differentiable functions $f : B \to R$ . We let $e \in R^{d}$ be the vector whose elements all equal one and let $e^{(i)}$ be the element with one in the ith entry and zeros otherwise. For any $δ > 0$ and integer d > 0, we let $δ ℤ^{d} = {δ k : k \in ℤ^{d}}$ and define $δ N^{d}$ similarly. For any function $f : δ ℤ^{d} \to R$ , we define the forward difference operator in the ith direction as

Δ_{i} f (δ k) = f (δ (k + e^{(i)})) - f (δ k), k \in ℤ^{d}, 1 \leq i \leq d;

for

j \geq 0

, we define

Δ_{i}^{j + 1} f (δ k) = Δ_{i}^{j} f (δ (k + e^{(i)})) - Δ_{i}^{j} f (δ k),

(3)

with the convention that

Δ_{i}^{0} f (δ k) = f (δ k)

. For a vector

a \in N^{d}

, we also let

Δ^{a} f (δ k) = Δ_{1}^{a_{1}} \dots Δ_{d}^{a_{d}} f (δ k);

f : R^{d} \to R

, then

\frac{\partial^{a}}{\partial x^{a}} f (x) = \frac{\partial^{a_{1}}}{\partial x_{1}^{a_{1}}} \dots \frac{\partial^{a_{d}}}{\partial x_{d}^{a_{d}}} f (x),

and we adopt the convention that

\partial^{0} f (x) / \partial x^{0} = f (x)

. For any

x \in R^{d}

, we define

{‖ x ‖}_{1} = \sum_{i = 1}^{d} | x_{i} |

and use

| x |

to denote the Euclidean norm. For any

f : R^{d} \to R

, we let

{‖ f ‖}_{\infty} = \sup_{x \in R^{d}} | f (x) |

. Throughout the paper, we will often use C to denote a generic positive constant that may change from line to line and that is independent of any parameters not explicitly specified.

2. Main Result

Recall that $Q_{i} (t)$ is the number of servers with i or more customers at time $t \geq 0$ and that ${Q (t) = {(Q_{i} (t))}_{i = 1}^{b + 1}}_{t \geq 0}$ is an irreducible CTMC with state space given by

S_{Q} = {q \in {0, \dots, n}^{b + 1} : q_{i} \geq q_{i + 1}} .

(4)

Figure 2 gives an example of a state $q \in S_{Q}$ .

Figure 2. An Example of a State Q(t) = q in a System Where the Number of Servers n = 5
*Notes*. Customers below the dashed horizontal line are in service, while those above are waiting in buffers. Each vertical column corresponds to a server and its buffer.

We assume that $λ = 1 - β / \sqrt{n}$ for some fixed $β > 0$ . Let $δ = 1 / \sqrt{n}$ and define the diffusion-scaled CTMC ${X (t)}$ by

X_{1} (t) = δ (n - Q_{1} (t)), X_{i} (t) = δ Q_{i} (t), 2 \leq i \leq b + 1,

which takes values on the state space

S = {(x_{1}^{q}, x_{2}^{q}, \dots, x_{b + 1}^{q}) = (δ (n - q_{1}), δ q_{2}, \dots, δ q_{b + 1}) : q \in S_{Q}} .

We will often use $x^{q} \in S$ and $q \in S_{Q}$ interchangeably. Recalling that $Δ_{i} f (x^{q}) = f (x^{q} + δ e^{(i)}) - f (x^{q})$ , for any $f : S \to R$ , the infinitesimal generator of ${X (t)}$ satisfies

\begin{array}{l} G_{X} f (x^{q}) = - 1 (q_{1} < n) n λ Δ_{1} f (x^{q} - δ e^{(1)}) + n λ \sum_{j = 1}^{b} 1 (q_{1} = \dots = q_{j} = n, q_{j + 1} < n) Δ_{j + 1} f (x^{q}) \\ + (q_{1} - q_{2}) Δ_{1} f (x^{q}) - \sum_{j = 2}^{b} (q_{j} - q_{j + 1}) Δ_{j} f (x^{q} - δ e^{(j)}) - q_{b + 1} Δ_{b + 1} f (x^{q} - δ e^{(b + 1)}) . \end{array}

(5)

The first line of transitions in (5) corresponds to arrivals. We see that for $j \geq 2$ , the jth component of $x_{j}^{q}$ only grows provided the preceding j – 1 horizontal levels, as depicted in Figure 2, are full. The transitions in the second line of (5) correspond to service completions. Using Figure 2 again, we interpret $(q_{j} - q_{j + 1})$ as the number of servers (vertical columns) with exactly j customers.

Recall that $X = (X_{1}, \dots, X_{b + 1})$ and $Y = (Y_{1}, Y_{2}, 0, \dots, 0)$ are distributed according to the stationary distributions of the scaled CTMC and the diffusion ${Y (t) \in R_{+}^{b + 1}}$ defined in (1), respectively. Going forward, we note that unless explicitly stated, all expectations are with respect to the stationary distribution at hand, that is, either X or Y. To state our main result, we define

M_{j} = {h^{*} : R^{b + 1} \to R, {‖ \frac{\partial^{a}}{\partial x^{a}} h^{*} (x) ‖}_{\infty} \leq 1, 1 \leq {‖ a ‖}_{1} \leq j},

and

d_{M_{j}} (X, Y) = \sup_{h^{*} \in M_{j}} | E h^{*} (X) - E h^{*} (Y) |

. We use an asterisk to emphasize that

h^{*} (x)

is defined on the continuum

R^{b + 1}

. Later we will drop the asterisk to refer to functions defined only on the grid

δ ℤ^{b + 1}

. It was shown in lemma 2.2 of Mackey and Gorham (2016) that

M_{3}

is a convergence-determining class; that is,

d_{M_{3}} (U, V) \to 0

implies U and V converge in distribution. The following is our main result.

Theorem 1.

For any $0 < b < \infty$ , there exists a constant $C (b, β)$ such that for all $n \geq 1$ ,

d_{M_{2}} (X, Y) = \sup_{h^{*} \in M_{2}} | E h^{*} (X) - E h^{*} (Y) | \leq C (b, β) / \sqrt{n} .

(6)

Note that $M_{2}$ is also a convergence-determining class because $M_{3} \subset M_{2}$ . We prove Theorem 1 in Section 2.1 using the prelimit generator approach of Stein’s method. Multiple parts of the proof assume that n is large enough, say, $n > N (β)$ for some $N (β) > 0$ . We can make this assumption without loss of generality by redefining $C (b, β)$ to be larger than $\max_{1 \leq n \leq N (β)} d_{M_{2}} (X, Y)$ .

2.1. Proving Theorem 1

Central to our proof is the ability to extend any grid-valued function to be defined on all of $R_{+}^{b + 1}$ . Although there are infinitely many such extensions, we use a polynomial spline A that extends grid-valued functions $f : δ N^{b + 1} \to R$ to functions $A f : R_{+}^{b + 1} \to R$ . We leave the detailed construction to Section A.2 in Appendix A because for this section, it suffices to know that A is a linear operator, that $A f \in C^{2} (R_{+}^{b + 1})$ , and that A applied to a constant equals that constant. Recalling that $δ = 1 / \sqrt{n}$ , the following auxiliary lemma is needed.

Lemma 1.

Define

M_{disc, j} (c) = {h : δ N^{b + 1} \to R, | Δ^{a} h (δ k) | \leq c δ^{{‖ a ‖}_{1}}, 1 \leq {‖ a ‖}_{1} \leq j, δ k \in δ N^{b + 1}} .

There exist some $C, C^{'} > 0$ independent of any JSQ model parameters such that

d_{M_{2}} (X, Y) \leq \sup_{h \in M_{disc, 2} (C)} | E h (X) - E A h (Y) | + C^{'} δ .

(7)

Proof of Lemma 1.

The result follows by repeating the arguments used in the proof of lemma 1 in Braverman (2022). □

Going forward, when we write $M_{disc, 2} (C)$ , the constant C is assumed to be the one in Lemma 1. Furthermore, note that if $h (0) \neq 0$ , then the linearity of A and the fact that A applied to a constant equals that constant implies that $\tilde{h} (x) = h (x) - h (0)$ satisfies $E \tilde{h} (X) - E A \tilde{h} (Y) = E h (X) - E A h (Y)$ . We therefore, without loss of generality, consider only those $h \in M_{disc, 2} (C)$ such that $h (0) = 0$ .

To prove Theorem 1, we bound the right-hand side of (7) with the help of the following two ingredients. The first ingredient is a rate-conservation law for ${Y (t)}$ , proved in Appendix A.

Lemma 2.

Given $f \in C^{2} (R_{+}^{b + 1})$ , define

G_{Y} f (x) = (β - (x_{1} + x_{2})) \frac{\partial}{\partial x_{1}} f (x) - x_{2} \frac{\partial}{\partial x_{2}} f (x) + \frac{\partial^{2}}{\partial x_{1}^{2}} f (x), x \in R_{+}^{b + 1} .

(8)

If $E | f (Y) | < \infty$ and $E | G_{Y} f (Y) | < \infty$ , and if Y(0) is initialized according to Y, then

E G_{Y} f (Y) + E (\int_{0}^{1} (\frac{\partial}{\partial x_{1}} f (Y (s)) + \frac{\partial}{\partial x_{2}} f (Y (s))) 1 (Y_{1} (s) = 0) d U (s)) = 0 .

(9)

The second ingredient is the Poisson equation. For $h : δ N^{b + 1} \to R$ and $c \in R$ , let

f_{h}^{(c)} (x^{q}) = c + \int_{0}^{\infty} (E_{x^{q}} h (X (t)) - E h (X)) d t, x^{q} \in S,

which is well defined because the CTMC has a finite state space and is therefore exponentially ergodic. Furthermore, lemma 2 of Braverman (2022) (see also lemma 1 of Barbour 1988) implies that

G_{X} f_{h}^{(c)} (x^{q}) = E h (X) - h (x^{q}), x^{q} \in S .

(10)

Most applications of Stein’s method have c = 0, but we choose $c = c^{*} = - f_{h}^{(0)} (0)$ and define

\begin{array}{l} f_{h} (x^{q}) = f_{h}^{(c^{*})} (x^{q}) = \int_{0}^{\infty} (E_{x^{q}} h (X (t)) - E h (X)) d t - \int_{0}^{\infty} (E_{0} h (X (t)) - E h (X)) d t \\ = \int_{0}^{\infty} (E_{x^{q}} h (X (t)) - E_{0} h (X (t))) d t, x^{q} \in S . \end{array}

(11)

Our choice of c yields $f_{h} (0) = 0$ , which comes in handy later when we need to bound $| f_{h} (x^{q}) |$ in Proposition 1. Going forward, we assume that $c = c^{*}$ when referring to (10).

Let us give an informal roadmap for bounding (7), with the formal statement of the bounds left to Proposition 2. We bound (6) by comparing the CTMC and diffusion generators. However, the former is defined only on a subset of $R_{+}^{b + 1}$ , which requires the following workaround. Suppose that we are given a set $B \subset R_{+}^{b + 1}$ such that (a) $E h (X) - A h (x) = A G_{X} f_{h} (x)$ for $x \in B$ and (b) the probability that $Y \notin B$ goes to zero rapidly (we will make this precise) as $n \to \infty$ . We decompose $E h (X) - A h (x)$ as

E h (X) - A h (x) = A G_{X} f_{h} (x) 1 (x \in B) + (E h (X) - A h (x)) 1 (x \notin B)

and take expected values with respect to Y (we will show that these are finite) to get

E h (X) - E A h (Y) = E (A G_{X} f_{h} (Y) 1 (Y \in B)) + E ((E h (X) - A h (Y)) 1 (Y \notin B)) .

Now extend $f_{h} (x^{q})$ to $δ N^{b + 1}$ by defining $f_{h} (x^{q}) = 0$ for $x^{q} \in δ N^{b + 1} ∖ S$ and consider $A f_{h} (x)$ . Provided that $E | A f_{h} (Y) | < \infty$ and $E | G_{Y} A f_{h} (Y) | < \infty$ , we can invoke Lemma 2 with $f (x) = A f_{h} (x)$ there to conclude that

\begin{array}{l} E h (X) - E A h (Y) = E ((A G_{X} f_{h} (Y) - G_{Y} A f_{h} (Y)) 1 (Y \in B)) \\ + E ((E h (X) - A h (Y) - G_{Y} A f_{h} (Y)) 1 (Y \notin B)) \\ - E (\int_{0}^{1} (\frac{\partial}{\partial x_{1}} A f_{h} (Y (s)) + \frac{\partial}{\partial x_{2}} A f_{h} (Y (s))) 1 (Y_{1} (s) = 0) d U (s)), \end{array}

(12)

where Y(0) in the third line is initialized according to Y. We bound the first line by showing that G_X and G_Y are close to one another. The middle term is small because of our choice of B, and the last term can be bounded because the JSQ system exhibits reflecting behavior similar to

{Y (t)}

at the boundary

{x \in S : x_{1}^{q} = 0}

. As a final remark, our choice of

f_{h} (x^{q}) = 0

for

x^{q} \in δ N^{b + 1} ∖ S

is made for convenience and is not essential to the proof because the probability that

Y \notin B

shrinks rapidly as

n \to \infty

To state the following proposition, define $k : R^{b + 1} \to ℤ^{b + 1}$ elementwise by $k_{j} (x) = ⌊ x_{j} / δ ⌋$ . For notational convenience, we also define $I = {i = (i_{1}, i_{2}, 0, \dots, 0) \in N^{b + 1} : 0 \leq i_{1}, i_{2} \leq 4}$ . The following proposition is proved in Section A.2 in Appendix A.

Proposition 2.

If $h \in M_{disc, 2} (C)$ , then Ah(Y), $A f_{h} (Y)$ , and $G_{Y} A f_{h} (Y)$ are integrable and (12) holds. Furthermore, suppose that n > 16, define

B = {(x_{1}, x_{2}, 0, \dots, 0) \in R_{+}^{b + 1} : x_{2} + x_{1} \leq δ (n / 2 - 8) = (n / 2 - 8) / \sqrt{n}},

and let

\begin{array}{l} ε_{1} (Y) = (A G_{X} f_{h} (Y) - G_{Y} A f_{h} (Y)) 1 (Y \in B), \\ ε_{2} (Y) = (E h (X) - A h (Y) - G_{Y} A f_{h} (Y)) 1 (Y \notin B), \\ ε_{3} (Y) = (\frac{\partial}{\partial x_{1}} A f_{h} (Y) + \frac{\partial}{\partial x_{2}} A f_{h} (Y)) 1 (Y \in B), and \\ ε_{4} (Y) = (\frac{\partial}{\partial x_{1}} A f_{h} (Y) + \frac{\partial}{\partial x_{2}} A f_{h} (Y)) 1 (Y \notin B) . \end{array}

There exist $C (β), C (b, β) > 0$ independent of h(x) and n such that

\begin{array}{l} | ε_{1} (Y) | \leq & C (β) (1 + δ^{- 1} Y_{2}) \max_{\begin{matrix} i \in I \\ a_{1} + a_{2} = 2 \end{matrix}} | Δ_{1}^{a_{1}} Δ_{2}^{a_{2}} f_{h} (δ (k (Y) + i)) | + C (β) δ^{- 2} \max_{i \in I} | Δ_{1}^{3} f_{h} (δ (k (Y) + i)) | \\ + C (β) δ^{- 2} 1 (Y_{1} \leq δ) \max_{\begin{matrix} i \in I \\ i_{1} = 0 \end{matrix}} | (Δ_{1}^{2} - (Δ_{1} + Δ_{2})) f_{h} (δ (k (Y) + i)) |, \\ | ε_{2} (Y) | \leq & C (b, β) 1 (Y \notin B) δ^{- 2} (1 + Y_{1} + Y_{2}) \max_{i \in I} | f_{h} (δ (k (Y) + i)) |, \\ | ε_{3} (Y) | \leq & C (β) δ^{- 1} 1 (Y \in B) (| (Δ_{1} + Δ_{2}) f_{h} (δ k (Y) | + \max_{\begin{matrix} i \in I \\ a_{1} + a_{2} = 2 \end{matrix}} | Δ_{1}^{a_{1}} Δ_{2}^{a_{2}} f_{h} (δ (k (Y) + i)) |), \\ | ε_{4} (Y) | \leq & C (β) δ^{- 1} 1 (Y \notin B) \max_{i \in I} | f_{h} (δ (k (Y) + i)) | . \end{array}

Note that $ε_{1} (Y)$ and $ε_{2} (Y)$ are related to the first and second lines of (12), respectively, whereas $ε_{3} (Y)$ and $ε_{4} (Y)$ are related to the last line there. From the bounds in Proposition 2, we see that the bound on (12) depends on the CTMC through the function $f_{h} (x^{q})$ and its differences and on the diffusion through the distribution of Y. The differences of $f_{h} (x^{q})$ are commonly known as Stein factors; the following proposition, proved in Section 3, exhibits the Stein factor bounds we need to prove Theorem 1.

Proposition 1.

There exists $C (β, b) > 0$ such that for any $n \geq 1$ and $h \in M_{disc, 2} (C)$ ,

| Δ_{1}^{a_{1}} Δ_{2}^{a_{2}} f_{h} (x^{q}) | \leq C (β, b) δ^{a_{1} + a_{2}} {(1 + x_{2}^{q})}^{a_{1} + a_{2}},

for all

a_{1}, a_{2} \geq 0

with

1 \leq a_{1} + a_{2} \leq 2

, and all

x^{q} \in S

with

x_{1}^{q} \leq δ (n - a_{1}), x_{2}^{q} \leq δ (n - a_{2})

, and

x_{3}^{q} = 0

. Furthermore,

\begin{array}{l} | f_{h} (x^{q}) | \leq C (β, b) (1 + x_{2}^{q}) (x_{1}^{q} + x_{2}^{q}) / δ, & x^{q} \in S, x_{3}^{q} = 0, \\ | Δ_{1}^{3} f_{h} (x^{q}) | \leq C (β, b) δ^{3} {(1 + x_{2}^{q})}^{3}, & x^{q} \in S, x_{1}^{q} \leq δ (n - 3), x_{3}^{q} = 0, \end{array}

and for all

x^{q} \in S

with

x_{1}^{q} = 0, 0 \leq x_{2}^{q} \leq δ (n - 1)

, and

x_{3}^{q} = 0

\begin{array}{l} | (Δ_{1} + Δ_{2}) f_{h} (x^{q}) | \leq & C (β, b) δ^{2} {(1 + x_{2}^{q})}^{2} and \\ | (Δ_{1}^{2} - (Δ_{1} + Δ_{2})) f_{h} (x^{q}) | \leq & C (β, b) δ^{3} {(1 + x_{2}^{q})}^{3} . \end{array}

The last component needed for the proof of Theorem 1 is the following lemma.

Lemma 3.

All moments of Y₁ and Y₂ are finite. Furthermore, suppose that Y(0) is initialized according to Y. Then for any j > 0,

E Y_{2}^{j + 1} = (\int_{0}^{1} (Y_{2} (s))^{j} 1 (Y_{1} (s) = 0) d U (s)) .

(13)

Proof of Lemma 3.

The finiteness of the moments follows from theorem 2.1 of Banerjee and Mukherjee (2019), and (13) is implied by (9) of Lemma 2 with $f (y) = y_{2}^{j + 1}$ there. □

Proof of Theorem 1.

Initialize Y(0) according to Y. Using (12) and the definitions of $ε_{1} (Y), \dots, ε_{4} (Y)$ , it follows that

E h (X) - E A h (Y) = E ε_{1} (Y) + E ε_{2} (Y) - E (\int_{0}^{1} (ε_{3} (Y (s)) + ε_{4} (Y (s))) 1 (Y_{1} (s) = 0) d U (s)) .

We argue that $| E h (X) - E A h (Y) | \leq C (b, β) δ$ for any $h \in M_{disc, 2} (C)$ , which implies Theorem 1 when combined with Lemma 1. Because $δ (k_{2} (Y) + i_{2}) \leq Y_{2} + 4 δ$ for $i \in I$ , applying the Stein factor bounds in Proposition 1 with the bounds on $ε_{1} (Y)$ and $ε_{2} (Y)$ in Proposition 2 yields

| ε_{1} (Y) | \leq C (b, β) 1 (Y \in B) δ {(1 + Y_{2})}^{3}, | ε_{2} (Y) | \leq 1 (Y \notin B) C (β) δ^{- 3} {(1 + Y_{1} + Y_{2})}^{3} .

(14)

We point out that

δ^{- 1} \leq C (Y_{1} + Y_{2}), for any Y \notin B,

(15)

which follows from the facts that

Y_{1} + Y_{2} \geq δ (n / 2 - 8) = δ^{- 1} / 2 - δ

for

Y \notin B

, that

δ = 1 / \sqrt{n}

, and that n > 16. Combining (14), (15), and the fact that the moments of Y_i are finite yields

E | ε_{1} (Y) | + E | ε_{2} (Y) | \leq C (b, β) δ E {(1 + Y_{1} + Y_{2})}^{7} \leq C (b, β) δ .

Furthermore, applying the Stein factor bounds in Proposition 1 to the bounds on $ε_{3} (Y)$ and $ε_{4} (Y)$ in Proposition 2, and using (15), we get

\begin{array}{l} | ε_{3} (Y) + ε_{4} (Y) | \leq C (b, β) 1 (Y \in B) δ {(1 + Y_{2})}^{2} + C (b, β) 1 (Y \notin B) δ^{- 2} {(1 + Y_{1} + Y_{2})}^{2} \\ \leq C (b, β) δ {(1 + Y_{1} + Y_{2})}^{5} . \end{array}

Thus, (13) of Lemma 3 implies that

E (\int_{0}^{1} | ε_{3} (Y (s)) + ε_{4} (Y (s)) | 1 (Y_{1} (s) = 0) d U (s)) \leq C (b, β) δ . □

3. Stein Factor Bounds

In this section, we prove Proposition 1. We bound the first-order differences in Section 3.1. This requires the most effort. The second-order differences are bounded at the start of Section 3.2, with Section 3.2.1 showing how they can be used to bound $E | h (X) |$ , which may be of independent interest. Section 3.2.2 contains the third-order bounds, and Section 3.2.3 proves two technical lemmas needed for the second-order bounds.

3.1. First-Order Differences

In this section, we bound

Δ_{i} f_{h} (x^{q}) = \int_{0}^{\infty} E_{x^{q} + δ e^{(i)}} h (X (t)) - E_{x^{q}} h (X (t)) d t

by coupling two copies of the JSQ model initialized one customer apart. The coupling is introduced in the following lemma, which is stated in terms of the unscaled CTMC

{Q (t)}

Lemma 4.

For $1 \leq i \leq b + 1$ , define $Θ_{i}^{Q} = {(q, \tilde{q}) \in S_{Q} \times S_{Q} : q_{i} < n, {\tilde{q}}_{i} = q_{i} + 1}$ . There exists a coupling ${\tilde{Q} (t)}$ of ${Q (t)}$ whose transient distribution satisfies

{\tilde{Q} (t) | (Q (0), \tilde{Q} (0)) \in Θ_{i}^{Q}, Q (0) = q}_{t \geq 0} \overset{d}{=} {Q (t) | Q (0) = (q + e^{(i)})} .

(16)

Furthermore, if $(Q (0), \tilde{Q} (0)) \in \cup_{i = 1}^{b + 1} Θ_{i}^{Q}$ , then

The equality $\tilde{Q} (t) = Q (t) holds for all times t \geq τ_{C}$ , where $τ_{C} = \inf {t \geq 0 : Q (t) = \tilde{Q} (t)}$ .
The pair $(Q (t), \tilde{Q} (t))$ belongs to $\cup_{i = 1}^{b + 1} Θ_{i}^{Q}$ for all times $t < τ_{C}$ .
Let V be a unit-mean exponentially distributed random variable independent of ${Q (t)}$ . Then
$τ_{C} \overset{d}{=} \min {\inf_{t \geq 0} {\int_{0}^{t} 1 ((Q (s), \tilde{Q} (s)) \in Θ_{1}^{Q}) d s = V}, \inf_{t \geq 0} {Q_{b + 1} (t) = n}} .$ (17)

Proof of Lemma 4.

Let us construct a joint CTMC ${(Q (t), \tilde{Q} (t))}$ by specifying its transitions. For simplicity, we refer to ${Q (t)}$ as system 1 and to ${\tilde{Q} (t)}$ as system 2. We think of system 2 as a copy of system 1 but with an additional low-priority customer following a preemptive resume rule. That is, service is interrupted, and the extra customer moves to the back of its buffer when a regular customer joins, even if the low-priority customer is currently in service.

Any state in $Θ_{1}^{Q}$ is one where the low-priority customer is in service. The remaining $Θ_{i}^{Q}$ correspond to states where the low-priority customer is assigned to a server with a total of i customers; Figure 3 contains an example of a states in $Θ_{1}^{Q}$ and $Θ_{3}^{Q}$ . Assuming $(Q (0), \tilde{Q} (0)) = (q, \tilde{q}) \in Θ_{i}^{Q}$ for some $1 \leq i \leq b + 1$ , we now describe the possible transitions of the joint chain.

If i = 1, then the low-priority customer is in service. After a unit-mean exponentially distributed amount of time, the customer leaves system 2 and both systems couple. After coupling, systems 1 and 2 are identical in terms of current and future customers, so they coincide on every sample path. All other transitions of the joint chain are based on the standard transitions of the JSQ model. In other words, a service completion by any of the q₁ servers working in system 1 results in a customer departure from both systems.

Figure 4 illustrates the effect of arrivals when $(q, \tilde{q}) \in Θ_{1}^{Q}$ . Namely, when $q_{1} \leq n - 2$ , a new arrival is assigned to the same idle server in both systems. If a customer arrives when $q_{1} = n - 1$ , then system 1 has only one idle server and system 2 has none. In system 1, that customer will be assigned to the last remaining idle server. Recall that when defining our JSQ model, we allowed for an arbitrary tie-breaking decision in routing arrivals. Therefore, in system 2, we assign that customer to the server working on the low-priority customer, causing a service preemption and pushing the low-priority customer to the back of the buffer. An arrival when $q_{1} = n - 1$ transitions the joint chain from $Θ_{1}^{Q}$ to $Θ_{2}^{Q}$ .

If $2 \leq i \leq b$ , then the low-priority customer is in the back of some server’s buffer. A service completion by any of the q₁ servers working in system 1 results in a customer departure from both systems. If, however, the service completion happens at the server containing the low-priority customer, then the chain transitions from $Θ_{i}^{Q}$ to $Θ_{i - 1}^{Q}$ because the low-priority customer is now assigned to a server with i – 1 customers; see Figure 5 for a depiction of such a transition. All new arrivals get assigned to the same server in each system. Note that if an arrival happens when $q_{i} = n - 1$ and $q_{1} = \dots = q_{i - 1} = n$ , then the system transitions from $Θ_{i}^{Q}$ to $Θ_{i + 1}^{Q}$ .

The final case is when $i = b + 1$ . All transitions are identical to the $2 \leq i \leq b$ case, except for a customer arrival to a system where $q_{b + 1} = n - 1$ and $q_{1} = \dots = q_{b} = n$ . In that case, system 1 assigns the customer to the last available slot; but system 2 blocks the customer because it is already full. This transition causes the two systems to couple. Note that our construction immediately implies the three claims in Lemma 4. □

Figure 3. Two Possible States of the Joint Chain $(Q (t), \tilde{Q} (t))$ Are Depicted
*Notes*. The red circles correspond to customers in Q(t), while the blue circle is the extra customer in $\tilde{Q} (t)$ . In the figure on the left, the joint chain is in $Θ_{1}^{Q}$ , meaning the blue customer is in service and will leave the system after an exponentially distributed amount of time, coupling the joint chain. In the figure on the right, the joint chain is in $Θ_{3}^{Q}$ because the blue customer is assigned to a server with a total of three customers.

Figure 4. From Left to Right, the Figures Depict the Arrival of Two Customers
*Note*. The second arrival results in a transition from $Θ_{1}^{Q}$ to $Θ_{2}^{Q}$ .

**Figure 5. From Left to Right, the Server Containing the Blue Customer in Its Buffer Completes Service, Resulting in a Transition from $Θ_{2}^{Q}$ to $Θ_{1}^{Q}$**

Let $\tilde{X} (t) = (δ (n - {\tilde{Q}}_{1} (t)), δ {\tilde{Q}}_{2} (t), \dots, δ {\tilde{Q}}_{b + 1} (t))$ be the scaled version of $\tilde{Q} (t)$ . For any $x^{q} \in S$ with $x_{1}^{q} > 0$ , and any $h \in M_{disc, 2} (C)$ ,

\begin{array}{l} | \int_{0}^{\infty} E_{x - δ e^{(1)}} h (X (t)) - E_{x} h (X (t)) d t | = | \int_{0}^{\infty} E_{(x, x - δ e^{(1)})} (h (\tilde{X} (t)) - h (X (t))) d t | \\ \leq | \int_{0}^{\infty} E_{(x, x - δ e^{(1)})} (δ 1 (t \leq τ_{C})) d t | = δ E_{(x, x - δ e^{(1)})} τ_{C}, \end{array}

(18)

where

E_{(x, x - δ e^{(1)})} (\cdot)

denotes the expectation given

(X (0), \tilde{X} (0)) = (x, x - δ e^{(1)})

. The inequality is true because the gap between

{X (t)}

and

{\tilde{X} (t)}

never increases beyond one customer. The same argument implies that

| Δ_{i} f_{h} (x^{q}) | \leq δ E_{(x, x + δ e^{(i)})} τ_{C}

for

i \geq 2

, and we see that bounding the first-order Stein factors amounts to bounding the expected coupling time τ_C. The following lemma provides the necessary bound. It is worth highlighting that proving this result requires a large amount of effort and JSQ-model-specific insight.

Lemma 5.

For any $(q, \tilde{q}) \in \cup_{i = 1}^{b + 1} Θ_{i}^{Q}$ ,

E_{(q, \tilde{q})} τ_{C} \leq C (b, β) (1 + δ q_{2}) .

Before proving the lemma, we note that the first-order bounds in Proposition 1 are a consequence of (18) and Lemma 5; that is,

| Δ_{i} f_{h} (x^{q}) | \leq C (b, β) δ (1 + x_{2}^{q}), i = 1, \dots, b + 1 .

(19)

Furthermore, note that for any $x^{q} \in S$ with $x_{3}^{q} = 0$ ,

f_{h} (x^{q}) = f_{h} (0) + \sum_{j_{1} = 0}^{x_{1}^{q} / δ - 1} Δ_{1} f_{h} (δ j_{1}, 0, \dots, 0) + \sum_{j_{2} = 0}^{x_{2}^{q} / δ - 1} Δ_{2} f_{h} (x_{1}^{q}, δ j_{2}, 0, \dots, 0) .

Recall that $f_{h} (0) = 0$ and that the definition of S implies that $δ (j_{1}, j_{2}, 0, \dots, 0) \in S$ for any $0 \leq j_{1} \leq x_{1}^{q} / δ$ and $0 \leq j_{2} \leq x_{2}^{q} / δ$ . Combining these facts with (19) yields

| f_{h} (x^{q}) | \leq C (b, β) (1 + x_{2}^{q}) (x_{1}^{q} + x_{2}^{q}) / δ, x^{q} \in S, x_{3}^{q} = 0,

(20)

which proves one of the claims from Proposition 1.

We now describe the main idea and introduce several auxiliary lemmas used to prove Lemma 5. Our discussion communicates the main intuition behind the proof, leaving the technical details to Appendix B. Let $γ > 0$ be a constant independent of n whose precise value will be specified later and define

θ_{1} = n - ⌊ \sqrt{n} β / 2 ⌋, and θ_{2} = ⌊ γ \sqrt{n} ⌋ .

Additionally, we define the stopping times

τ_{i} (q_{i}) = \inf {t \geq 0 : Q_{i} (t) = q_{i}}, q_{i} \in {0, 1, \dots, n}, i = 1, 2 .

We now describe a sequence of cycles, or attempts, such that in each cycle, the probability of the joint chain coupling is bounded from below by a constant independent of n. Given an initial state $(Q (0), \tilde{Q} (0)) = (q, \tilde{q})$ belonging to some $Θ_{i}^{Q}$ , we wait until $τ_{2} (θ_{2})$ , which marks the start of the first cycle. From that point, we wait until $\min (τ_{1} (θ_{1}), τ_{2} (2 θ_{2}))$ . If $τ_{1} (θ_{1}) \geq τ_{2} (2 θ_{2})$ , then we give up trying to couple this cycle and wait until $τ_{2} (θ_{2})$ to start a fresh cycle. If $τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})$ , then there are $⌊ \sqrt{n} β / 2 ⌋$ idle servers and at most $2 θ_{2}$ nonempty buffers. From such a state, we are guaranteed that coupling happens if the joint CTMC enters $Θ_{1}^{Q}$ and spends an exponentially distributed amount of time there before all servers in ${Q (t)}$ become busy; that is, $τ_{C} < τ_{1} (n)$ . If $τ_{C} \geq τ_{1} (n)$ , we give up trying to couple this cycle and wait until $τ_{2} (θ_{2})$ for the next cycle to restart the coupling attempt. Note that this cycle sequence resembles a renewal sequence, but the new cycle times are not renewal times because the values of $Q_{3} (\cdot), \dots, Q_{b + 1} (\cdot)$ can vary at the start of each new cycle.

From our discussion, it follows that coupling is guaranteed in any given cycle if, starting from a state with $q_{2} = θ_{2}$ , the events ${τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})}$ and ${τ_{C} < τ_{1} (n)}$ occur. In Appendix B, we derive a lower bound, uniform in n, on the probability of coupling in a given cycle, implying that coupling is guaranteed to happen after a geometrically distributed number of cycles. We also derive an upper bound, uniform in n, on the expected time until the start of the first cycle, as well as the expected cycle duration, and then combine these bounds and prove Lemma 5.

3.2. Higher-Order Bounds

To prove the higher-order bounds, we first use the Poisson equation to write $Δ_{1}^{2} f_{h} (x^{q})$ in terms of $h (x^{q}), E h (X)$ and first-order differences of $f_{h} (x^{q})$ . With the help of this expression, we use the dynamics of the JSQ model to relate all the second-order differences to each other and prove that

| Δ_{1}^{a_{1}} Δ_{2}^{a_{2}} f_{h} (x_{1}^{q}, x_{2}^{q}, 0, \dots, 0) | \leq δ^{2} \sum_{i = 1}^{b + 1} E X_{i} + C (b, β) δ^{2} {(1 + x_{2}^{q})}^{2}

(21)

for

{‖ a ‖}_{1} = 2, x_{1}^{q} \leq δ (n - a_{1})

and

x_{2}^{q} \leq δ (n - a_{2})

, followed by a similar bound for

| (Δ_{1} + Δ_{2}) f_{h} (0, x_{2}^{q}, 0, \dots, 0) |

. We then bound

\sum_{i = 1}^{b + 1} E X_{i}

using the Poisson equation in Section 3.2.1 and bound

| Δ_{1}^{3} f_{h} (x^{q}) |

and

| (Δ_{1}^{2} - (Δ_{1} + Δ_{2})) f_{h} (x^{q}) |

in Section 3.2.2. In Section 3.2.3, we prove two technical lemmas needed to establish (21). We also briefly discuss (see Remark 1) the advantage of using the prelimit generator approach and working with finite differences of

f_{h} (x^{q})

as opposed to using the classical generator approach and working with the derivatives of the solution to the Poisson equation for the diffusion.

For the following discussion, we assume that $x^{q} \in S$ with $x_{3}^{q} = 0$ . Recall from (5) that

\begin{array}{l} G_{X} f_{h} (x^{q}) = & 1 (q_{1} < n) n λ Δ_{1}^{2} f_{h} (x^{q} - δ e^{(1)}) + 1 (q_{1} = n, q_{2} < n) n λ (Δ_{2} + Δ_{1}) f_{h} (x^{q}) \\ + \frac{1}{δ} (β - (x_{1}^{q} + x_{2}^{q})) Δ_{1} f_{h} (x^{q}) - \frac{1}{δ} x_{2}^{q} Δ_{2} f_{h} (x^{q} - δ e^{(2)}) . \end{array}

(22)

We rearrange the Poisson equation $G_{X} f_{h} (x^{q}) = E h (X) - h (x^{q})$ to see that when $0 < q_{1} < n$ , or alternatively $0 < x_{1}^{q} < δ n$ ,

\begin{array}{l} Δ_{1}^{2} f_{h} (x^{q} - δ e^{(1)}) = & \frac{1}{n λ} (E h (X) - h (x^{q})) - \frac{1}{n λ} \frac{1}{δ} (β - (x_{1}^{q} + x_{2}^{q})) Δ_{1} f_{h} (x^{q}) \\ + \frac{1}{n λ} \frac{1}{δ} x_{2}^{q} Δ_{2} f_{h} (x^{q} - δ e^{(2)}) . \end{array}

(23)

Note that $E | h (X) | \leq C E (X_{1} + \dots + X_{b + 1})$ because $h (0) = 0$ and $h \in M_{disc, 2} (C)$ . Together with the bound on $Δ_{i} f_{h} (x^{q})$ from (19), this implies that

| Δ_{1}^{2} f_{h} (x^{q}) | \leq δ^{2} C \sum_{i = 1}^{b + 1} E X_{i} + δ^{2} x_{1}^{q} + C (b, β) δ^{2} {(1 + x_{2}^{q})}^{2}, x_{1}^{q} < δ (n - 1), x_{3}^{q} = 0 .

(24)

Similarly, if $x_{1}^{q} = 0$ ,

(Δ_{2} + Δ_{1}) f_{h} (x^{q}) = \frac{1}{n λ} (E h (X) - h (x^{q})) - \frac{1}{n λ} \frac{1}{δ} (β - x_{2}^{q}) Δ_{1} f_{h} (x^{q}) + \frac{1}{n λ} \frac{1}{δ} x_{2}^{q} Δ_{2} f_{h} (x^{q} - δ e^{(2)}),

(25)

and therefore

| (Δ_{2} + Δ_{1}) f_{h} (0, x_{2}, 0, \dots, 0) | \leq δ^{2} C \sum_{i = 1}^{b + 1} E X_{i} + C (b, β) δ^{2} {(1 + x_{2}^{q})}^{2}, x_{2}^{q} < δ n .

(26)

Not all second-order differences can be bounded like this. For example, the equation for $Δ_{2}^{2} f_{h} (x^{q})$ would involve the third-order difference $Δ_{2} Δ_{1}^{2} f_{h} (x^{q})$ , which we have not bounded. Instead, the following lemma relates the remaining second-order differences to $Δ_{1}^{2} f_{h} (x^{q})$ and $(Δ_{2} + Δ_{1}) f_{h} (0, x_{2}^{q}, 0, \dots, 0)$ using the structure of the JSQ system. The proof is postponed to Section 3.2.3.

Lemma 6.

Fix $h \in M_{disc, 2} (C)$ . Then for any $x^{q} \in S$ with $x_{3}^{q} = 0$ ,

\begin{array}{l} | Δ_{1}^{2} f_{h} (x^{q}) | \leq C δ^{2} + \max_{0 \leq y_{2}^{q} \leq x_{2}^{q}} | Δ_{1}^{2} f_{h} (0, y_{2}^{q}, 0, \dots, 0) |, provided x^{q} + 2 δ e^{(1)} \in S, \\ | Δ_{2} Δ_{1} f_{h} (x^{q}) | \leq C δ^{2} + \max_{\begin{matrix} 0 \leq y_{2}^{q} \leq x_{2}^{q} \\ j = 1, 2 \end{matrix}} | Δ_{j}^{2} f_{h} (0, y_{2}^{q}, 0, \dots, 0) |, provided x^{q} + δ e^{(1)} + δ e^{(2)} \in S, \\ | Δ_{2}^{2} f_{h} (x^{q}) | \leq C δ^{2} + \max_{\begin{matrix} 0 \leq y_{2}^{q} \leq x_{2}^{q} \\ j = 1, 2 \end{matrix}} | Δ_{j}^{2} f_{h} (0, y_{2}^{q}, 0, \dots, 0) |, provided x^{q} + 2 δ e^{(2)} \in S . \end{array}

We see from Lemma 6 that to bound the second-order differences, we only need bounds on $| Δ_{1}^{2} f_{h} (0, x_{2}^{q}, 0, \dots, 0) |$ and $| Δ_{2}^{2} f_{h} (0, x_{2}^{q}, 0, \dots, 0) |$ . The former is bounded in (24); for the latter term, we note that for any $x^{q} \in S$ with $x_{1}^{q} = x_{3}^{q} = 0$ ,

\begin{array}{l} | Δ_{2}^{2} f_{h} (x^{q}) | = | Δ_{2} f_{h} (x^{q} + δ e^{(2)}) - Δ_{2} f_{h} (x^{q}) | \\ = | (Δ_{2} + Δ_{1}) f_{h} (x^{q} + δ e^{(2)}) - Δ_{1} f_{h} (x^{q} + δ e^{(2)}) - Δ_{2} f_{h} (x^{q}) | \\ = | (Δ_{2} + Δ_{1}) f_{h} (x^{q} + δ e^{(2)}) + (f_{h} (x^{q}) - f_{h} (x^{q} + δ e^{(1)} + δ e^{(2)})) | \\ \leq δ^{2} C \sum_{i = 1}^{b + 1} E X_{i} + C (b, β) δ^{2} {(1 + x_{2}^{q})}^{2} + | f_{h} (0, x_{2}^{q}, 0, \dots, 0) - f_{h} (δ, x_{2}^{q} + δ, 0, \dots, 0) |, \end{array}

(27)

where the inequality follows from (26). The following lemma bounds the last term on the right-hand side, implying that

| Δ_{2}^{2} f_{h} (0, x_{2}^{q}, 0, \dots, 0) | \leq δ^{2} C \sum_{i = 1}^{b + 1} E X_{i} + C (b, β) δ^{2} {(1 + x_{2}^{q})}^{2}

and, consequently, (21). It is proved in Section 3.2.3.

Lemma 7.

For all $n \geq 1$ ,

| f_{h} (0, x_{2}^{q}, 0, \dots, 0) - f_{h} (δ, x_{2}^{q} + δ, 0, \dots, 0) | \leq C (b, β) δ^{2} (1 + x_{2}^{q}), 0 \leq x_{2}^{q} < δ n .

(28)

3.2.1. Bounding $\sum_{i = 1}^{b + 1} E X_{i}$ .

The bounds in (21) and (26) do not yet look like the stated bounds in Proposition 1 because the term $\sum_{i = 1}^{b + 1} E X_{i}$ is present. However, we can bound this expectation using the Poisson equation as follows. Recall that $λ = 1 - β / \sqrt{n}$ ; let $x (\infty) = (δ (n - ⌊ n λ ⌋), 0, \dots, 0) = (β + δ (n λ - ⌊ n λ ⌋), 0, \dots, 0)$ , and observe that this point is in S. In fact, it is the closest point in S, when rounded up, to the fluid equilibrium of the JSQ system, which happens to be $(β, 0, \dots, 0)$ ; see Braverman (2020). From (22) we have

G_{X} f_{h} (x (\infty)) = n λ Δ_{1}^{2} f_{h} (x (\infty) - δ e^{(1)}) + (n λ - ⌊ n λ ⌋) Δ_{1} f_{h} (x (\infty)) = E h (X) - h (x (\infty)) .

Choosing $h (x^{q}) = \sum_{i = 1}^{b + 1} x_{i}^{q}$ and noting that $h (x (\infty)) = β + δ (n λ - ⌊ n λ ⌋)$ yields

n λ Δ_{1}^{2} f_{h} (x (\infty) - δ e^{(1)}) + (n λ - ⌊ n λ ⌋) Δ_{1} f_{h} (x (\infty)) - β - δ (n λ - ⌊ n λ ⌋) = \sum_{i = 1}^{b + 1} E X_{i} .

(29)

To bound $\sum_{i = 1}^{b + 1} E X_{i}$ we need only bound $Δ_{1}^{2} f_{h} (x (\infty) - δ e^{(1)})$ because $| Δ_{1} f_{h} (x (\infty)) | \leq δ C (b, β)$ because of (19). Note that we cannot use (24) for the second-order difference bound because $\sum_{i = 1}^{b + 1} E X_{i}$ is present on the right-hand side there. Instead, we exploit the structure of the JSQ model to bound $Δ_{1}^{2} f (x (\infty) - δ e^{(1)})$ as follows.

Define $τ^{-} (x_{1}^{q}) = \inf_{t \geq 0} {X (t) = (x_{1}^{q} - δ, 0, \dots, 0) | X (0) = (x_{1}^{q}, 0, \dots, 0)}$ ; let $(X (t), \tilde{X} (t))$ be the scaled version of the coupling defined in Lemma 4, and let V be the unit-rate exponentially distributed random variable defined in the same lemma. Fix $x^{q} = (x_{1}^{q}, 0, \dots, 0)$ with $x_{1}^{q} \geq 2 δ$ , and suppose $X (0) = x^{q}$ and $\tilde{X} (0) = x^{q} - δ e^{(1)}$ . Consider the evolution of $(X (t), \tilde{X} (t))$ for $t \in [0, V \land τ^{-} (x_{1}^{q})]$ . If $V < τ^{-} (x_{1}^{q})$ , the two processes couple and become identical. Otherwise, the joint process is in state $(x^{q} - δ e^{(1)}, x^{q} - 2 δ e^{(1)})$ . Using the strong Markov property, we conclude that

\begin{array}{l} Δ_{1} f_{h} (x^{q} - δ e^{(1)}) = & \int_{0}^{\infty} E_{x^{q}} [(h (X (t)) - h (X (t) - δ e^{(1)})) 1 (t \leq (V \land τ^{-} (x_{1}^{q})))] d t \\ + ℙ (V \geq τ^{-} (x_{1}^{q})) Δ_{1} f_{h} (x^{q} - 2 δ e^{(1)}) . \end{array}

Choosing $x_{1}^{q} = x_{1} (\infty)$ , we see that

\begin{array}{l} Δ_{1} f_{h} (x (\infty) - δ e^{(1)}) - Δ_{1} f_{h} (x (\infty) - 2 δ e^{(1)}) \\ = & \int_{0}^{\infty} E_{x (\infty)} [(h (X (t)) - h (X (t) - δ e^{(1)})) 1 (t \leq (V \land τ^{-} (x_{1} (\infty))))] d t \\ - ℙ (V < τ^{-} (x_{1} (\infty))) Δ_{1} f_{h} (x (\infty) - 2 δ e^{(1)}) . \end{array}

Choosing $h (x^{q}) = \sum_{i = 1}^{b + 1} x_{i}^{q}$ and using $| Δ_{1} f (x (\infty)) | \leq δ C (b, β)$ , we arrive at

| Δ_{1}^{2} f_{h} (x (\infty) - δ e^{(1)}) | \leq δ E τ^{-} (x_{1} (\infty)) + C (b, β) δ ℙ (V < τ^{-} (x_{1} (\infty))) .

(30)

The quantities involving $τ^{-} (x_{1} (\infty))$ are bounded in the following lemma.

Lemma 8.

There exists a constant $C (β) > 0$ such that for all $n \geq 1$ ,

E τ^{-} (x_{1}^{q}) \leq C (β) δ, and ℙ (V \leq τ^{-} (x_{1}^{q})) \leq C (β) δ, for x_{1}^{q} \in {x_{1} (\infty), δ, 2 δ} .

(31)

Lemma 8 is proved in Section B.5 in Appendix B. It implies that $| Δ_{1}^{2} f_{h} (x (\infty) - δ e^{(1)}) | \leq C (b, β) δ^{2}$ , and therefore

\sum_{i = 1}^{b + 1} E X_{i} \leq C (b, β) .

(32)

Combining (32) with (21) proves the second-order bounds in Proposition 1.

Before moving on, let us make a few remarks. The bound in (32) implies that the sequence of steady-state distributions ${X}_{n = 1}^{\infty}$ is tight; when combined with process-level convergence of ${X (t)}$ to the diffusion ${Y (t)}$ , tightness can be used to imply convergence of the steady-state distributions via a limit-interchange argument. For an example of this applied to the JSQ model, see Braverman (2020). Alternatively, (32) can be recast into a result about the convergence rate to the mean-field equilibrium.

Let $h (x) = | x_{1} + \dots + x_{b + 1} - β | - β$ , noting that $h \in M_{disc, 1} (1)$ and that $h (0) = 0$ ; suppose for the sake of exposition that $⌊ n λ ⌋ = n λ$ . One may check that the bound in (30) holds even when $h \in M_{disc, 1} (1)$ , in which case (29) implies that

E | \sum_{i = 1}^{b + 1} X_{i} - β | = β + n λ Δ_{1}^{2} f_{h} (x (\infty) - δ e^{(1)}) \leq C (b, β) .

If we divide both sides by $\sqrt{n}$ to consider the mean-field scaled version of $\sum_{i = 1}^{b + 1} X_{i}$ , we get

E | (n - Q_{1}) / n + \sum_{i = 2}^{b + 1} Q_{i} / n - β | \leq C (b, β) / \sqrt{n} .

Thus, we recover the $1 / \sqrt{n}$ rate of convergence to the mean field equilibrium that one typically obtains using Stein’s method for the mean-field model, like in Ying (2017). The approach used to show tightness in this section can offer an alternative to the one proposed by Ying (2017), but the difficulty of implementing our approach is directly related to the difficulty of obtaining the relevant Stein factor bounds.

As a final remark, in this section we have shown that establishing tightness, or rates of convergence to the mean-field equilibrium, is equivalent to bounding the first- and second-order differences of $f_{h} (x^{q})$ at a single point near the fluid equilibrium of the CTMC. In contrast, establishing rates of convergence to the diffusion requires bounds on the second- and third-order differences at all points in the support of Y.

3.2.2 Third-Order Bounds.

To bound $Δ_{1}^{3} f_{h} (x^{q})$ , we recall (23), which says that for $0 < x_{1}^{q} < δ n$ with $x_{3} = 0$ ,

\begin{array}{l} Δ_{1}^{2} f_{h} (x^{q} - δ e^{(1)}) = & \frac{1}{n λ} (E h (X) - h (x^{q})) - \frac{1}{n λ} \frac{1}{δ} (β - (x_{1}^{q} + x_{2}^{q})) Δ_{1} f_{h} (x^{q}) \\ + \frac{1}{n λ} \frac{1}{δ} x_{2}^{q} Δ_{2} f_{h} (x^{q} - δ e^{(2)}) . \end{array}

Applying $Δ_{1}$ to both sides yields

\begin{array}{l} Δ_{1}^{3} f_{h} (x^{q} - δ e^{(1)}) = & - \frac{1}{n λ} Δ_{1} h (x^{q}) - \frac{1}{n λ} \frac{1}{δ} (β - (x_{1}^{q} + x_{2}^{q})) Δ_{1}^{2} f_{h} (x^{q}) + \frac{1}{n λ} Δ_{1} f_{h} (x^{q} + δ e^{(1)}) \\ + \frac{1}{n λ} \frac{1}{δ} x_{2}^{q} Δ_{1} Δ_{2} f_{h} (x^{q} - δ e^{(2)}), 0 < x_{1}^{q} < δ (n - 1) . \end{array}

The bounds on the first- and second-order differences of $f_{h} (x^{q})$ , together with the fact that $h \in M_{disc, 2} (C)$ , imply that

| Δ_{1}^{3} f_{h} (x^{q}) | \leq C (b, β) δ^{3} {(1 + x_{2}^{q})}^{3}, x^{q} \in S, x_{1}^{q} \leq δ (n - 3),

which matches the inequality in Proposition 1. The bound on

| (Δ_{1}^{2} - (Δ_{1} + Δ_{2})) f_{h} (x^{q}) |

when

x_{1}^{q} = 0

is proved identically by subtracting

(Δ_{1} + Δ_{2}) f_{h} (x^{q})

in (25) from

Δ_{1}^{2} f_{h} (x^{q})

in (23). This concludes the proof of Proposition 1. □

3.2.3. Proving Lemmas 6 and 7.

To conclude the section, we prove the auxiliary lemmas from Section 3.2.

Proof of Lemma 6.

Our first task is to bound

Δ_{1}^{2} f_{h} (x^{q}) = \int_{0}^{\infty} (E_{x^{q} + 2 δ e^{(1)}} h (X (t)) - 2 E_{x^{q} + δ e^{(1)}} h (X (t)) + E_{x^{q}} h (X (t))) d t .

Note that $x^{q} \in S$ with $x^{q} + 2 δ e^{(1)} \in S$ implies that $q_{2} \leq q_{1} - 2$ . Working with the unscaled CTMC, we now construct four processes ${{\tilde{Q}}^{(1)} (t)}, \dots, {{\tilde{Q}}^{(4)} (t)}$ defined on the time interval $[0, τ_{1} (n)]$ , where

τ_{1} (n) = \inf_{t \geq 0} {{\tilde{Q}}_{1}^{(1)} (t) = n} = \inf_{t \geq 0} {{\tilde{X}}_{1}^{(1)} (t) = 0} .

(33)

We refer to ${{\tilde{Q}}^{(i)} (t)}$ as the ith process. Process four is a copy of ${Q (t)}$ . Numbers two and three are copies of four but with one extra customer who is assigned to a server with an empty buffer. The extra customer in two is different from the one in three. Lastly, process one is a copy of four but with two extra customers. The extra customers are the same as those in two and three. Figure 6 visualizes the initial condition of the processes.

Let ${{\tilde{X}}^{(1)} (t)}, \dots, {{\tilde{X}}^{(4)} (t)}$ be the scaled counterparts of these processes. Note that

\begin{array}{l} Δ_{1}^{2} f_{h} (x^{q}) = \int_{0}^{\infty} (E_{x^{q} + 2 δ e^{(1)}} h (X (t)) - 2 E_{x^{q} + δ e^{(1)}} h (X (t)) + E_{x^{q}} h (X (t))) d t \\ = \int_{0}^{\infty} E_{{\tilde{X}}^{(1)} (0) = x^{q}} ((h ({\tilde{X}}^{(4)} (t)) - h ({\tilde{X}}^{(3)} (t))) - (h ({\tilde{X}}^{(2)} (t)) - h ({\tilde{X}}^{(1)} (t)))) d t . \end{array}

We refer to the different customers according to their shapes in Figure 6. Define τ_s and τ_d to be the service times of the server with the star and diamond customer, respectively. Both are exponentially distributed with unit mean. Setting $τ_{m} = \min {τ_{s}, τ_{d}, τ_{1} (n)}$ , we observe that if $τ_{m} = τ_{s}$ , then

{\tilde{X}}^{(1)} (t) = {\tilde{X}}^{(3)} (t), {\tilde{X}}^{(2)} (t) = {\tilde{X}}^{(4)} (t), t \geq τ_{m};

τ_{m} = τ_{d}

, then

{\tilde{X}}^{(1)} (t) = {\tilde{X}}^{(2)} (t), and {\tilde{X}}^{(3)} (t) = {\tilde{X}}^{(4)} (t), t \geq τ_{m} .

Therefore,

\begin{array}{l} \int_{0}^{\infty} E_{{\tilde{X}}^{(1)} (0) = x} ((h ({\tilde{X}}^{(4)} (t)) - h ({\tilde{X}}^{(3)} (t))) - (h ({\tilde{X}}^{(2)} (t)) - h ({\tilde{X}}^{(1)} (t)))) d t \\ = & E_{{\tilde{X}}^{(1)} (0) = x} \int_{0}^{τ_{m}} ((h ({\tilde{X}}^{(4)} (t)) - h ({\tilde{X}}^{(3)} (t))) - (h ({\tilde{X}}^{(2)} (t)) - h ({\tilde{X}}^{(1)} (t)))) d t \\ + ℙ_{{\tilde{X}}^{(1)} (0) = x} (τ_{m} = τ_{1} (n)) E_{{\tilde{X}}^{(1)} (0) = x} [Δ_{1}^{2} f_{h} (0, {\tilde{X}}_{2}^{(1)} (τ_{1} (n)), 0, \dots, 0) | τ_{m} = τ_{1} (n)] . \end{array}

(34)

Because ${\tilde{X}}^{(4)} (t) = {\tilde{X}}^{(3)} (t) + δ e^{(1)} = {\tilde{X}}^{(2)} (t) + δ e^{(1)} = {\tilde{X}}^{(1)} (t) + 2 δ e^{(1)}$ for $0 \leq t \leq τ_{m}$ ,

| (h ({\tilde{X}}^{(4)} (t)) - h ({\tilde{X}}^{(3)} (t))) - (h ({\tilde{X}}^{(2)} (t)) - h ({\tilde{X}}^{(1)} (t))) | = | Δ_{1}^{2} h ({\tilde{X}}^{(1)} (t)) | \leq C δ^{2},

where the last inequality follows from

h \in M_{disc, 2} (C)

. Combining this with the facts that

{\tilde{X}}_{2}^{(1)} (τ_{1} (n)) \leq {\tilde{X}}_{2}^{(1)} (0)

and

E_{x} τ_{m} \leq E τ_{s} = 1

, we conclude that the right-hand side of (34) is bounded by

C δ^{2} + | Δ_{1}^{2} f_{h} (0, x_{2}^{q}, 0, \dots, 0) |

, which proves the bound on

| Δ_{1}^{2} f_{h} (x^{q}) |

The remaining bounds are proved similarly, starting with $| Δ_{2} Δ_{1} f_{h} (x^{q}) |$ . Fix $x^{q} \in S$ with $x_{3}^{q} = 0$ , and consider

Δ_{2} Δ_{1} f_{h} (x^{q}) = (f_{h} (x^{q} + δ e^{(1)} + δ e^{(2)}) - f_{h} (x^{q} + δ e^{(1)})) - (f_{h} (x^{q} + δ e^{(2)}) - f_{h} (x^{q})) .

We again construct a coupling ${{\tilde{X}}^{(1)} (t)}, \dots, {{\tilde{X}}^{(4)} (t)}$ corresponding to the four initial states on the right-hand side above. The initial conditions of the unscaled processes are visualized in Figure 7. Our construction yields

Δ_{2} Δ_{1} f_{h} (x^{q}) = \int_{0}^{\infty} E_{{\tilde{X}}^{(1)} (0) = x^{q}} ((h ({\tilde{X}}^{(4)} (t)) - h ({\tilde{X}}^{(3)} (t))) - (h ({\tilde{X}}^{(2)} (t)) - h ({\tilde{X}}^{(1)} (t)))) d t .

(35)

Let $ν_{1} = \inf_{t \geq 0} {{\tilde{Q}}_{1}^{(3)} (t) = n}$ . We again let τ_s and τ_d be the remaining service time of the server with the star and diamond customer, respectively, and set $τ_{m} = \min {τ_{s}, τ_{d}, ν_{1}}$ . Just like we argued before, if $τ_{m} = τ_{s}$ , then the integrand in (35) is zero after τ_m. If, however, $τ_{m} = τ_{d}$ , then

\begin{array}{l} {\tilde{X}}_{2}^{(1)} (τ_{m}) = & {\tilde{X}}_{2}^{(2)} (τ_{m}) = {\tilde{X}}_{2}^{(3)} (τ_{m}) = {\tilde{X}}_{2}^{(4)} (τ_{m}), \\ {\tilde{X}}_{1}^{(2)} (τ_{m}) + 2 δ = & {\tilde{X}}_{1}^{(1)} (τ_{m}) + δ = {\tilde{X}}_{1}^{(4)} (τ_{m}) + δ = {\tilde{X}}_{1}^{(3)} (τ_{m}); \end{array}

τ_{m} = ν_{1}

, then

{\tilde{X}}_{1}^{(i)} (τ_{m}) = 0

for

1 \leq i \leq 4

and

{\tilde{X}}_{2}^{(2)} (τ_{m}) = {\tilde{X}}_{2}^{(1)} (τ_{m}) + δ = {\tilde{X}}_{2}^{(4)} (τ_{m}) + δ = {\tilde{X}}_{2}^{(3)} (τ_{m}) + 2 δ .

Therefore,

\begin{array}{l} Δ_{2} Δ_{1} f_{h} (x^{q}) = E_{{\tilde{X}}^{(1)} (0) = x^{q}} \int_{0}^{τ_{m}} ((h ({\tilde{X}}^{(4)} (t)) - h ({\tilde{X}}^{(3)} (t))) - (h ({\tilde{X}}^{(2)} (t)) - h ({\tilde{X}}^{(1)} (t)))) d t \\ + ℙ_{{\tilde{X}}^{(1)} (0) = x^{q}} (τ_{m} = τ_{d}) E_{x} [- Δ_{1}^{2} f_{h} ({\tilde{X}}^{(2)} (τ_{d})) | τ_{m} = τ_{d}] \\ + ℙ_{{\tilde{X}}^{(1)} (0) = x^{q}} (τ_{m} = ν_{1}) E_{x} [Δ_{2}^{2} f_{h} (0, {\tilde{X}}_{2}^{(3)} (ν_{1}), 0, \dots, 0) | τ_{m} = ν_{1}] \\ \leq C δ^{2} + | Δ_{1}^{2} f_{h} (0, x_{2}^{q}, 0, \dots, 0) | + | Δ_{2}^{2} f_{h} (0, x_{2}^{q}, 0, \dots, 0) | . \end{array}

(36)

Figure 8 illustrates the coupling needed to bound $| Δ_{2}^{2} f_{h} (x^{q}) |$ . The idea of the proof is again to wait until $τ_{1} (n)$ and analyze what could happen if one of the servers containing the star or diamond customer completes service before $τ_{1} (n)$ . We leave the details to the reader. □

Figure 6. The Initial State of the Four Systems
*Notes*. The red customers represent those common to all four systems. The diamond and star are the extra customers.

Figure 7. The Initial State of the Four Systems
*Note*. The red customers represent those common to all four systems.

**Figure 8. The Coupling Needed to Bound $| Δ_{2}^{2} f_{h} (x^{q}) |$**

Remark 1.

Let us say a few words on the advantage of using the prelimit generator comparison approach over the classical generator comparison approach. Lemma 6 is proved using a synchronous coupling of four JSQ systems. The four systems are initialized one or two customers apart from one another; because of the discrete state space of the CTMC, all four systems stay one or two customers apart until they couple. Had we used the classical generator comparison approach, we would have needed to carry out a similar analysis by coupling four copies of the diffusion ${Y (t)}$ . However, unlike the JSQ coupling, the four diffusions would not maintain their initial spacing relative to each other because ${Y (t)}$ takes values in a continuous state space. This would further complicate the analysis as we would now need to keep track of the positions of the four diffusions relative to each other.

Proof of Lemma 7.

We want to bound

| f_{h} (0, x_{2}^{q}, 0, \dots, 0) - f_{h} (δ, x_{2}^{q} + δ, 0, \dots, 0) | = | \int_{0}^{\infty} (E_{(0, x_{2}^{q}, 0, \dots, 0)} h (X (t)) - E_{(δ, x_{2}^{q} + δ, 0, \dots, 0)} h (X (t))) d t | .

As we are accustomed to doing by now, let us construct a coupling ${{\tilde{Q}}^{(1)} (t), {\tilde{Q}}^{(2)} (t)}$ with

{\tilde{Q}}^{(1)} (0) = (n, q_{2}, 0, \dots, 0), and {\tilde{Q}}^{(2)} (0) = (n - 1, q_{2} + 1, 0, \dots, 0) .

System two has one less idle server and one more customer waiting in a buffer compared to system one, but the total initial customer count is identical across both systems. The initial condition of both systems is visualized in Figure 9. We assume that the diamond and star customers are independent of each other, that the systems see identical arrivals, and that the rest of the customers are identical across both systems.

Now define τ_d and τ_s to be the remaining service times of the server that has the diamond and star customer, respectively; let $ν_{1} = \inf_{t \geq 0} {{\tilde{Q}}_{1}^{(2)} (t) = n}$ ; set $τ_{m} = \min {τ_{s}, τ_{d}, ν_{1}}$ . If $τ_{m} = ν_{1}$ or $τ_{m} = τ_{d}$ , then ${\tilde{Q}}^{(1)} (t) \overset{d}{=} {\tilde{Q}}^{(2)} (t)$ for $t \geq τ_{m}$ . Letting ${{\tilde{X}}^{(i)} (t)}$ be the scaled version of ${{\tilde{Q}}^{(i)} (t)}$ , it follows that

\begin{array}{l} f_{h} (0, x_{2}^{q}, 0, \dots, 0) - f_{h} (δ, x_{2}^{q} + δ, 0, \dots, 0) \\ = & E_{{\tilde{X}}^{(1)} (0) = (0, x_{2}^{q}, 0, \dots, 0)} \int_{0}^{τ_{m}} (h ({\tilde{X}}^{(1)} (t)) - h ({\tilde{X}}^{(2)} (t))) d t \\ + ℙ_{{\tilde{X}}^{(1)} (0) = (0, x_{2}^{q}, 0, \dots, 0)} (τ_{m} = τ_{s}) E_{x^{q}} [- Δ_{2} f_{h} ({\tilde{X}}^{(1)} (τ_{s})) | τ_{m} = τ_{s}] . \end{array}

To bound the first term on the right-hand side, note that

| E_{{\tilde{X}}^{(1)} (0) = (0, x_{2}^{q}, 0, \dots, 0)} \int_{0}^{τ_{m}} (h ({\tilde{X}}^{(1)} (t)) - h ({\tilde{X}}^{(2)} (t))) d t | \leq C δ E_{{\tilde{X}}^{(2)} (0) = (δ, x_{2}^{q} + δ, 0, \dots, 0)} ν_{1} \leq C (β) δ^{2} .

The first inequality is true because $h \in M_{disc, 2} (C)$ , and the last inequality follows from Lemma 8 with $x_{1}^{q} = δ$ there. Furthermore,

\begin{array}{l} ℙ_{{\tilde{X}}^{(1)} (0) = (0, x_{2}^{q}, 0, \dots, 0)} (τ_{m} = τ_{s}) | E_{x} [- Δ_{2} f_{h} ({\tilde{X}}^{(1)} (τ_{s})) | τ_{m} = τ_{s}] | \\ \leq ℙ_{{\tilde{X}}^{(1)} (0) = (0, x_{2}^{q}, 0, \dots, 0)} (τ_{s} < ν_{1}) C (b, β) δ (1 + x_{2}^{q}) \leq C (b, β) δ^{2} (1 + x_{2}^{q}) . \end{array}

The first inequality follows from the bound on the first-order difference in (19) together with the fact that ${\tilde{X}}_{2}^{(1)} (t) \leq x_{2}^{q}$ for all $t \in [0, τ_{m}]$ . The second inequality follows by noting that τ_s is independent of ν₁ and using Lemma 8 with $x_{1}^{q} = δ, τ^{-} (x_{1}^{q}) = ν_{1}$ , and $V = τ_{s}$ there. □

Figure 9. The Initial State of the Two Systems in an Example Where n = 6
*Note*. The red circles represent customers common to both systems.

4. Conclusion

As stated in the introduction, the Stein factor bounds require the bulk of our efforts. Proving the first-order bounds in Section 3.1 amounts to considering two coupled JSQ systems, initialized with a difference of one customer, and bounding the expected coupling time of this joint chain. We bound the coupling time by considering a sequence of coupling attempts where the probability of coupling in a single attempt is bounded away from zero uniformly in n; the expected interattempt times are also bounded from above, uniformly in n. The coupling time can then be bounded by a sum of a geometrically distributed number of random variables representing the interattempt durations. This renewal-like argument applies more generally to settings where (a) there is a region of the state space where the joint chain is guaranteed to couple provided it spends enough time there and (b) one can control the expected time to reach this region and the probability of coupling in the region before leaving it.

With the first-order Stein factor bounds in hand, the higher-order bounds require less effort. Our proofs of the high-order bounds make heavy use of the transition structure of the JSQ system and, in particular, that $Q_{2} (t), \dots, Q_{b + 1} (t)$ increase only at those times when $Q_{1} (t) = n$ . Readers should not be misled into thinking that high-order Stein factor bounds require less effort than first-order bounds for all models. Indeed, in the classical generator comparison approach, high-order bounds require much more effort; see, for example, Mackey and Gorham (2016), Erdogdu et al. (2018), Jin et al. (2022).

Regarding extending our results, we note that Proposition 2, which compares G_X to G_Y, can be easily adjusted to hold for other parameter regimes and load-balancing policies. The main difficulty would be establishing Stein factor bounds. As mentioned in the introduction, Zhao et al. (2021) considered the super-Halfin-Whitt regime ( $1 / 2 < α < 1$ ) and established several hitting-time estimates similar to the ones we use in the proof of Lemma 5 to bound the first-order Stein factors. It may be possible to build on their results and obtain rates of convergence for the super-Halfin-Whitt regime too.

Furthermore, it seems that the sub-Halfin-Whitt regime ( $0 < α < 1 / 2$ ) should present less of a challenge than our own setting. Recall from the discussion in Section 3.1 that coupling of the joint CTMC is guaranteed provided it enters $Θ_{1}^{Q}$ and spends an exponentially distributed amount of time there before all servers become busy. Compared to the Halfin-Whitt regime, the rate at which customers arrive in the sub-Halfin-Whitt regime is much smaller, so the event that all servers are busy should happen less frequently. Indeed, Liu and Ying (2020) showed that the steady-state probability that all servers are busy tends to zero in the sub-Halfin-Whitt regime. Consequently, the Stein factor bounds should be simpler to establish.

Appendix A. Supporting Proofs for Section 2

We first prove Lemma 2 and then introduce the operator A in Section A.1 in Appendix A. Once A is introduced, we prove Proposition 2 in Section A.2 in Appendix A.

Proof of Lemma 2.

Initialize Y(0) according to Y. Because ${Y (t)}$ satisfies (1), for any $f \in C^{2} (R_{+}^{b + 1})$ with $E | f (Y) | < \infty$ , Itô’s lemma implies that

\begin{array}{l} 0 = & E f (Y (1)) - E f (Y (0)) \\ = & E \int_{0}^{1} G_{Y} f (Y (s)) d s + E (\int_{0}^{1} (\frac{\partial}{\partial x_{1}} f (Y (s)) + \frac{\partial}{\partial x_{2}} f (Y (s))) 1 (Y_{1} (s) = 0) d U (s)) . \end{array}

(A.1)

If $E | G_{Y} f (Y) | < \infty$ , then $E \int_{0}^{1} G_{Y} f (Y (s)) d s = E G_{Y} f (Y)$ follows from the Fubini-Tonelli theorem. □

A.1. The Interpolator A

The operator A discussed in this section is identical to the one introduced in appendix A of Braverman (2022), but we repeat its key properties here as they are needed for the proof of Proposition 2. Consider a one-dimensional function $f : δ ℤ \to R$ . We can extend it to $R$ by defining

A f (x) = \sum_{i = 0}^{4} α_{k (x) + i}^{k (x)} (x) f (δ (k (x) + i)),

where

k (x) = ⌊ x / δ ⌋

and

α_{k + i}^{k} : R \to R

are weights defined for all

k \in ℤ

and

i = 0, \dots, 4

. The function

A f (x)

is a weighted sum of the five points

f (δ k (x)), \dots, f (δ (k (x) + 4))

. We mention the reason for using five points after stating Theorem A.1. Note that if f(x) is defined only on a subset of

δ ℤ

, then

A f (x)

can still be defined provided that

f (δ k (x)), \dots, f (δ (k (x) + 4))

are defined. Braverman (2022) described how to choose these weights to make Af(x) coincide with

f (\cdot)

on grid points and also to make it a differentiable function whose derivatives behave like the corresponding finite differences of

f (\cdot)

. The idea can be applied to multidimensional grid-valued functions as well.

The following result is theorem A.1 of Braverman (2022). We use this as an interface that contains the important properties of A without delving into the low-level details behind its construction.

Theorem A.1.

Given a convex set $K \subset R^{d}$ , define

K_{4} = {x \in K \cap δ ℤ^{d} : δ (k (x) + i) \in K \cap δ ℤ^{d} for all 0 \leq i \leq 4 e};

let

Conv (K_{4})

be the convex hull of K₄; and, for

x \in R^{d}

, define k(x) by

k_{j} (x) = ⌊ x_{j} / δ ⌋

. There exist weights

{α_{k + i}^{k} : R \to R, k \in ℤ, i = 0, 1, 2, 3, 4}

such that for any

f : K \cap δ ℤ^{d} \to R

, the function

\begin{array}{l} A f (x) = \sum_{i_{d} = 0}^{4} α_{k_{d} (x) + i_{d}}^{k_{d} (x)} (x_{d}) \dots \sum_{i_{1} = 0}^{4} α_{k_{1} (x) + i_{1}}^{k_{1} (x)} (x_{1}) f (δ (k (x) + i)) \\ = \sum_{i_{1}, \dots, i_{d} = 0}^{4} (\prod_{j = 1}^{d} α_{k_{j} (x) + i_{j}}^{k_{j} (x)} (x_{j})) f (δ (k (x) + i)), x \in Conv (K_{4}) \end{array}

(A.2)

satisfies

A f (x) \in C^{3} (Conv (K_{4}))

, where

i = (i_{1}, \dots, i_{d})

in (A.2). Additionally, Af(x) is infinitely differentiable almost everywhere on

Conv (K_{4})

A f (δ k) = f (δ k), δ k \in K_{4},

(A.3)

and there exists a constant C(d) > 0 independent of

f (\cdot)

, x, and δ such that

| \frac{\partial^{a}}{\partial x^{a}} A f (x) | \leq C (d) δ^{- {‖ a ‖}_{1}} \max_{\begin{matrix} 0 \leq i_{j} \leq 4 - a_{j} \\ j = 1, \dots, d \end{matrix}} | Δ_{1}^{a_{1}} \dots Δ_{d}^{a_{d}} f (δ (k (x) + i)) |, x \in Conv (K_{4}),

(A.4)

for

0 \leq {‖ a ‖}_{1} \leq 3

; (A.4) also holds when

{‖ a ‖}_{1} = 4

for almost all

x \in Conv (K_{4})

. Additionally, the weights

{α_{k + i}^{k} : R \to R, k \in ℤ, i = 0, 1, 2, 3, 4}

are degree-7 polynomials in

(x - δ k) / δ

whose coefficients do not depend on k or δ. They satisfy

α_{k}^{k} (δ k) = 1, and α_{k + i}^{k} (δ k) = 0, k \in ℤ, i = 1, 2, 3, 4,

(A.5)

\sum_{i = 0}^{4} α_{k + i}^{k} (x) = 1, k \in ℤ, x \in R,

(A.6)

and also the following translational invariance property:

α_{k + j + i}^{k + j} (x + δ j) = α_{k + i}^{k} (x), i, j, k \in ℤ, x \in R .

(A.7)

Remark A.1.

The bound in (A.4) holds almost everywhere when ${‖ a ‖}_{1} = 4$ . This bound is the reason we need to use $f (δ k (x))$ and the four points to the right of it (in each dimension). By using more (fewer) points, one can alter the theorem so that (A.4) holds for larger (smaller) values of ${‖ a ‖}_{1}$ . It is worth noting that to prove the results in this paper, we do not go beyond ${‖ a ‖}_{1} = 3$ .

Going forward, we let A be the operator described in Theorem A.1. Because Af coincides with f on the grid, we refer to A as an interpolator. For the interested reader, A is a degree-7 polynomial spline. From (A.3) we see that A is a linear operator, and (A.6) implies that A applied to a constant simply equals that constant. Before we can prove Proposition 2, we require one more lemma.

Lemma A.1.

In the setting of Theorem A.1, for any $k \in K_{4}$ and $1 \leq j \leq d$ ,

\frac{\partial}{\partial x_{j}} A f (x) |_{x = δ k} = δ^{- 1} (Δ_{j} - \frac{1}{2} Δ_{j}^{2} + \frac{1}{3} Δ_{j}^{3}) f (δ k) .

(A.8)

Furthermore, there exists some $ϵ : Conv (K_{4}) \to R$ satisfying

| ϵ (x) | \leq C (d) δ^{- 1} \max_{\begin{matrix} 0 \leq i \leq 4 e \\ {‖ a ‖}_{1} = 2 \end{matrix}} | Δ_{1}^{a_{1}} \dots Δ_{d}^{a_{d}} f (δ (k (x) + i)) |

such that for any

x \in Conv (K_{4})

\frac{\partial}{\partial x_{j}} A f (x) = δ^{- 1} Δ_{j} f (δ k (x)) + ϵ (x) .

Proof of Lemma A.1.

The proof is identical for all indices, so we assume that j = 1. Fix $δ k \in K_{4}$ and let $g (x_{1}) = A f (x_{1}, δ k_{2}, \dots, δ k_{d})$ be a function in x₁ only. The form of $A f (x)$ in (A.2), together with (A.5), implies that

\frac{\partial}{\partial x_{1}} A f (x) |_{x = δ k} = g^{'} (δ k_{1}) .

It follows that $g^{'} (δ k_{1}) = P_{k_{1}}^{'} (δ k_{1})$ , where $P_{k_{1}} (x)$ is a polynomial defined in (A.1) of Braverman (2022). Furthermore, (A.1) implies that

P_{k_{1}}^{'} (δ k_{1}) = δ^{- 1} (Δ_{1} - \frac{1}{2} Δ_{1}^{2} + \frac{1}{3} Δ_{1}^{3}) = g (δ k_{1}) = δ^{- 1} (Δ_{1} - \frac{1}{2} Δ_{1}^{2} + \frac{1}{3} Δ_{1}^{3}) f (δ k),

from which (A.8) follows. To prove the second claim of the lemma, we write

\frac{\partial}{\partial x_{j}} A f (x) = δ^{- 1} (Δ_{j} - \frac{1}{2} Δ_{j}^{2} + \frac{1}{3} Δ_{j}^{3}) f (δ k (x)) + \frac{\partial}{\partial x_{j}} A f (x) - \frac{\partial}{\partial x_{j}} A f (x) |_{x = δ k (x)} .

Now $| Δ_{j}^{2} f (δ k (x)) | \leq \max {| Δ_{1}^{a_{1}} \dots Δ_{d}^{a_{d}} f (δ (k (x) + i)) | : 0 \leq i \leq 4 e, {‖ a ‖}_{1} = 2}$ ,

| Δ_{j}^{3} f (δ k (x)) | = | Δ_{j}^{2} f (δ (k (x) + e^{(j)})) - Δ_{j}^{2} f (δ k (x)) | \leq \max_{\begin{matrix} 0 \leq i \leq 4 e \\ {‖ a ‖}_{1} = 2 \end{matrix}} | Δ_{1}^{a_{1}} \dots Δ_{d}^{a_{d}} f (δ (k (x) + i)) |,

and

\begin{array}{l} | \frac{\partial}{\partial x_{j}} A f (x) - \frac{\partial}{\partial x_{j}} A f (x) |_{x = δ k (x)} | \\ \leq \sum_{j^{'} = 1}^{d} | x_{j^{'}} - δ k_{j^{'}} (x) | | \frac{\partial^{2}}{\partial x_{j} \partial x_{j^{'}}} A f (ξ) | \leq C (d) δ^{- 1} \max_{\begin{matrix} 0 \leq i \leq 4 e \\ {‖ a ‖}_{1} = 2 \end{matrix}} | Δ_{1}^{a_{1}} \dots Δ_{d}^{a_{d}} f (δ (k (x) + i)) |, \end{array}

where ξ is some point between

δ k (x)

and x. The last inequality follows from (A.4) and the fact that

| x_{j} - δ k_{j} (x) | \leq δ

. □

Note that some of the bounds in Theorem A.1 and Lemma A.1 have a constant C(d) depending on the dimension d of the function (e.g., (A.4)). In the JSQ model $d = b + 1$ , but when proving Proposition 2 in the next section, we can assume that d = 2 because of the following. Given a function $f : δ N^{b + 1} \to R$ , we can use (A.2) and (A.5) of Theorem A.1, and the fact that Y_i = 0 for i > 2, to see that

\begin{array}{l} A f (Y) = \sum_{i_{b + 1} = 0}^{4} α_{i_{b + 1}}^{0} (0) \dots \sum_{i_{3} = 0}^{4} α_{i_{3}}^{0} (0) \sum_{i_{2} = 0}^{4} α_{k_{2} (Y) + i_{2}}^{k_{2} (Y)} (Y_{2}) \sum_{i_{1} = 0}^{4} α_{k_{1} (Y) + i_{1}}^{k_{1} (Y)} (Y_{1}) f (δ (k (Y) + i)) \\ = \sum_{i_{2} = 0}^{4} α_{k_{2} (Y) + i_{2}}^{k_{2} (Y)} (Y_{2}) \sum_{i_{1} = 0}^{4} α_{k_{1} (Y) + i_{1}}^{k_{1} (Y)} (Y_{1}) f (δ (k_{1} (Y) + i_{1}), δ (k_{2} (Y) + i_{2}), 0, \dots, 0) . \end{array}

Because $k_{j} (Y)$ depends only on Y_j, we see that $A f (Y)$ is actually a bivariate function. In Section A.2, we treat any function of the form $A f (Y)$ as a function of two variables.

A.2. Proving Proposition 2

Fix $h \in M_{disc, 2} (C)$ . We recall from (5) that for $x^{q} \in S$ ,

\begin{array}{l} G_{X} f (x^{q}) = & - 1 (q_{1} < n) n λ Δ_{1} f (x^{q} - δ e^{(1)}) + n λ \sum_{j = 1}^{b} 1 (q_{1} = \dots = q_{j} = n, q_{j + 1} < n) Δ_{j + 1} f (x^{q}) \\ + (q_{1} - q_{2}) Δ_{1} f (x^{q}) - \sum_{j = 2}^{b} (q_{j} - q_{j + 1}) Δ_{j} f (x^{q} - δ e^{(j)}) - q_{b + 1} Δ_{b + 1} f (x^{q} - δ e^{(b + 1)}) \end{array}

and

f_{h} (x^{q})

is the unique solution to the Poisson equation

G_{X} f_{h} (x^{q}) = E h (X) - h (x^{q}), x^{q} \in S

(A.9)

with

f_{h} (0) = 0

. Also recall that we extended

f_{h} (x^{q})

by setting

f_{h} (x^{q}) = 0

for

x^{q} \in δ N^{b + 1} ∖ S

and defined

\begin{array}{l} B = & {(x_{1}, x_{2}, 0, \dots, 0) \in R_{+}^{b + 1} : x_{2} + x_{1} \leq δ (n / 2 - 8) = (n / 2 - 8) / \sqrt{n}} and \\ I = & {i = (i_{1}, i_{2}, 0, \dots, 0) \in N^{b + 1} : 0 \leq i_{1}, i_{2} \leq 4} . \end{array}

We first argue that $E | A h (Y) | < \infty, E | A f_{h} (Y) | < \infty$ , and $E | G_{Y} A f_{h} (Y) | < \infty$ , which together imply that (12) holds. The latter two statements follow immediately from the fact that $f_{h} (x^{q})$ , and therefore $A f_{h} (x)$ , have compact support. Because $h \in M_{disc, 2} (C)$ , inequality (A.4) of Theorem A.1 implies that Ah(Y) is Lipschitz and therefore, $E | A h (Y) | < \infty$ because of Lemma 3, which states that the moments of Y_i are finite.

Next we argue that $A G_{X} f_{h} (x) = E h (X) - A h (x)$ for all $x \in B$ . Given the Poisson Equation (A.9) and the definition of A in (A.2) of Theorem A.1, it suffices to show that $δ (k (x) + i) \in S$ for all $i \in I$ . From the definition of S_Q in (4) we know that any point $q \in S_{Q}$ satisfies $0 \leq q_{2} \leq q_{1} \leq n$ . The corresponding points $x^{q} \in S$ satisfy $x_{1}^{q} \geq 0, x_{2}^{q} \geq 0$ , and $x_{1}^{q} + x_{2}^{q} = δ (n - q_{1}) + δ q_{2} \leq δ n$ . The latter inequality says that the combined number of idle servers and servers with at least one person waiting in the buffer cannot exceed n. Now, provided that n > 16, any point δk in

B \cap δ N^{b + 1} = {(x_{1}, x_{2}, 0, \dots, 0) \in R_{+}^{b + 1} : x_{2} + x_{1} \leq δ (n / 2 - 8)} \cap δ N^{b + 1}

must satisfy

δ (k + i) \in S

for all

i \in I

because

δ (k_{1} + i_{1}) + δ (k_{2} + i_{2}) \leq δ n / 2

. Finally, recall that

\begin{array}{l} ε_{1} (Y) = (A G_{X} f_{h} (Y) - G_{Y} A f_{h} (Y)) 1 (Y \in B), \\ ε_{2} (Y) = (E h (X) - A h (Y) - G_{Y} A f_{h} (Y)) 1 (Y \notin B), \\ ε_{3} (Y) = (\frac{\partial}{\partial x_{1}} A f_{h} (Y) + \frac{\partial}{\partial x_{2}} A f_{h} (Y)) 1 (Y \in B), and \\ ε_{4} (Y) = (\frac{\partial}{\partial x_{1}} A f_{h} (Y) + \frac{\partial}{\partial x_{2}} A f_{h} (Y)) 1 (Y \notin B) . \end{array}

We bound $ε_{2} (Y), ε_{3} (Y)$ , and $ε_{4} (Y)$ in Section A.2.1 and bound $ε_{1} (Y)$ in Section A.2.2.

A.2.1. Bounding $ε_{2} (Y)$ through $ε_{4} (Y)$ .

We begin with the bound on

| ε_{2} (Y) | \leq | A h (Y) | 1 (Y \notin B) + 1 (Y \notin B) E | h (X) | + | G_{Y} A f_{h} (Y) | 1 (Y \notin B) .

The fact that $A h (Y)$ is Lipschitz, that $A h (0) = h (0) = 0$ , and that $h \in M_{disc, 2} (C)$ imply that

\begin{array}{l} | A h (Y) 1 (Y \notin B) | \leq & C (Y_{1} + Y_{2}) 1 (Y \notin B) and \\ 1 (Y \notin B) E | h (X) | \leq & 1 (Y \notin B) C E (X_{1} + \dots + X_{b + 1}) \leq 1 (Y \notin B) C (b, β), \end{array}

where the last inequality follows from Inequality (32). To bound the remaining term, we recall (A.4) of Theorem A.1, which says that

| \frac{\partial^{a}}{\partial x^{a}} A f (Y) | \leq C δ^{- {‖ a ‖}_{1}} \max_{\begin{matrix} i \in I \\ 0 \leq i_{j} \leq 4 - a_{j} \end{matrix}} | Δ_{1}^{a_{1}} Δ_{2}^{a_{2}} f (δ (k (Y) + i)) | \leq C δ^{- {‖ a ‖}_{1}} \max_{i \in I} | f (δ (k (Y) + i)) |

(A.10)

for

1 \leq {‖ a ‖}_{1} \leq 3

. Combined with this bound, the definition of G_Y in (8) implies that

\begin{array}{l} | G_{Y} A f_{h} (Y) | = | (β - (Y_{1} + Y_{2})) \frac{\partial}{\partial x_{1}} A f_{h} (Y) - Y_{2} \frac{\partial}{\partial x_{2}} A f_{h} (Y) + \frac{\partial^{2}}{\partial x_{1}^{2}} A f_{h} (Y) | \\ \leq C (β) δ^{- 2} (1 + Y_{1} + Y_{2}) \max_{i \in I} | f (δ (k (Y) + i)) | . \end{array}

Combining the bounds on the three terms yields the bound on $ε_{2} (Y)$ . Lemma A.1 implies the bound on $ε_{3} (Y)$ , and (A.10) implies the bound on $ε_{4} (Y)$ .

A.2.2. Bounding $ε_{1} (Y)$ .

Bounding $ε_{1} (Y)$ requires more effort. The first thing to note is that the weighted sum representation of $A G_{X} f_{h} (Y)$ is difficult to work with. Our first task is therefore to write it in a form that is more amenable to analysis. To this end, we extend the domain of $f_{h} (x^{q})$ to allow either the first or second coordinate to take the value $- δ$ by defining

\begin{array}{l} {\hat{f}}_{h} (x^{q}) = f_{h} (x^{q}), & x^{q} \in δ N^{b + 1}, \\ {\hat{f}}_{h} (- δ, x_{2}^{q}, \dots, x_{b + 1}^{q}) = f_{h} (0, x_{2}^{q} + δ, x_{3}^{q}, \dots, x_{b + 1}^{q}), & (0, x_{2}^{q}, \dots, x_{b + 1}^{q}) \in δ N^{b + 1}, \\ {\hat{f}}_{h} (x_{1}^{q}, - δ, x_{3}^{q}, \dots, x_{b + 1}^{q}) = (1 - Δ_{2}) f_{h} (x_{1}^{q}, 0, x_{3}^{q}, \dots, x_{b + 1}^{q}), & (x_{1}^{q}, 0, x_{3}^{q}, \dots, x_{b + 1}^{q}) \in δ N^{b + 1} . \end{array}

(A.11)

The form of ${\hat{f}}_{h} (x^{q})$ is tied to the transition structure of the JSQ model and specifically to the “reflection” that occurs near the boundaries ${x_{1}^{q} = 0}$ and ${x_{2}^{q} = 0}$ . Furthermore, the definition of A in Theorem A.1 implies that $A f_{h} (x) = A {\hat{f}}_{h} (x)$ for $x \in R_{+}^{b + 1}$ because ${\hat{f}}_{h} = f_{h}$ on $δ N^{b + 1}$ . Having defined ${\hat{f}}_{h} (x^{q})$ , we present the following lemma, which is proved in Section A.2.3.

Lemma A.2.

For any $x^{q} \in B \cap δ N^{b + 1}$ ,

\begin{array}{l} G_{X} f_{h} (x^{q}) = & n λ ({\hat{f}}_{h} (x^{q} - δ e^{(1)}) - {\hat{f}}_{h} (x^{q})) + (n - (x_{1}^{q} + x_{2}^{q}) / δ) ({\hat{f}}_{h} (x^{q} + δ e^{(1)}) - {\hat{f}}_{h} (x^{q})) \\ + \frac{1}{δ} x_{2}^{q} ({\hat{f}}_{h} (x^{q} - δ e^{(2)}) - {\hat{f}}_{h} (x^{q})) . \end{array}

(A.12)

Consequently, for any $x \in B$ ,

\begin{array}{l} A G_{X} f_{h} (x) = & n λ (A {\hat{f}}_{h} (x - δ e^{(1)}) - A {\hat{f}}_{h} (x)) + (n - (x_{1} + x_{2}) / δ) (A {\hat{f}}_{h} (x + δ e^{(1)}) - A {\hat{f}}_{h} (x)) \\ + \frac{1}{δ} x_{2} (A {\hat{f}}_{h} (x - δ e^{(2)}) - A {\hat{f}}_{h} (x)) + ε_{5} (x), \end{array}

(A.13)

where

\begin{array}{l} ε_{5} (x) = & \sum_{i_{2} = 0}^{4} α_{k_{2} (x) + i_{2}}^{k_{2} (x)} (x_{2}) \sum_{i_{1} = 0}^{4} α_{k_{1} (x) + i_{1}}^{k_{1} (x)} (x_{1}) \frac{1}{δ} (δ (k_{2} (x) + i_{2}) - x_{2}) \\ \times (- Δ_{2} {\hat{f}}_{h} (δ (k (x) + i - e^{(2)})) + Δ_{2} {\hat{f}}_{h} (δ (k (x) - e^{(2)})) \\ + \sum_{i_{2} = 0}^{4} α_{k_{2} (x) + i_{2}}^{k_{2} (x)} (x_{2}) \sum_{i_{1} = 0}^{4} α_{k_{1} (x) + i_{1}}^{k_{1} (x)} (x_{1}) \frac{1}{δ} (- δ (k_{1} (x) + i_{1} + k_{2} (x) + i_{2}) + x_{1} + x_{2}) \\ \times (Δ_{1} {\hat{f}}_{h} (δ (k (x) + i)) - Δ_{1} {\hat{f}}_{h} (δ k (x))) . \end{array}

(A.14)

We now bound $ε_{1} (Y)$ using Lemma A.2. Applying Taylor expansion to (A.13), we have

\begin{array}{l} A G_{X} f_{h} (Y) = & n λ (- δ \frac{\partial}{\partial x_{1}} A {\hat{f}}_{h} (Y) + \frac{1}{2} δ^{2} \frac{\partial^{2}}{\partial x_{1}^{2}} A {\hat{f}}_{h} (Y) - \frac{1}{6} δ^{3} \frac{\partial^{3}}{\partial x_{1}^{3}} A {\hat{f}}_{h} (ξ^{1})) \\ + (n - (Y_{1} + Y_{2}) / δ) (δ \frac{\partial}{\partial x_{1}} A {\hat{f}}_{h} (Y) + \frac{1}{2} δ^{2} \frac{\partial^{2}}{\partial x_{1}^{2}} A {\hat{f}}_{h} (Y) + \frac{1}{6} δ^{3} \frac{\partial^{3}}{\partial x_{1}^{3}} A {\hat{f}}_{h} (ξ^{2})) \\ + \frac{1}{δ} Y_{2} (- δ \frac{\partial}{\partial x_{2}} A {\hat{f}}_{h} (Y) + \frac{1}{2} δ^{2} \frac{\partial^{2}}{\partial x_{2}^{2}} A {\hat{f}}_{h} (ξ^{3})) + ε_{5} (Y), \end{array}

where

ξ^{1}, ξ^{2}

, and

ξ^{3}

are points strictly between

Y - δ e^{(1)}

and Y, Y and

Y + δ e^{(1)}

, and

Y - δ e^{(2)}

and Y, respectively. Recall that

δ^{2} = 1 / n, δ (n - n λ) = β

, and G_Y from (8), which imply that

A G_{X} f_{h} (Y) - G_{Y} A {\hat{f}}_{h} (Y) = - \frac{1}{6} δ λ \frac{\partial^{3}}{\partial x_{1}^{3}} A {\hat{f}}_{h} (ξ^{1}) + \frac{1}{6} δ (1 - δ (Y_{1} + Y_{2})) \frac{\partial^{3}}{\partial x_{1}^{3}} A {\hat{f}}_{h} (ξ^{2}) + δ Y_{2} \frac{\partial^{2}}{\partial x_{2}^{2}} A {\hat{f}}_{h} (ξ^{3}) + ε_{5} (Y) .

Note that $A {\hat{f}}_{h} (Y) = A f_{h} (Y)$ because $Y \geq 0$ , so $G_{Y} A {\hat{f}}_{h} (Y) = G_{Y} A f_{h} (Y)$ . We now prove the following four bounds, which together imply the bound on $ε_{1} (Y)$ :

| \frac{1}{6} δ (1 - δ (Y_{1} + Y_{2})) \frac{\partial^{3}}{\partial x_{1}^{3}} A {\hat{f}}_{h} (ξ^{2}) | \leq C δ^{- 2} \max_{i \in I} | Δ_{1}^{3} f_{h} (δ (k (Y) + i)) |,

(A.15)

\begin{array}{l} | \frac{1}{6} δ λ \frac{\partial^{3}}{\partial x_{1}^{3}} A {\hat{f}}_{h} (ξ^{1}) | \leq C δ^{- 2} \max_{i \in I} | Δ_{1}^{3} f_{h} (δ (k (Y) + i)) | + C δ^{- 2} 1 (Y_{1} \leq δ) \max_{\begin{matrix} i \in I \\ i_{1} = 0 \end{matrix}} | (Δ_{1}^{2} - (Δ_{1} + Δ_{2})) f_{h} (δ (k (Y) + i)) |, \end{array}

(A.16)

| δ Y_{2} \frac{\partial^{2}}{\partial x_{2}^{2}} A {\hat{f}}_{h} (ξ^{3}) | \leq C δ^{- 1} Y_{2} \max_{i \in I} | Δ_{2}^{2} f_{h} (δ (k (Y) + i)) |, and

(A.17)

| ε_{5} (Y) | \leq C \max_{\begin{matrix} a_{1} + a_{2} = 2 \\ i \in I \end{matrix}} | Δ_{1}^{a_{1}} Δ_{2}^{a_{2}} f_{h} (δ (k (Y) + i)) | .

(A.18)

We begin with (A.15). Observe that $(1 - δ (Y_{1} + Y_{2})) \in (0, 1 / 2)$ because $Y \in B$ . Furthermore, $Y < ξ^{2} < Y + δ e^{(1)}$ implies $k (ξ^{2}) = k (Y) \geq 0$ . Combining this with (A.4) of Theorem A.1, we get

| \frac{1}{6} δ (1 - δ (Y_{1} + Y_{2})) \frac{\partial^{3}}{\partial x_{1}^{3}} A {\hat{f}}_{h} (ξ^{2}) | \leq C δ^{- 2} \max_{i \in I} | Δ_{1}^{3} {\hat{f}}_{h} (δ (k (Y) + i)) | = C δ^{- 2} \max_{i \in I} | Δ_{1}^{3} f_{h} (δ (k (Y) + i)) | .

We now prove (A.16). As before, $Y - δ e^{(1)} < ξ^{1} < Y$ implies that $k (ξ^{1}) = k (Y - δ e^{(1)}) = k (Y) - e^{(1)}$ , so

| \frac{1}{6} δ λ \frac{\partial^{3}}{\partial x_{1}^{3}} A {\hat{f}}_{h} (ξ^{1}) | \leq C δ^{- 2} \max_{i \in I} | Δ_{1}^{3} {\hat{f}}_{h} (δ (k (Y) - e^{(1)} + i)) | .

(A.19)

Now when $Y \in [0, δ)$ and $i_{1} = 0$ , the definition of ${\hat{f}}_{h} (x^{q})$ in (A.11) implies that

{\hat{f}}_{h} (δ (k (Y) - e^{(1)} + i)) = {\hat{f}}_{h} (- δ, δ (k_{2} (Y) + i_{2}), 0, \dots, 0) = f_{h} (0, δ (k_{2} (Y) + i_{2} + 1), 0, \dots, 0),

from which we see that

Δ_{1} {\hat{f}}_{h} (- δ, δ (k_{2} (Y) + i_{2}), 0, \dots, 0) = - Δ_{2} f_{h} (0, δ (k_{2} (Y) + i_{2}), 0, \dots, 0)

, and therefore

Δ_{1}^{3} {\hat{f}}_{h} (- δ, δ (k_{2} (Y) + i_{2}), 0, \dots, 0) = (Δ_{1}^{2} - (Δ_{1} + Δ_{2})) f_{h} (0, δ (k_{2} (Y) + i_{2}), 0, \dots, 0) .

Combining this with (A.19) implies (A.16). To prove (A.17), we note that $k (ξ^{3}) = k (Y) - e^{(2)}$ because $Y - δ e^{(2)} < ξ^{3} < Y$ , so

| δ Y_{2} \frac{\partial^{2}}{\partial x_{2}^{2}} A {\hat{f}}_{h} (ξ^{3}) | \leq C δ^{- 1} Y_{2} \max_{i \in I} | Δ_{2}^{2} {\hat{f}}_{h} (δ (k (Y) - e^{(2)} + i)) | .

The definition of ${\hat{f}}_{h} (x^{q})$ in (A.11) says that $Δ_{2} {\hat{f}}_{h} (Y_{1}, - δ, 0, \dots, 0) = Δ_{2} {\hat{f}}_{h} (Y_{1}, 0, \dots, 0)$ , so $Δ_{2}^{2} {\hat{f}}_{h} (Y_{1}, - δ, 0, \dots, 0) = 0$ , implying (A.17). Lastly, we prove (A.18). Theorem A.1 tells us that $α_{k_{j} + i_{j}}^{k_{j}} (x_{j})$ are degree-7 polynomials in $(x_{j} - δ k_{j}) / δ$ whose coefficients do not depend on k_j or δ, so there exists a constant C > 0 such that $| α_{k_{j} (x) + i_{j}}^{k_{j} (x)} (x_{j}) | \leq C$ for j = 1, 2, so

\begin{array}{l} | \sum_{i_{2} = 0}^{4} α_{k_{2} (x) + i_{2}}^{k_{2} (x)} (x_{2}) \sum_{i_{1} = 0}^{4} α_{k_{1} (x) + i_{1}}^{k_{1} (x)} (x_{1}) \frac{1}{δ} (δ (k_{2} (x) + i_{2}) - x_{2}) \\ \times (- Δ_{2} {\hat{f}}_{h} (δ (k (x) + i - e^{(2)})) + Δ_{2} {\hat{f}}_{h} (δ (k (x) - e^{(2)})) | \\ \leq & C \max_{i \in I} | Δ_{2} {\hat{f}}_{h} (δ (k (x) + i - e^{(2)})) - Δ_{2} {\hat{f}}_{h} (δ (k (x) - e^{(2)}) | . \end{array}

Now

\begin{array}{l} Δ_{2} {\hat{f}}_{h} (δ (k (x) + i - e^{(2)})) - Δ_{2} {\hat{f}}_{h} (δ (k (x) - e^{(2)})) \\ = & Δ_{2} {\hat{f}}_{h} (δ (k (x) + i - e^{(2)})) - Δ_{2} {\hat{f}}_{h} (δ (k (x) + i_{2} e^{(2)} - e^{(2)})) + Δ_{2} {\hat{f}}_{h} (δ (k (x) + i_{2} e^{(2)} - e^{(2)})) - Δ_{2} {\hat{f}}_{h} (δ (k (x) - e^{(2)})) \\ = & \sum_{i_{1}^{'} = 0}^{i_{1} - 1} Δ_{1} Δ_{2} {\hat{f}}_{h} (δ (k (x) + i_{1}^{'} e^{(1)} + i_{2} e^{(2)} - e^{(2)})) + \sum_{i_{2}^{'} = 0}^{i_{2} - 1} Δ_{2}^{2} {\hat{f}}_{h} (δ (k (x) + i_{2}^{'} e^{(2)} - e^{(2)})), \end{array}

implying that

\begin{array}{l} C \max_{i \in I} | Δ_{2} {\hat{f}}_{h} (δ (k (x) + i - e^{(2)})) - Δ_{2} {\hat{f}}_{h} (δ (k (x) - e^{(2)}) | \\ \leq & C \max_{i \in I} | Δ_{2}^{2} {\hat{f}}_{h} (δ (k (Y) - e^{(2)} + i)) | + C \max_{i \in I} | Δ_{1} Δ_{2} {\hat{f}}_{h} (δ (k (Y) - e^{(2)} + i)) | . \end{array}

An identical argument allows us to bound the second term on the right-hand side of (A.14), yielding

\begin{array}{l} ε_{5} (Y) \leq & C \max_{i \in I} | Δ_{2}^{2} {\hat{f}}_{h} (δ (k (Y) - e^{(2)} + i)) | + C \max_{i \in I} | Δ_{1} Δ_{2} {\hat{f}}_{h} (δ (k (Y) - e^{(2)} + i)) | \\ + C \max_{i \in I} | Δ_{1}^{2} {\hat{f}}_{h} (δ (k (Y) + i)) | + C \max_{i \in I} | Δ_{1} Δ_{2} {\hat{f}}_{h} (δ (k (Y) + i)) | . \end{array}

Using $Δ_{2} {\hat{f}}_{h} (Y_{1}, - δ, 0, \dots, 0) = Δ_{2} {\hat{f}}_{h} (Y_{1}, 0, \dots, 0)$ and $Δ_{2}^{2} {\hat{f}}_{h} (Y_{1}, - δ, 0, \dots, 0) = 0$ , we conclude (A.18).

A.2.3. Proving Lemma A.2

To prove Lemma A.2, we need the following result.

Lemma A.3.

Suppose $P \subset δ ℤ^{d}$ and let $f, g : P \to R$ . Given $ℓ \in ℤ^{d}$ , for those k such that $δ k \in P$ and $δ (k + ℓ) \in P$ , we define

F (δ k) = g (δ k) (f (δ (k + ℓ)) - f (δ k)) .

Then $A F (x)$ is well defined for those $x \in R^{d}$ such that $δ (k (x) + i) \in P$ and $δ (k (x) + ℓ + i) \in P$ for all $0 \leq i \leq 4 e$ , where $k_{i} (x) = ⌊ x_{i} / δ ⌋$ . Furthermore, for all such x,

\begin{array}{l} A F (x) = & A g (x) (A f (x + δ ℓ) - A f (x)) \\ + \sum_{i_{1}, \dots, i_{d} = 0}^{4} (\prod_{j = 1}^{d} α_{k_{j} (x) + i_{j}}^{k_{j} (x)} (x_{j})) (g (δ (k (x) + i)) - A g (x)) \\ \times (f (δ (k (x) + ℓ + i)) - f (δ (k (x) + i)) - (f (δ (k (x) + ℓ)) - f (δ k (x)))) . \end{array}

Proof of Lemma A.3.

The proof is identical to the proof of proposition 3 of Braverman (2022). □

Proof of Lemma A.2.

First, we prove (A.12). Any $x^{q} = \in B \cap δ N^{b + 1}$ satisfies $x_{2}^{q} \leq δ (n / 2 - 8)$ , or $q_{2} \leq n / 2 - 8$ . It follows from the definition of G_X in (5) that for $x^{q} \in B \cap δ N^{b + 1}$ ,

\begin{array}{l} G_{X} f_{h} (x^{q}) = & - 1 (q_{1} < n) n λ Δ_{1} f_{h} (x^{q} - δ e^{(1)}) + 1 (q_{1} = n) n λ Δ_{2} f (x^{q}) \\ + (q_{1} - q_{2}) Δ_{1} f_{h} (x^{q}) - q_{2} Δ_{2} f_{h} (x^{q} - δ e^{(2)}) . \end{array}

Note that $q_{1} - q_{2} = n - (n - q_{1}) - q_{2} = n - (x_{1}^{q} + x_{2}^{q}) / δ$ , and $q_{2} = x_{2}^{q} / δ$ . Although $Δ_{2} f_{h} (x^{q} - δ e^{(2)})$ is technically not defined when $x_{2}^{q} = 0$ , we adopt the convention that $1 (q_{2} = 0) q_{2} Δ_{2} f_{h} (x^{q} - δ e^{(2)}) = 0$ . Using the definition of $\hat{f} (x^{q})$ in (A.11), we have

1 (q_{2} = 0) q_{2} Δ_{2} f (x^{q} - δ e^{(2)}) = 0 = 1 (q_{2} = 0) q_{2} Δ_{2} \hat{f} (x^{q} - δ e^{(2)}) .

Similarly, because $q_{1} = n$ corresponds to $x_{1}^{q} = 0$ ,

n λ 1 (q_{1} = n) Δ_{2} f (x^{q}) = - 1 (q_{1} = n) n λ Δ_{1} \hat{f} (x^{q} - δ e^{(1)}),

which proves (A.12). To prove (A.13), note that if

g (x^{q}) = (n - x_{1}^{q} - x_{2}^{q}) / δ

, then

A g (x) = n - (x_{1}^{q} + x_{2}^{q}) / δ

. To see why, note that

Δ_{i} Δ_{j} g (x^{q}) = 0

for any i, j, so Theorem A.1 implies that all second-order partial derivatives of

A g (x)

are zero. Because Ag(x) is twice continuously differentiable, it must be a linear function, and the only linear function that coincides with

g (x^{q})

on the grid is

A g (x) = n - (x_{1}^{q} + x_{2}^{q}) / δ

. Similarly, if

g (x^{q}) = q_{2} = x_{2}^{q} / δ

, then

A g (x) = x_{2} / δ

. Applying Lemma A.3 to each of the three terms on the right-hand side of (A.12) proves (A.13). □

Appendix B. Supporting Proofs for Section 3

Apart from the short proof of Lemma 8 in Section B.5, this appendix is devoted to the proof of Lemma 5. Going forward, we fix $γ = 2 (17 / β + β + 1)$ and recall from Section 3.1 that

\begin{array}{l} θ_{1} = n - ⌊ \sqrt{n} β / 2 ⌋, θ_{2} = ⌊ γ \sqrt{n} ⌋, τ_{i} (q_{i}) = \inf {t \geq 0 : Q_{i} (t) = q_{i}}, q_{i} \in {0, 1, \dots, n}, i = 1, 2 . \end{array}

Following the proof roadmap of Lemma 5, we need an upper bound on the expected start of the first cycle and the expected duration of a single cycle. The following two lemmas provide the ingredients for these bounds and are proved in Sections B.1 and B.2, respectively.

Lemma B.1.

For all $n \geq 1$ ,

\max_{\begin{matrix} θ_{1} < q_{1} \leq n \\ q_{2} = θ_{2}, q \in S_{Q} \end{matrix}} E_{q} (τ_{2} (2 θ_{2}) \land τ_{1} (θ_{1})) \leq C (b, β) .

(B.1)

Lemma B.2.

For all $n \geq 1$ and $q \in S_{Q}$ with $q_{2} > θ_{2}$ ,

E_{q} τ_{2} (θ_{2}) \leq C (b, β) (1 + δ q_{2}) = C (b, β) (1 + x_{2}^{q}), q \in S_{Q} with q_{2} > θ_{2} .

To bound the probability of coupling in a given cycle, we require the following two lemmas.

Lemma B.3.

There exists a constant $p_{1} (β) \in (0, 1)$ such that for all $n \geq 1$ ,

\min_{\begin{matrix} θ_{1} < q_{1} \leq n \\ q_{2} = θ_{2}, q \in S_{Q} \end{matrix}} ℙ_{q} (τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})) \geq p_{1} (β) .

Lemma B.4.

There exists a constant $p_{2} (b, β) \in (0, 1)$ such that for all $n \geq 1$ ,

\min_{\begin{matrix} 0 \leq q_{1} \leq θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \\ q \in S_{Q} \end{matrix}} ℙ (τ_{C} < τ_{1} (n) | Q (0) = q, (Q (0), \tilde{Q} (0)) \in \cup_{i = 1}^{b + 1} Θ_{i}^{Q}) \geq p_{2} (b, β) .

Lemmas B.3 and B.4 are proved in Sections B.3 and B.4, respectively.

Proof of Lemma 5.

Throughout the proof, we use C to denote a positive constant that may change from line to line but depends only on β and b. Given any initial condition $(q, \tilde{q}) \in \cup_{i = 1}^{b + 1} Θ_{i}^{Q}$ ,

E_{(q, \tilde{q})} τ_{C} \leq C (b, β) (1 + δ q_{2}) .

For convenience, we abuse notation and adopt the convention that

E_{q} τ_{C} = \max_{\tilde{q} : (q, \tilde{q}) \in \cup_{i = 1}^{b + 1} Θ_{i}^{Q}} E [τ_{C} | (Q (0), \tilde{Q} (0)) = (q, \tilde{q})], q \in S_{Q},

but

E_{q} (W) = E (W | Q (0) = q)

for any random variable W other than τ_C. We also assume that every

\max

operator in this proof automatically considers the maximum over all

q \in S_{Q}

; that is,

\max_{q_{2} = θ_{2}} E_{q} τ_{C} = \max_{\begin{matrix} q \in S_{Q} \\ q_{2} = θ_{2} \end{matrix}} E_{q} τ_{C} .

Lemma B.2 implies that for any $q \in S_{Q}$ ,

E_{q} τ_{C} \leq E_{q} τ_{2} (θ_{2}) + \max_{q_{2} = θ_{2}} E_{q} τ_{C} \leq C (1 + δ q_{2}) + \max_{q_{2} = θ_{2}} E_{q} τ_{C} .

(B.2)

We will argue that if $p_{1} = p_{1} (β)$ and $p_{2} = p_{2} (b, β)$ are the constants from Lemmas B.3 and B.4, then

\max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} E_{q} τ_{C} \leq C + (1 - p_{1} p_{2}) \max_{q_{2} = θ_{2}} E_{q} τ_{C}, and

(B.3)

\max_{\begin{matrix} 0 \leq q_{1} < θ_{1} \\ q_{2} = θ_{2} \end{matrix}} E_{q} τ_{C} \leq C + (1 - p_{2}) \max_{q_{2} = θ_{2}} E_{q} τ_{C} .

(B.4)

As a result, choosing $p_{3} = \max {(1 - p_{1} p_{2}), (1 - p_{2})} \in (0, 1)$ implies that

\max_{q_{2} = θ_{2}} E_{q} τ_{C} = \max {\max_{\begin{matrix} 0 \leq q_{1} < θ_{1} \\ q_{2} = θ_{2} \end{matrix}} E_{q} τ_{C}, \max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} E_{q} τ_{C}} \leq C + p_{3} \max_{q_{2} = θ_{2}} E_{q} τ_{C},

and therefore

\max_{q_{2} = θ_{2}} E_{q} τ_{C} \leq C {(1 - p_{3})}^{- 1} \leq C

. Combining this with (B.2) implies the lemma. We now prove (B.3), followed by (B.4). Defining

τ_{M} = τ_{2} (2 θ_{2}) \land τ_{1} (θ_{1})

, we have

\max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} E_{q} τ_{C} \leq \max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} E_{q} τ_{M} + \max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} E_{q} [E_{Q (τ_{M})} τ_{C}] \leq C + \max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} E_{q} [E_{Q (τ_{M})} τ_{C}],

(B.5)

where in the second inequality we used (B.1) of Lemma B.2. To bound the right-hand side, let us define the events

E_{1} = {τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})}, and E_{2} = {τ_{C} < τ_{1} (n)},

and their complements

E_{1}^{c}

and

E_{2}^{c}

, respectively. Note that if

Q_{2} (0) < 2 θ_{2}

, and then the event

E_{1}^{c}

implies that

Q (τ_{M}) = (n, 2 θ_{2})

because

Q_{2} (t)

increases only at times when

Q_{1} (t) = n

. Using the law of total probability,

\max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} E_{q} [E_{Q (τ_{M})} τ_{C}] \leq \max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} {ℙ_{q} (E_{1}^{c}) \max_{\begin{matrix} q_{1}^{'} = n \\ q_{2}^{'} = 2 θ_{2} \end{matrix}} E_{q^{'}} τ_{C} + ℙ_{q} (E_{1}) \max_{\begin{matrix} q_{1}^{'} = θ_{1} \\ 0 \leq q_{2}^{'} \leq 2 θ_{2} \end{matrix}} E_{q^{'}} τ_{C}} .

(B.6)

We note that

\max_{\begin{matrix} q_{1} = n \\ q_{2} = 2 θ_{2} \end{matrix}} E_{q} τ_{C} \leq \max_{\begin{matrix} q_{1} = n \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} τ_{C} \leq \max_{\begin{matrix} q_{1} = n \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} τ_{2} (θ_{2}) + \max_{q_{2} = θ_{2}} E_{q} τ_{C} \leq C + \max_{q_{2} = θ_{2}} E_{q} τ_{C},

(B.7)

where we used Lemma B.2 in the last inequality, so

ℙ_{q} (E_{1}^{c}) \max_{\begin{matrix} q_{1} = n \\ q_{2} = 2 θ_{2} \end{matrix}} E_{q} τ_{C} \leq ℙ_{q} (E_{1}^{c}) (C + \max_{q_{2} = θ_{2}} E_{q} τ_{C}) .

(B.8)

Provided we can show that

ℙ_{q} (E_{1}) \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} τ_{C} \leq ℙ_{q} (E_{1}) (C + (1 - p_{2}) \max_{q_{2} = θ_{2}} E_{q} τ_{C}),

(B.9)

we can combine (B.8) and (B.9) with (B.6) to get

\begin{array}{l} \max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} E_{q} [E_{Q (τ_{M})} τ_{C}] \leq \max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} {ℙ_{q} (E_{1}^{c}) (C + \max_{q_{2}^{'} = θ_{2}} E_{q^{'}} τ_{C}) + ℙ_{q} (E_{1}) (C + (1 - p_{2}) \max_{q_{2}^{'} = θ_{2}} E_{q^{'}} τ_{C})} \\ = C + (\max_{q_{2} = θ_{2}} E_{q} τ_{C}) \max_{\begin{matrix} θ_{1} \leq q_{1} \leq n \\ q_{2} = θ_{2} \end{matrix}} {1 - p_{2} ℙ_{q} (E_{1})} \\ \leq C + (1 - p_{2} p_{1}) \max_{q_{2} = θ_{2}} E_{q} τ_{C}, \end{array}

where the last inequality follows from the lower bound on

ℙ_{q} (E_{1})

in Lemma B.3. Combining this bound with (B.5) proves (B.3). We now prove (B.9). Recall that

E_{2} = {τ_{C} < τ_{1} (n)}

and observe that

\begin{array}{l} \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} τ_{C} \\ \leq \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} ([τ_{C} \land τ_{1} (n)] 1 (E_{2})) + \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} ([τ_{C} \land τ_{1} (n) + E_{Q (τ_{1} (n))} τ_{C}] 1 (E_{2}^{c})) \\ \leq 2 \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} [τ_{C} \land τ_{1} (n)] + \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} ℙ (E_{2}^{c} | Q (0) = q, (Q (0), \tilde{Q} (0)) \in \cup_{i = 1}^{b + 1} Θ_{i}^{Q}) E_{q} [E_{Q (τ_{1} (n))} τ_{C}] \\ \leq 2 \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} [τ_{C} \land τ_{1} (n)] + (1 - p_{2}) \max_{\begin{matrix} q_{1} = n \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} τ_{C}, \end{array}

where in the last inequality we used Lemma B.4 and the fact that

Q_{2} (τ_{1} (n)) \leq Q_{2} (0)

because

Q_{2} (t)

increases only at times when

Q_{1} (t) = n

. Applying (B.7) to the right-hand side, we arrive at

\max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} τ_{C} \leq 2 \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} [τ_{C} \land τ_{1} (n)] + C + (1 - p_{2}) \max_{q_{2} = θ_{2}} E_{q} τ_{C} .

To conclude, we argue that

\max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} [τ_{C} \land τ_{1} (n)] \leq b + 1 .

(B.10)

If $(Q (0), \tilde{Q} (0)) \in Θ_{1}^{Q}$ , then $(Q (t), \tilde{Q} (t)) \in Θ_{1}^{Q}$ for all $t \in [0, τ_{1} (n)]$ by construction. The joint CTMC couples before $τ_{1} (n)$ if $τ_{1} (n) > V$ , where V is as in (17). If $(Q (0), \tilde{Q} (0)) \in Θ_{i}^{Q}$ for $i \geq 2$ , coupling will happen before $τ_{1} (n)$ if the joint CTMC transitions to $Θ_{1}^{Q}$ and then spends V time units there, all before $τ_{1} (n)$ . From the construction of $\tilde{Q} (\cdot)$ , we know that the time taken to get from $Θ_{i}^{Q}$ to $Θ_{1}^{Q}$ equals the sum of i – 1 unit-mean exponentially distributed random variables, so the worst case is when $i = b + 1$ . Letting $Γ_{b + 1}$ represent this sum, it follows that

\max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} [τ_{C} \land τ_{1} (n)] \leq \max_{\begin{matrix} q_{1} = θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \end{matrix}} E_{q} [Γ_{b + 1} \land τ_{1} (n)] \leq E (Γ_{b + 1}) \leq b + 1,

which proves (B.10). Our argument for (B.9) can be repeated to prove (B.4). □

B.1. Proving Lemma B.1

Proof of Lemma B.1.

Define $V (x^{q}) = \sum_{i = 1}^{b + 1} q_{i}$ and observe that

G_{X} V (x^{q}) = n λ 1 (q_{b + 1} < n) - q_{1}, x^{q} \in S .

Because $θ_{1} = n - ⌊ \sqrt{n} β / 2 ⌋$ , it follows that for any $q \in S_{Q}$ with $θ_{1} < q_{1} \leq n$ ,

G_{X} V (x^{q}) = n λ - q_{1} \leq n λ - (n - ⌊ \sqrt{n} β / 2 ⌋) = - β \sqrt{n} + ⌊ \sqrt{n} β / 2 ⌋ \leq - \sqrt{n} β / 2 .

Let M > 0, $t^{(M)} = \min {τ_{1} (θ_{1}), τ_{2} (2 θ_{2}), M}$ , and note that $Q_{1} (t) \geq n - ⌊ \sqrt{n} β / 2 ⌋$ for $t \leq t^{(M)}$ . Dynkin’s formula, for example, lemma 17.2 in Kallenberg (2001), then implies that for any $q \in S_{Q}$ with $θ_{1} < q_{1} \leq n$ and $q_{2} = θ_{2}$ ,

E_{x^{q}} V (X (t^{(M)})) - V (x^{q}) = E_{x^{q}} \int_{0}^{t^{(M)}} G_{X} V (X (s)) d s \leq - \frac{\sqrt{n} β}{2} E_{x^{q}} t^{(M)} .

Because $Q_{1} (t^{(M)}) \geq n - ⌊ \sqrt{n} β / 2 ⌋$ and $θ_{1} < q_{1} \leq n$ , it follows that $q_{1} - Q_{1} (t^{(M)}) \leq ⌊ \sqrt{n} β / 2 ⌋$ , so

\frac{\sqrt{n} β}{2} E_{x^{q}} t^{(M)} \leq V (x^{q}) - E_{x^{q}} V (X (t^{(M)})) \leq q_{1} - E_{x^{q}} Q_{1} (t^{(M)}) + \sum_{i = 2}^{b + 1} q_{i} \leq ⌊ \sqrt{n} β / 2 ⌋ + b θ_{2},

where in the last inequality we used

q_{2} \geq q_{3} \geq \dots \geq q_{b + 1}

. Dividing both sides by

\sqrt{n}

, and noting that

θ_{2} / \sqrt{n} \leq γ = 2 (17 / β + β + 1)

, yields

E_{x^{q}} t^{(M)} \leq C (b, β)

. We conclude by taking

M \to \infty

and using the monotone convergence theorem. □

B.2. Proving Lemma B.12

Recall that $θ_{2} = ⌊ γ \sqrt{n} ⌋$ and $γ = 2 (17 / β + β + 1)$ . In this section, we show that $E_{q} τ_{2} (θ_{2}) \leq C (b, β) (1 + δ q_{2})$ if $q_{2} > θ_{2}$ . Our proof is based on a Lyapunov function characterized by the following proposition, proved in Section B.2.1.

Lemma B.5.

There exists a function $V : R_{+}^{b + 1} \to R$ such that for any $n \geq 1$ and any $x^{q} \in S$ with $x_{2}^{q} \geq 2 (17 / β + β) + δ$ ,

G_{X} V (x^{q}) \leq - 3 / 17 + \frac{δ}{β} (q_{3} 1 (b > 1) - n λ 1 (q_{1} = q_{2} = n)) .

(B.11)

Furthermore, there exists a constant $C (β) > 0$ such that for any $n \geq 1$ ,

0 \leq V (x) \leq C (β) (1 + x_{2}), x \in R^{b + 1} with x_{2} \geq 2 (17 / β + β) .

Proof of Lemma B.2.

Let V(x) be the function in Lemma B.5, fix $X (0) = x^{q} \in S$ with $x_{2}^{q} \geq δ θ_{2}$ , M > 0, and define $τ_{2}^{M} (θ_{2}) = M \land τ_{2} (θ_{2})$ . Dynkin’s formula says that

E_{x^{q}} V (X (τ_{2}^{M} (θ_{2}))) - V (x^{q}) = E_{x^{q}} \int_{0}^{τ_{2}^{M} (θ_{2})} G_{X} V (X (t)) d t .

(B.12)

Because $X_{2} (t) \geq δ θ_{2} \geq 2 (17 / β + β) + δ$ for all $t \in [0, τ_{2}^{M} (θ_{2})]$ , Lemma B.5 implies that

G_{X} V (X (t)) \leq - 3 / 17 + \frac{δ}{β} (Q_{3} (t) 1 (b > 1) - n λ 1 (Q_{1} (t) = Q_{2} (t) = n)), t \in [0, τ_{2}^{M} (θ_{2})] .

Combining this inequality with (B.12) and that $V (X (τ_{2}^{M} (θ_{2}))) \geq 0$ and $V (x^{q}) \leq C (β) x_{2}^{q}$ yields

\frac{3}{17} E_{x^{q}} (τ_{2}^{M} (θ_{2})) \leq C (β) x_{2}^{q} + \frac{δ}{β} E_{x^{q}} \int_{0}^{τ_{2}^{M} (θ_{2})} (Q_{3} (t) 1 (b > 1) - n λ 1 (Q_{1} (t) = Q_{2} (t) = n)) d t .

If b = 1, the lemma follows trivially, so we assume that b > 1. It suffices to show that

E_{x^{q}} \int_{0}^{M} (Q_{3} (t) 1 (b > 1) - n λ 1 (Q_{1} (t) = Q_{2} (t) = n)) d t \leq \sum_{i = 3}^{b + 1} q_{i}

because

\sum_{i = 3}^{b + 1} q_{i} \leq b q_{2}

. Because

Q_{3} (t)

is the number of servers with at least two customers in their buffers, it is also the number of customers that are second in line at time t. Thus,

\int_{0}^{M} Q_{3} (t) d t

is the cumulative time spent by customers being second in line. This cumulative time is contributed to by customers already in the system at time t = 0 and by new arrivals after t = 0. Of those customers present in the system at t = 0, the number that are, or could at some point become, second in line is

\sum_{i = 3}^{b + 1} q_{i}

, and each will spend at most one unit of time being second in line, in expectation.

Let N be the number of customers in the interval $[0, M]$ that arrive when all servers are busy and all queues have at least one customer in them; that is, $Q_{1} (t) = Q_{2} (t) = n$ . For $1 \leq i \leq N$ , let ξ_i be the time customer i spends being second in line, even if that customer becomes second in line after time M. We argue that conditioned on ${N \geq i}$ , each ξ_i is exponentially distributed with unit mean. Upon entry into the system, if customer i is routed to a busy server with only one other customer waiting in the buffer, then ξ_i is distributed according to the remaining service time of the server, which is exponentially distributed with unit mean. If the buffer has more than one customer waiting, then ξ_i equals the service time of the customer two spots ahead of customer i, which is also exponentially distributed with unit mean. Further note that the CTMC can be constructed in such a way that the value of ξ_i is determined at the instant when customer i enters the system, so

E_{x^{q}} \int_{0}^{M} Q_{3} (t) d t \leq \sum_{i = 3}^{b + 1} q_{i} + E_{x^{q}} \sum_{i = 1}^{\infty} ξ_{i} 1 (N \geq i) = \sum_{i = 3}^{b + 1} q_{i} + \sum_{i = 1}^{\infty} E_{x^{q}} (ξ_{i} | N \geq i) ℙ (N \geq i) = \sum_{i = 3}^{b + 1} q_{i} + E_{x^{q}} N .

Let η_i be the time spent by the CTMC in a state with $Q_{1} (t) = Q_{2} (t) = n$ before customer i’s arrival. Because the arrivals to the JSQ system are governed by a rate- $n λ$ Poisson process, the arrival of customer i corresponds to a time when η_i accumulates to equal an exponentially distributed random variable with rate $n λ$ ; therefore,

E_{x^{q}} \int_{0}^{M} 1 (Q_{1} (t) = Q_{2} (t) = n) d t \geq E_{x^{q}} \sum_{i = 1}^{\infty} η_{i} 1 (N \geq i) = \sum_{i = 1}^{\infty} E_{x^{q}} (η_{i} | N \geq i) ℙ (N \geq i) = \frac{1}{n λ} E_{x^{q}} N . □

B.2.1. Proving Lemma B.5.

The Lyapunov function in Lemma B.5 is based on the fluid limit of the JSQ system, studied in Braverman (2020). Lemma B.5 was, unfortunately, not proved there; but that paper contains all the necessary ingredients for the proof. We now recall them using notation from Braverman (2020).

Consider the two-dimensional process ${(Q_{1} (t) - n) / n, Q_{2} (t) / n}$ . Note that the first coordinate is nonpositive, whereas so far we have been using a nonnegative first coordinate. Section 4.1 of Braverman (2020) described the fluid limit of this process. Letting

Ω = {x \in R^{2} : x_{1} \leq 0, x_{2} \geq 0},

the fluid limit is a dynamical system

v : R_{+} \to Ω

with initial condition

v (0) = x \in Ω

; we write

v^{x} (t)

to emphasize the relationship on x. Postponing the discussion of the behavior of

v^{x} (t)

, for

ℓ, u \in R

with

ℓ < u

define the smoothed indicator function

ϕ^{(ℓ, u)} : R \to [0, 1]

ϕ^{(ℓ, u)} (x) = {\begin{array}{l} 0, & x \leq ℓ, \\ {(x - ℓ)}^{2} (\frac{- (x - ℓ)}{{((u + ℓ) / 2 - ℓ)}^{2} (u - ℓ)} + \frac{2}{((u + ℓ) / 2 - ℓ) (u - ℓ)}), & x \in [ℓ, (u + ℓ) / 2], \\ 1 - {(x - u)}^{2} (\frac{(x - u)}{{((u + ℓ) / 2 - u)}^{2} (u - ℓ)} - \frac{2}{((u + ℓ) / 2 - u) (u - ℓ)}), & x \in [(u + ℓ) / 2, u], \\ 1, & x \geq u, \end{array}

(B.13)

and let

f^{(2)} (x) = \int_{0}^{\infty} ϕ^{(δ κ_{1}, δ κ_{2})} (v^{x} (t)) d t, x \in Ω,

where

δ = 1 / \sqrt{n}

and

κ_{1}, κ_{2} \in R

are to be determined. The function

f^{(2)} (x)

appeared in section 5.1 of Braverman (2020), where it was used as a Lyapunov function for the diffusion limit of the JSQ system; that is, the process

{Y (t)}

in (1). We show that this is also a Lyapunov function for the CTMC. Define

V (x) = f^{(2)} (- δ x_{1}, δ x_{2}), x \in R_{+}^{b + 1} .

(B.14)

The following result proved in Section B.2.2 gives us control over the derivatives of V(x).

Lemma B.6.

For any $x \in R_{+}^{b + 1}$ with $x_{2} \geq κ_{2}$ ,

(β - (x_{1} + x_{2})) \frac{\partial V (x)}{\partial x_{1}} - δ x_{2} \frac{\partial V (x)}{\partial x_{2}} = - 1, and 1 (x_{1} = 0) (\frac{\partial}{\partial x_{1}} + \frac{\partial}{\partial x_{2}}) V (x) = 0 .

(B.15)

Furthermore, if we choose $κ_{1} = 17 / β + β$ and $κ_{2} = 2 κ_{1}$ , then for any $x \in R_{+}^{b + 1}$ with $x_{2} \geq κ_{2}$ , and any $x_{2}^{'} \geq x_{2}$ ,

\frac{\partial^{2}}{\partial x_{1}^{2}} V (x) \leq 9 / 17,

(B.16)

\frac{\partial}{\partial x_{2}} V (x) \leq \frac{\partial}{\partial x_{2}} V (0, x_{2}^{'}) = \frac{1}{β}, \frac{\partial^{2}}{\partial x_{2}^{2}} V (x) \leq 5 / 17;

(B.17)

and there exists a constant

C (β)

such that

0 \leq V (x) \leq C (β) (1 + x_{2})

Proof of Lemma B.5.

Let $κ_{1} = 17 / β + β$ and $κ_{2} = 2 κ_{1}$ and V(x) be the function from Lemma B.6, and recall G_X defined in (5). Because V(x) depends only on x₁ and x₂,

\begin{array}{l} G_{X} V (x^{q}) = & 1 (q_{1} < n) n λ (- Δ_{1} V (x^{q} - δ e^{(1)})) + n λ 1 (q_{1} = n, q_{2} < n) Δ_{2} V (x^{q}) \\ + (q_{1} - q_{2}) Δ_{1} V (x^{q}) + (q_{2} - q_{3} 1 (b > 1)) (- Δ_{2} V (x^{q} - δ e^{(2)})), x^{q} \in S . \end{array}

Using Taylor expansion, we get

\begin{array}{l} - Δ_{1} V (x^{q} - δ e^{(1)}) = V (x^{q} - δ e^{(1)}) - V (x^{q}) = - δ \frac{\partial}{\partial x_{1}} V (x^{q}) + \int_{x_{1}^{q} - δ}^{x_{1}^{q}} (u - (x_{1}^{q} - δ)) \frac{\partial^{2}}{\partial x_{1}^{2}} V (u, x_{2}^{q}) d u, \\ Δ_{1} V (x^{q}) = V (x + δ e^{(1)}) - V (x^{q}) = δ \frac{\partial}{\partial x_{1}} V (x^{q}) + \int_{x_{1}^{q}}^{x_{1}^{q} + δ} (x_{1}^{q} + δ - u) \frac{\partial^{2}}{\partial x_{1}^{2}} V (u, x_{2}^{q}) d u; \end{array}

and a similar expression holds for

Δ_{2} V (x^{q})

and

- Δ_{2} V (x^{q} - δ e^{(2)})

. Therefore,

\begin{array}{l} G_{X} V (x^{q}) = & - δ (1 (q_{1} < n) n λ - (q_{1} - q_{2})) \frac{\partial}{\partial x_{1}} V (x^{q}) + δ (1 (q_{1} = n, q_{2} < n) n λ - q_{2}) \frac{\partial}{\partial x_{2}} V (x^{q}) \\ - q_{3} 1 (b > 1) (- Δ_{2} V (x^{q} - δ e^{(2)})) + ψ (x^{q}), \end{array}

(B.18)

where

\begin{array}{l} ψ (x^{q}) = & n λ 1 (q_{1} < n) \int_{x_{1}^{q} - δ}^{x_{1}^{q}} (u - (x_{1}^{q} - δ)) \frac{\partial^{2}}{\partial x_{1}^{2}} V (u, x_{2}^{q}) d u \\ + n λ 1 (q_{1} = n, q_{2} < n) \int_{x_{2}^{q}}^{x_{2}^{q} + δ} (x_{2}^{q} + δ - u) \frac{\partial^{2}}{\partial x_{2}^{2}} V (x_{1}^{q}, u) d u \\ + (q_{1} - q_{2}) \int_{x_{1}^{q}}^{x_{1}^{q} + δ} (x_{1}^{q} + δ - u) \frac{\partial^{2}}{\partial x_{1}^{2}} V (u, x_{2}^{q}) d u + q_{2} \int_{x_{2}^{q} - δ}^{x_{2}^{q}} (u - (x_{2}^{q} - δ)) \frac{\partial^{2}}{\partial x_{2}^{2}} V (x_{1}^{q}, u) d u . \end{array}

Now suppose $x_{2}^{q} \geq κ_{2} + δ$ . The bounds on the second-order derivatives of V(x) from Lemma B.6, together with the facts that $q_{1} - q_{2} \geq 0, q_{2} \geq 0, δ^{2} n λ \leq 1$ , and $δ^{2} q_{i} \leq 1$ , imply that $ψ (x^{q}) \leq 14 / 17$ . Next, we rewrite the first line on the right-hand side of (B.18), for which we note that

\begin{array}{l} λ = 1 - β / \sqrt{n}, x_{1}^{q} = δ (n - q_{1}), 1 (q_{1} = n) = 1 (x_{1}^{q} = 0), \\ 1 (q_{1} < n) = 1 - 1 (x_{1}^{q} = 0), 1 (q_{1} = n, q_{2} < n) = 1 (x_{1}^{q} = 0) - 1 (q_{1} = q_{2} = n), \end{array}

\begin{array}{l} - δ (1 (q_{1} < n) n λ - (q_{1} - q_{2})) \frac{\partial}{\partial x_{1}} V (x^{q}) + δ (1 (q_{1} = n, q_{2} < n) n λ - q_{2}) \frac{\partial}{\partial x_{2}} V (x^{q}) \\ = & (β - (x_{1}^{q} + x_{2}^{q})) \frac{\partial}{\partial x_{1}} V (x^{q}) - x_{2}^{q} \frac{\partial}{\partial x_{2}} V (x) - 1 (q_{1} = q_{2} = n) δ n λ \frac{\partial}{\partial x_{2}} V (x^{q}) + δ n λ 1 (x_{1}^{q} = 0) (\frac{\partial}{\partial x_{1}} + \frac{\partial}{\partial x_{2}}) V (x^{q}) \\ = & - 1 - 1 (q_{1} = q_{2} = n) δ n λ \frac{\partial}{\partial x_{2}} V (x^{q}), \end{array}

where the last equality is due to (B.15) from Lemma B.6. We have thus shown that

G_{X} V (x^{q}) \leq - 1 + 14 / 17 - 1 (q_{1} = q_{2} = n) δ n λ \frac{\partial}{\partial x_{2}} V (x^{q}) - q_{3} 1 (b > 1) (- Δ_{2} V (x^{q} - δ e^{(2)})) .

Now $V (x^{q}) = V (0, \sqrt{n})$ when $q_{1} = q_{2} = n$ , so (B.17) in Lemma B.6 tells us that $V (0, \sqrt{n}) = 1 / β$ provided that $\sqrt{n} \geq κ_{2} = 2 (β / 17 + β)$ , which we assume, so

\begin{array}{l} - 1 (q_{1} = q_{2} = n) δ n λ \frac{\partial}{\partial x_{2}} V (x) - q_{3} 1 (b > 1) (- Δ_{2} V (x^{q} - δ e^{(2)})) \\ = & - 1 (q_{1} = q_{2} = n) δ n λ \frac{1}{β} + q_{3} 1 (b > 1) \int_{x_{2}^{q} - δ}^{x_{2}^{q}} \frac{\partial}{\partial x_{2}} V (x_{1}^{q}, u) d u \\ \leq & \frac{δ}{β} (q_{3} 1 (b > 1) - n λ 1 (q_{1} = q_{2} = n)), \end{array}

where the inequality follows from (B.17) in Lemma B.6. □

B.2.2. Proof of Lemma B.6.

Fix $κ_{1} = 17 / β + β$ and $κ_{2} = 2 κ_{1}$ . The function $f^{(2)} (x)$ was considered in lemma 8 of Braverman (2020), which tells us that

\begin{array}{l} - (β δ + x_{1} - x_{2}) \frac{\partial}{\partial x_{1}} f^{(2)} (x) - x_{2} \frac{\partial}{\partial x_{2}} f^{(2)} (x) = - 1, & x \in Ω with x_{2} > κ_{2} / \sqrt{n}, \\ \frac{\partial}{\partial x_{1}} f^{(2)} (x) = \frac{\partial}{\partial x_{2}} f^{(2)} (x), & x \in Ω with x_{1} = 0 . \end{array}

Combining this with

\frac{\partial}{\partial x_{1}} V (x) = - δ \frac{\partial}{\partial x_{1}} f^{(2)} (- δ x_{1}, δ x_{2}), \frac{\partial}{\partial x_{2}} V (x) = δ \frac{\partial}{\partial x_{2}} f^{(2)} (- δ x_{1}, δ x_{2})

(B.19)

gives us (B.15). Going forward, we assume that

x \in Ω

. Let us bound the derivatives of V(x). On page 1100 of Braverman (2020), it was shown that

\frac{\partial^{2}}{\partial x_{1}^{2}} f^{(2)} (x) \leq \frac{n}{β (κ_{1} - β)} + \frac{κ_{1}}{κ_{1} - β} \frac{4 n}{β (κ_{2} - κ_{1})} = \frac{n}{17} + \frac{17 / β + β}{17 / β} \frac{4 n}{β (17 / β + β)} = \frac{5 n}{17}, x \in Ω,

implying the bound on

\partial^{2} V (x) / \partial x_{1}^{2}

in (B.16). We now prove (B.17), followed by the bound on V(x). Unfortunately,

\partial f^{(2)} (x) / \partial x_{2}

and

\partial^{2} f^{(2)} (x) / \partial x_{2}^{2}

are not bounded in Braverman (2020), so we must bound these partial derivatives ourselves.

We write the equation for $\partial f^{(2)} (x) / \partial x_{2}$ in (B.22), but writing it requires us to introduce some nontrivial objects from Braverman (2020). The first object we need is the family of curves ${Γ^{(κ)} \subset Ω}_{κ \geq β}$ , where $Γ^{(κ)}$ is the graph of the unique fluid-limit trajectory that intersects the x₂ axis at the point $(0, κ / \sqrt{n})$ . For the purposes of this proof, it suffices to treat $Γ^{(κ)}$ as a two-dimensional geometric object satisfying the following properties:

The set $Γ^{(κ)}$ is a graph of a continuous function; that is, $Γ^{(κ)} = {(x_{1}, f (x_{1})}$ for some continuous function $f : R_{+} \to R_{+}$ .
The intersection $Γ^{(κ)} \cap {x \in Ω : x_{1} = 0} = (0, κ / \sqrt{n})$ .
If $x \in Γ^{(κ)}$ and $x_{1} < 0$ , then $x_{2} > κ / \sqrt{n}$ .
If $κ^{'} > κ$ , then $Γ^{(κ)} \cap Γ^{(κ^{'})} = \emptyset$ and $Γ^{(κ^{'})}$ lies above $Γ^{(κ)}$ .

The first three properties are implied by lemma 5 of Braverman (2020), and the fourth one follows from (39) there. Because $Γ^{(κ)}$ is a graph, sets of the form ${x < Γ^{(κ)}}, {x \leq Γ^{(κ)}}$ , etc., are well defined. Let us use $Γ^{(κ_{1})}$ and $Γ^{(κ_{2})}$ to partition Ω into the four sets

\begin{array}{l} S_{0} = {x \in Ω : x_{2} \leq κ_{1} / \sqrt{n}}, S_{1} = {x \in Ω : x_{2} \geq κ_{1} / \sqrt{n}, x \leq Γ^{(κ_{1})}}, \\ S_{2} = {x \in Ω : Γ^{(κ_{1})} \leq x \leq Γ^{(κ_{2})}}, S_{3} = {x \in Ω : x \geq Γ^{(κ_{2})}} . \end{array}

The four properties of $Γ^{(κ)}$ are sufficient to argue that $S_{0} \cup S_{1} \cup S_{2} \cup S_{3} = Ω$ and that the interiors of S_i and S_j are disjoint when $i \neq j$ ; we refer the reader to section C.2 of Braverman (2020) for more details.

The last object we need is the function $τ (x)$ , which represents the first time that the fluid limit hits the x₂ axis starting from a state $x > Γ^{(β)}$ . The precise definition of $τ (x)$ is bulky and involves the Lambert-W function, but we can get by with only a few of its properties. Namely, for any $κ > β$ , lemma 6 of Braverman (2020) introduces a nonnegative function $τ : {x \in Ω : x \geq Γ^{(κ)}} \to R_{+}$ with $τ (0, x_{2}) = 0$ , which is differentiable for all $x \in {x \in Ω : x \geq Γ^{(κ)}}$ and satisfies

\frac{\partial}{\partial x_{1}} τ (x) = - \frac{e^{- τ (x)}}{x_{2} e^{- τ (x)} - β / \sqrt{n}} \leq 0, \frac{\partial}{\partial x_{2}} τ (x) = τ (x) \frac{\partial}{\partial x_{1}} τ (x) \leq 0, x \in {x \in Ω : x \geq Γ^{(κ)}} .

(B.20)

By choosing $κ = κ_{1} = 17 / β + β$ , we are assured that $τ (x)$ is defined on the set ${x \in Ω : x \geq Γ^{(κ_{1})}} = S_{2} \cup S_{3}$ . Item 1 of lemma 6 in Braverman (2020) tells us that $τ (x)$ is tied to $Γ^{(κ)}$ for any $κ > β$ via

x_{2} e^{- τ (x)} \geq κ / \sqrt{n}, x \geq Γ^{(κ)} .

(B.21)

We are now ready to bound the derivatives of $f^{(2)} (x)$ . Equation (C.9) of Braverman (2020) tells us that

\frac{\partial}{\partial x_{2}} f^{(2)} (x) = {\begin{array}{l} 0, & x \in S_{0}, \\ \frac{1}{x_{2}} ϕ (x_{2}), & x \in S_{1}, \\ \frac{1}{x_{2}} (ϕ (x_{2}) - ϕ (x_{2} e^{- τ (x)})) + ϕ (x_{2} e^{- τ (x)}) \frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1), & x \in S_{2}, \\ \frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1), & x \in S_{3}, \end{array}

(B.22)

where

ϕ (x) = ϕ^{(δ κ_{1}, δ κ_{2})} (x)

is the smoothed indicator defined in (B.13). By differentiating both sides of (B.13), it is straightforward to check that

ϕ (x)

is nondecreasing, and

ϕ^{'} (x) \leq \frac{4}{δ (κ_{2} - κ_{1})} = \frac{4 \sqrt{n}}{17 / β + β} .

(B.23)

Let us now argue that $\partial f^{(2)} (x) / \partial x_{2} \leq \sqrt{n} / β$ for any $x \in Ω$ . If $x \in S_{3}$ , this bound is implied by the inequality $e^{- t} (t + 1) \leq 1$ for $t \geq 0$ . If $x \in S_{1}$ , the bound is implied by the fact that $ϕ (x_{2}) \leq 1$ and $1 / x_{2} \leq \sqrt{n} / κ_{1} \leq \sqrt{n} / β$ . If $x \in S_{2}$ , we note that $ϕ (x_{2}) - ϕ (x_{2} e^{- τ (x)}) \geq 0$ , and $1 / x_{2} \leq \sqrt{n} / β$ , meaning that

\begin{array}{l} \frac{\partial}{\partial x_{2}} f^{(2)} (x) = & \frac{1}{x_{2}} (ϕ (x_{2}) - ϕ (x_{2} e^{- τ (x)})) + ϕ (x_{2} e^{- τ (x)}) \frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1) \\ \leq & \frac{\sqrt{n}}{β} (ϕ (x_{2}) - ϕ (x_{2} e^{- τ (x)})) + ϕ (x_{2} e^{- τ (x)}) \frac{\sqrt{n}}{β} = \frac{\sqrt{n}}{β} . \end{array}

Observe that $\partial f^{(2)} (x) / \partial x_{2} = \sqrt{n} / β$ when $τ (x) = 0$ , which is true for any $x \in S_{2} \cup S_{3}$ with $x_{1} = 0$ , implying the claim about $\partial V (x) / \partial x_{2}$ in (B.17). To conclude the proof, it remains to show $\partial^{2} V (x) / \partial x_{2}^{2} \leq 9 / 17$ by differentiating both sides in (B.22). Note that $\partial^{2} f^{(2)} (x) / \partial x_{2}^{2} = 0$ for $x \in S_{0}$ . When $x \in S_{1}$ , we use the bound on $ϕ^{'} (x)$ in (B.23), as well as the fact that $1 / x_{2} \leq \sqrt{n} / β$ , to see that

\frac{\partial^{2}}{\partial x_{2}^{2}} f^{(2)} (x) = - \frac{1}{x_{2}^{2}} ϕ (x_{2}) + \frac{1}{x_{2}} ϕ^{'} (x_{2}) \leq \frac{1}{x_{2}} ϕ^{'} (x_{2}) \leq \frac{\sqrt{n}}{β} \frac{4 \sqrt{n}}{17 / β + β} \leq \frac{4 n}{17}, x \in S_{1} .

When $x \in S_{3}$ ,

\frac{\partial^{2}}{\partial x_{2}^{2}} f^{(2)} (x) = - \frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1) \frac{\partial}{\partial x_{2}} τ (x) + \frac{\sqrt{n}}{β} e^{- τ (x)} \frac{\partial}{\partial x_{2}} τ (x) = - \frac{\sqrt{n}}{β} e^{- τ (x)} τ (x) \frac{\partial}{\partial x_{2}} τ (x) .

Using the expression for $\partial τ (x) / \partial x_{2}$ in (B.20), we see that

\frac{\partial^{2}}{\partial x_{2}^{2}} f^{(2)} (x) = \frac{e^{- τ (x)}}{x_{2} e^{- τ (x)} - β / \sqrt{n}} τ^{2} (x) \frac{\sqrt{n}}{β} e^{- τ (x)} \leq \frac{n}{7 β (κ_{2} - β)} \leq \frac{n}{7 β (34 / β + β)} \leq \frac{4 n}{17}, x \in S_{3} .

(B.24)

The first inequality follows from $x_{2} e^{- τ (x)} \geq κ_{2} / \sqrt{n}$ because of (B.21) and the fact that $t^{2} e^{- 2 t} \leq 1 / 7$ for $t \geq 0$ . Lastly, we consider the case when $x \in S_{2}$ , for which we recall that

\frac{\partial}{\partial x_{2}} f^{(2)} (x) = \frac{1}{x_{2}} (ϕ (x_{2}) - ϕ (x_{2} e^{- τ (x)})) + ϕ (x_{2} e^{- τ (x)}) \frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1), x \in S_{2} .

(B.25)

To help organize terms, let $g (x_{2}) = x_{2} e^{- τ (x)}$ and note from (B.20) that

\begin{array}{l} g^{'} (x_{2}) = e^{- τ (x)} (1 - x_{2} \frac{\partial}{\partial x_{2}} τ (x)) & = e^{- τ (x)} (1 + \frac{x_{2} e^{- τ (x)}}{x_{2} e^{- τ (x)} - β / \sqrt{n}} τ (x)) \\ = e^{- τ (x)} (1 + τ (x) + \frac{β / \sqrt{n}}{x_{2} e^{- τ (x)} - β / \sqrt{n}} τ (x)) . \end{array}

We see that $g^{'} (x_{2}) \geq 0$ because $τ (x) \geq 0$ and $x_{2} e^{- τ (x)} \geq κ_{1} / \sqrt{n}$ for $x \in S_{2}$ because of (B.21). Furthermore, because $e^{- t} t \leq 1$ and $e^{- t} (t + 1) \leq 1$ for $t \geq 0$ , we conclude that

0 \leq g^{'} (x_{2}) \leq 1 + \frac{β / \sqrt{n}}{x_{2} e^{- τ (x)} - β / \sqrt{n}} \leq 1 + \frac{β}{κ_{1} - β} = 1 + \frac{β^{2}}{17} .

(B.26)

Let us now differentiate and bound each term on the right-hand side of (B.25) individually. First,

\begin{array}{l} \frac{\partial}{\partial x_{2}} (\frac{1}{x_{2}} (ϕ (x_{2}) - ϕ (x_{2} e^{- τ (x)}))) = & - \frac{1}{x_{2}^{2}} (ϕ (x_{2}) - ϕ (x_{2} e^{- τ (x)})) + \frac{1}{x_{2}} (ϕ^{'} (x_{2}) - g^{'} (x_{2}) ϕ^{'} (x_{2} e^{- τ (x)})) \\ \leq & \frac{1}{x_{2}} ϕ^{'} (x_{2}) \leq \frac{\sqrt{n}}{β} \frac{4 \sqrt{n}}{17 / β + β} = \frac{4 n}{17} . \end{array}

The first inequality is because $ϕ (x)$ is nondecreasing and $g^{'} (x_{2}) \geq 0$ , and the second inequality follows from the fact that $1 / x_{2} \leq \sqrt{n} / β$ and the bound on $ϕ^{'} (x)$ in (B.23). Differentiating the second term in (B.25), we get

\begin{array}{l} \frac{\partial}{\partial x_{2}} (ϕ (x_{2} e^{- τ (x)}) \frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1)) = & ϕ (x_{2} e^{- τ (x)}) \frac{\partial}{\partial x_{2}} (\frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1)) \\ + ϕ^{'} (x_{2} e^{- τ (x)}) g^{'} (x_{2}) \frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1) . \end{array}

To bound the first term, we use the fact that $ϕ (x) \leq 1$ ; we repeat the argument used to prove (B.24) to see that

ϕ (x_{2} e^{- τ (x)}) \frac{\partial}{\partial x_{2}} (\frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1)) \leq \frac{n}{7 β (κ_{1} - β)} = \frac{n}{7 β (17 / β)} = \frac{n}{119} .

Furthermore, the bounds on $ϕ^{'} (x)$ and $g^{'} (x_{2})$ in (B.23) and (B.26), together with the fact that $e^{- t} (t + 1) \leq 1$ , imply that

ϕ^{'} (x_{2} e^{- τ (x)}) g^{'} (x_{2}) \frac{\sqrt{n}}{β} e^{- τ (x)} (τ (x) + 1) \leq \frac{4 \sqrt{n}}{17 / β + β} (1 + \frac{β^{2}}{17}) \frac{\sqrt{n}}{β} = \frac{4 \sqrt{n} β}{17 + β^{2}} \frac{17 + β^{2}}{17} \frac{\sqrt{n}}{β} = \frac{4 n}{17} .

Combining the pieces yields $\partial^{2} f^{(2)} (x) / \partial x_{2}^{2} \leq 9 n / 17$ , proving (B.17).

To conclude, we prove that $0 \leq V (x) \leq C (β) (1 + x_{2})$ for $x_{2} \geq κ_{2}$ by proving that $0 \leq f^{(2)} (x) \leq C (β) (1 + \sqrt{n} x_{2})$ for $x_{2} \geq κ_{2} / \sqrt{n}$ . The form of $f^{(2)} (x)$ below can be found in lemma B.1 of Braverman (2020):

f^{(2)} (x) = {\begin{array}{l} 0, x \in S_{0}, \\ \int_{0}^{\log (\sqrt{n} x_{2} / κ_{1})} ϕ (x_{2} e^{- t}) d t, x_{2} \leq κ_{2} / \sqrt{n}, x \in S_{1}, \\ \log (\sqrt{n} x_{2} / κ_{2}) + \int_{0}^{\log (κ_{2} / κ_{1})} ϕ (\frac{κ_{2}}{\sqrt{n}} e^{- t}) d t, x_{2} \geq κ_{2} / \sqrt{n}, x \in S_{1}, \\ \int_{0}^{τ (x)} ϕ (x_{2} e^{- t}) d t + \frac{\sqrt{n}}{β} \int_{κ_{1} / \sqrt{n}}^{x_{2} e^{- τ (x)}} ϕ (t) d t, x_{2} \leq κ_{2} / \sqrt{n}, x \in S_{2}, \\ \log (\sqrt{n} x_{2} / κ_{2}) + \int_{\log (\sqrt{n} x_{2} / κ_{2})}^{τ (x)} ϕ (x_{2} e^{- t}) d t + \frac{\sqrt{n}}{β} \int_{κ_{1} / \sqrt{n}}^{x_{2} e^{- τ (x)}} ϕ (t) d t, x_{2} \geq κ_{2} / \sqrt{n}, x \in S_{2}, \\ τ (x) + \frac{x_{2} e^{- τ (x)} - κ_{2} / \sqrt{n}}{β / \sqrt{n}} + \frac{\sqrt{n}}{β} \int_{κ_{1} / \sqrt{n}}^{κ_{2} / \sqrt{n}} ϕ (t) d t, x \in S_{3} . \end{array}

The fact that $f^{(2)} (x) \geq 0$ follows from $ϕ (x), τ (x) \geq 0$ , the definitions of S₁, S₂, and S₃, and (B.21). We combine all the cases above into the single upper bound

\begin{array}{l} f^{(2)} (x) \leq & \log (\sqrt{n} x_{2} / κ_{1}) 1 (x \in S_{1} \cup S_{2} \cup S_{3}) + \log (κ_{2} / κ_{1}) + τ (x) 1 (x \in S_{2} \cup S_{3}) \\ + \frac{\sqrt{n}}{β} (x_{2} e^{- τ (x)} - κ_{1} / \sqrt{n}) 1 (x \in S_{2}) + \frac{\sqrt{n}}{β} (x_{2} e^{- τ (x)} - κ_{2} / \sqrt{n}) 1 (x \in S_{3}) + \frac{κ_{2} - κ_{1}}{β} . \end{array}

(B.27)

Using the inequality $\log (t) \leq 1 + t$ for $t \geq 0$ , and the fact that $κ_{1} = 17 / β + β$ and $κ_{2} = 2 κ_{1}$ , we see that $\log (κ_{2} / κ_{1}) = \log (2), (κ_{2} - κ_{1}) / β = 1 + 17 / β^{2}$ ,

\begin{array}{l} \log (\sqrt{n} x_{2} / κ_{1}) 1 (x \in S_{1} \cup S_{2} \cup S_{3}) \leq 1 + \frac{\sqrt{n} x_{2}}{κ_{1}} \leq 1 + \frac{\sqrt{n} x_{2}}{β}, \\ \frac{\sqrt{n}}{β} (x_{2} e^{- τ (x)} - κ_{1} / \sqrt{n}) 1 (x \in S_{2}) \leq \frac{\sqrt{n} x_{2}}{β}, and \frac{\sqrt{n}}{β} (x_{2} e^{- τ (x)} - κ_{2} / \sqrt{n}) 1 (x \in S_{3}) \leq \frac{\sqrt{n} x_{2}}{β} . \end{array}

Furthermore, (B.21) and the definitions of S₂ and S₃ imply that

τ (x) 1 (x \in S_{2} \cup S_{3}) \leq \log (x_{2} \sqrt{n} / κ_{1}) \leq 1 + \frac{\sqrt{n} x_{2}}{κ_{1}} = 1 + \frac{\sqrt{n} x_{2}}{β} .

We conclude by combining all of these bounds with (B.27). □

B.3. Proof of Lemma B.3

Assume without loss of generality that $Q_{1} (0) = n$ and $Q_{2} (0) = θ_{2}$ , because starting from $(q_{1}, θ_{2}, q_{3}, \dots, q_{b + 1}) \in S_{Q}$ , a state with $q_{1} = n$ and $q_{2} = θ_{2}$ must be visited before $τ_{2} (2 θ_{2})$ , so

\min_{\begin{matrix} q_{2} = θ_{2} \\ q \in S_{Q} \end{matrix}} ℙ_{q} (τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})) \geq \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})) .

We bound the right-hand side by relating it to the ruin probability in a certain gambler’s ruin problem. Namely, we construct a random walk ${\bar{R} (t)}$ with $\bar{R} (0) = 0$ that satisfies

\min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})) \geq ℙ (\inf_{t \geq 0} {\bar{R} (t) = ⌊ γ \sqrt{n} ⌋} > \inf_{t \geq 0} {\bar{R} (t) = - ⌊ \sqrt{n} β / 2 ⌋}) .

(B.28)

Jumps in the random walk are governed by a Poisson process with rate $n λ + θ_{1} - 3 θ_{2}$ ; the up-step and down-step probabilities are

\frac{n λ}{n λ + θ_{1} - 3 θ_{2}} and \frac{θ_{1} - 3 θ_{2}}{n λ + θ_{1} - 3 θ_{2}},

(B.29)

respectively. Note that we implicitly assume n is large enough so that

θ_{1} - 3 θ_{2} > 0

. The right-hand side in (B.28) is therefore the ruin probability in a gambler’s ruin problem with initial wealth

⌊ \sqrt{n} β / 2 ⌋

and opponent’s wealth

⌊ \sqrt{n} β / 2 ⌋ + ⌊ γ \sqrt{n} ⌋

. A formula for the ruin probability was given by equation (2.4) in Section XIV.2 of Feller (1968):

ℙ (\inf_{t \geq 0} {\bar{R} (t) = ⌊ γ \sqrt{n} ⌋} > \inf_{t \geq 0} {\bar{R} (t) = - ⌊ \sqrt{n} β / 2 ⌋}) = 1 - \frac{1 - {((θ_{1} - 3 θ_{2}) / n λ)}^{⌊ \sqrt{n} β / 2 ⌋}}{1 - {((θ_{1} - 3 θ_{2}) / n λ)}^{⌊ \sqrt{n} β / 2 ⌋ + ⌊ γ \sqrt{n} ⌋}} .

Recalling the values of θ₁ and θ₂ and the fact that $γ > β$ , we see that

\frac{θ_{1} - 3 θ_{2}}{n λ} = \frac{n - ⌊ \sqrt{n} β / 2 ⌋ - 3 ⌊ γ \sqrt{n} ⌋}{n λ} = 1 - \frac{- β \sqrt{n} + ⌊ \sqrt{n} β / 2 ⌋ + 3 ⌊ γ \sqrt{n} ⌋}{n λ} < 1,

and therefore,

\lim_{n \to \infty} \frac{1 - {((θ_{1} - 3 θ_{2}) / n λ)}^{⌊ \sqrt{n} β / 2 ⌋}}{1 - {((θ_{1} - 3 θ_{2}) / n λ)}^{⌊ \sqrt{n} β / 2 ⌋ + ⌊ γ \sqrt{n} ⌋}} = \lim_{n \to \infty} \frac{1 - {(1 - \frac{- β \sqrt{n} + ⌊ \sqrt{n} β / 2 ⌋ + 3 ⌊ γ \sqrt{n} ⌋}{n λ})}^{⌊ \sqrt{n} β / 2 ⌋}}{1 - {(1 - \frac{- β \sqrt{n} + ⌊ \sqrt{n} β / 2 ⌋ + 3 ⌊ γ \sqrt{n} ⌋}{n λ})}^{⌊ \sqrt{n} β / 2 ⌋ + ⌊ γ \sqrt{n} ⌋}} < 1,

implying Lemma B.3. It remains to construct

{\bar{R} (t)}

Recall that $Q_{1} (0) = n$ and $Q_{2} (0) = θ_{2}$ , and let ${\hat{Q} (t)}$ be a copy of ${Q (t)}$ , but with the modification that any server with a nonempty buffer permanently halts all its work. Then ${\hat{Q}}_{i} (t) \geq Q_{i} (t)$ for all $t \geq 0$ and all $1 \leq i \leq b + 1$ because this modified system has the same arrival stream as ${Q (t)}$ but serves fewer customers. It follows that

\begin{array}{l} τ_{1} (θ_{1}) = \inf_{t \geq 0} {Q_{1} (t) = θ_{1}} \leq \inf_{t \geq 0} {{\hat{Q}}_{1} (t) = θ_{1}}, \\ τ_{2} (2 θ_{2}) = \inf_{t \geq 0} {Q_{2} (t) = 2 θ_{2}} \geq \inf_{t \geq 0} {{\hat{Q}}_{2} (t) = 2 θ_{2}}, and \\ \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})) \geq \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (\inf_{t \geq 0} {{\hat{Q}}_{2} (t) = 2 θ_{2}} > \inf_{t \geq 0} {{\hat{Q}}_{1} (t) = θ_{1}}) . \end{array}

Now consider the process

R (t) = {\hat{Q}}_{1} (t) + {\hat{Q}}_{2} (t) - {\hat{Q}}_{1} (0) - {\hat{Q}}_{2} (0) = {\hat{Q}}_{1} (t) + {\hat{Q}}_{2} (t) - (n + θ_{2}) .

Note that $R (t) \geq {\hat{Q}}_{1} (t) - {\hat{Q}}_{1} (0) = {\hat{Q}}_{1} (t) - n$ because ${\hat{Q}}_{2} (t)$ is nondecreasing in t, which implies that

\inf_{t \geq 0} {R (t) = - ⌊ \sqrt{n} β / 2 ⌋} \geq \inf_{t \geq 0} {{\hat{Q}}_{1} (t) = n - ⌊ \sqrt{n} β / 2 ⌋} = \inf_{t \geq 0} {{\hat{Q}}_{1} (t) = θ_{1}} .

Note also that $\inf_{t \geq 0} {{\hat{Q}}_{2} (t) = 2 θ_{2}} = \inf_{t \geq 0} {R (t) = θ_{2}}$ because ${\hat{Q}}_{2} (t)$ is nondecreasing in t and ${\hat{Q}}_{2} (t)$ increases only when ${\hat{Q}}_{1} (t) = n$ . Hence,

\begin{array}{l} \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (\inf_{t \geq 0} {R (t) = θ_{2}} > \inf_{t \geq 0} {R (t) = - ⌊ \sqrt{n} β / 2 ⌋}) \\ \leq & \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (\inf_{t \geq 0} {{\hat{Q}}_{2} (t) = 2 θ_{2}} > \inf_{t \geq 0} {{\hat{Q}}_{1} (t) = θ_{1}}) \leq \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})) . \end{array}

(B.30)

An arrival to ${\hat{Q} (t)}$ increases the value of ${R (t)}$ , and a service completion by a server with an empty buffer decreases its value. However, ${R (t)}$ is still not the random walk we desire because the rate at which it decreases depends on the state of $\hat{Q} (t)$ . Instead, we want a random walk with a constant downward rate.

To construct this random walk, for $0 \leq t \leq \inf_{t \geq 0} {{\hat{Q}}_{2} (t) = 2 θ_{2}}$ let us define ${\bar{Q} (t) = ({\bar{Q}}_{1} (t), {\bar{Q}}_{2} (t))}$ by setting $\bar{Q} (0) = \hat{Q} (0)$ and defining the transitions of the joint process ${(\hat{Q} (t), \bar{Q} (t))}$ in Tables B.1 –B.3. Because we are defining $\bar{Q} (t)$ only until the time ${\hat{Q}}_{2} (t)$ hits $2 θ_{2}$ , we do not need to specify the transitions for states where ${\hat{Q}}_{2} (t) > 2 θ_{2}$ . The intuition for the transition structure is as follows. Because arrivals occur at the constant rate of $n λ$ , we want any arrival to ${\hat{Q} (t)}$ to also occur in ${\bar{Q} (t)}$ . However, we want to keep the rate at which ${\bar{Q} (t)}$ decreases a constant value of $θ_{1} - 3 θ_{2}$ . To accomplish this, when ${\hat{Q}}_{1} (t) \geq θ_{1} - θ_{2}$ , the transitions in Table B.2 have ${\bar{Q} (t)}$ ignore some departures from ${\hat{Q} (t)}$ ; when ${\hat{Q}}_{1} (t) < θ_{1} - θ_{2}$ , we supplement the departures from ${\hat{Q} (t)}$ ; for example, see transition $# 8$ in Table B.3. Having defined $\bar{Q} (t)$ , let us define

\bar{R} (t) = {\bar{Q}}_{1} (t) - {\bar{Q}}_{1} (0) + {\bar{Q}}_{2} (t) - {\bar{Q}}_{2} (0), t \leq \inf_{t \geq 0} {{\hat{Q}}_{2} (t) = 2 θ_{2}} .

Table B.1. Arrival Transitions for the Joint Process in State $(({\hat{u}}_{1}, {\hat{u}}_{2}), ({\bar{u}}_{1}, {\bar{u}}_{2}))$

Table B.1. Arrival Transitions for the Joint Process in State $(({\hat{u}}_{1}, {\hat{u}}_{2}), ({\bar{u}}_{1}, {\bar{u}}_{2}))$

#	Rate	Transition
1	$n λ 1 ({\hat{u}}_{1} < n, {\bar{u}}_{1} < n)$	$(({\hat{u}}_{1} + 1, {\hat{u}}_{2}), ({\bar{u}}_{1} + 1, {\bar{u}}_{2}))$
2	$n λ 1 ({\hat{u}}_{1} = n, {\bar{u}}_{1} < n)$	$(({\hat{u}}_{1}, {\hat{u}}_{2} + 1), ({\bar{u}}_{1} + 1, {\bar{u}}_{2}))$
3	$n λ 1 ({\hat{u}}_{1} < n, {\bar{u}}_{1} = n)$	$(({\hat{u}}_{1} + 1, {\hat{u}}_{2}), ({\bar{u}}_{1}, {\bar{u}}_{2} + 1))$
4	$n λ 1 ({\hat{u}}_{1} = n, {\bar{u}}_{1} = n)$	$(({\hat{u}}_{1}, {\hat{u}}_{2} + 1), ({\bar{u}}_{1}, {\bar{u}}_{2} + 1))$

Table B.2. Departure Transitions for the Joint Process in State $(({\hat{u}}_{1}, {\hat{u}}_{2}), ({\bar{u}}_{1}, {\bar{u}}_{2}))$ with ${\hat{u}}_{2} \leq 2 θ_{2}$ and ${\hat{u}}_{1} \geq θ_{1} - θ_{2}$

#	Rate	Transition
5	$θ_{1} - 3 θ_{2}$	$(({\hat{u}}_{1} - 1, {\hat{u}}_{2}), ({\bar{u}}_{1} - 1, {\bar{u}}_{2}))$
6	${\hat{u}}_{1} - {\hat{u}}_{2} - (θ_{1} - 3 θ_{2})$	$(({\hat{u}}_{1} - 1, {\hat{u}}_{2}), ({\bar{u}}_{1}, {\bar{u}}_{2}))$

Table B.3. Departure Transitions for the Joint Process in State $(({\hat{u}}_{1}, {\hat{u}}_{2}), ({\bar{u}}_{1}, {\bar{u}}_{2}))$ with ${\hat{u}}_{2} \leq 2 θ_{2}$ and ${\hat{u}}_{1} < θ_{1} - θ_{2}$

#	Rate	Transition
7	$({\hat{u}}_{1} - 2 θ_{2}) 1 ({\hat{u}}_{1} \geq 2 θ_{2})$	$(({\hat{u}}_{1} - 1, {\hat{u}}_{2}), ({\bar{u}}_{1} - 1, {\bar{u}}_{2}))$
8	$θ_{1} - θ_{2} - {\hat{u}}_{1} \lor 2 θ_{2}$	$(({\hat{u}}_{1}, {\hat{u}}_{2}), ({\bar{u}}_{1} - 1, {\bar{u}}_{2}))$
9	$2 θ_{2} \land {\hat{u}}_{1} - {\hat{u}}_{2}$	$(({\hat{u}}_{1} - 1, {\hat{u}}_{2}), ({\bar{u}}_{1}, {\bar{u}}_{2}))$

To prove that ${\bar{R} (t)}$ satisfies (B.28), we show that

\bar{R} (t) \geq R (t) for all times t \leq \min {\inf_{t \geq 0} {\bar{R} (t) = ⌊ γ \sqrt{n} ⌋}, \inf_{t \geq 0} {\bar{R} (t) = - ⌊ \sqrt{n} β / 2 ⌋}}

(B.31)

and, as a result,

\begin{array}{l} \inf_{t \geq 0} {\bar{R} (t) = ⌊ γ \sqrt{n} ⌋} \leq \inf_{t \geq 0} {R (t) = ⌊ γ \sqrt{n} ⌋}, \\ \inf_{t \geq 0} {\bar{R} (t) = - ⌊ \sqrt{n} β / 2 ⌋} \geq \inf_{t \geq 0} {R (t) = - ⌊ \sqrt{n} β / 2 ⌋} . \end{array}

Together with (B.30), these inequalities imply that

\begin{array}{l} \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (\inf_{t \geq 0} {\bar{R} (t) = ⌊ γ \sqrt{n} ⌋} > \inf_{t \geq 0} {\bar{R} (t) = - ⌊ \sqrt{n} β / 2 ⌋}) \\ \leq & \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (\inf_{t \geq 0} {R (t) = ⌊ γ \sqrt{n} ⌋} > \inf_{t \geq 0} {R (t) = - ⌊ \sqrt{n} β / 2 ⌋}) \leq \min_{\begin{matrix} q_{2} = θ_{2}, q_{1} = n \\ q \in S_{Q} \end{matrix}} ℙ_{q} (τ_{1} (θ_{1}) < τ_{2} (2 θ_{2})) . \end{array}

To see why (B.31) is true, let us study the transitions in Tables B.1 –B.3. Table B.1 tells us that $\bar{R} (t)$ and R(t) increase at the same times. The transitions in Table B.2 show that any decrease in ${\bar{Q}}_{1} (t)$ , and consequently $\bar{R} (t)$ , must be accompanied by a decrease in ${\hat{Q}}_{1} (t)$ and R(t) but not vice versa. The only way ${\bar{Q}}_{1} (t)$ can ever drop below ${\hat{Q}}_{1} (t)$ is via transition 8, which can happen only if ${\hat{Q}}_{1} (t) < θ_{1} - θ_{2}$ , so the first intersection of ${\bar{Q}}_{1} (t)$ and ${\hat{Q}}_{1} (t)$ has to occur below $θ_{1} - θ_{2}$ . Therefore, $\bar{R} (t) \geq R (t)$ for all times

t \leq \min {\inf_{t \geq 0} {{\bar{Q}}_{1} (t) = θ_{1} - θ_{2}}, \inf_{t \geq 0} {{\hat{Q}}_{2} (t) = 2 θ_{2}}} = \min {\inf_{t \geq 0} {{\bar{Q}}_{1} (t) = θ_{1} - θ_{2}}, \inf_{t \geq 0} {R (t) = θ_{2}}} .

Let us now prove (B.31) by showing that the right-hand side is greater than

\min {\inf_{t \geq 0} {\bar{R} (t) = ⌊ γ \sqrt{n} ⌋}, \inf_{t \geq 0} {\bar{R} (t) = - ⌊ \sqrt{n} β / 2 ⌋}} .

Because $\bar{R} (t) \geq R (t)$ ,

\min {\inf_{t \geq 0} {{\bar{Q}}_{1} (t) = θ_{1} - θ_{2}}, \inf_{t \geq 0} {R (t) = θ_{2}}} \geq \min {\inf_{t \geq 0} {{\bar{Q}}_{1} (t) = θ_{1} - θ_{2}}, \inf_{t \geq 0} {\bar{R} (t) = θ_{2}}} .

Furthermore, because ${\bar{Q}}_{2} (t)$ is nondecreasing and increases only at those times when ${\bar{Q}}_{1} (t) = n$ , it follows that for all $t \leq \inf_{t \geq 0} {\bar{R} (t) = θ_{2}}$ ,

\bar{R} (t) = {\bar{Q}}_{1} (t) + {\bar{Q}}_{2} (t) - n - θ_{2} \leq {\bar{Q}}_{1} (t) - n + θ_{2},

and therefore

\begin{array}{l} \min {\inf_{t \geq 0} {{\bar{Q}}_{1} (t) = θ_{1} - θ_{2}}, \inf_{t \geq 0} {\bar{R} (t) = θ_{2}}} \\ = & \min {\inf_{t \geq 0} {{\bar{Q}}_{1} (t) = n - ⌊ \sqrt{n} β / 2 ⌋ - θ_{2}}, \inf_{t \geq 0} {\bar{R} (t) = ⌊ γ \sqrt{n} ⌋}} \\ \geq & \min {\inf_{t \geq 0} {\bar{R} (t) = - ⌊ \sqrt{n} β / 2 ⌋}, \inf_{t \geq 0} {\bar{R} (t) = ⌊ γ \sqrt{n} ⌋}} . □ \end{array}

B.4. Proving Lemma B.4

Central to our argument is a result about the moment-generating function of the duration of a gambler’s ruin game. We now describe this result and then prove Lemma B.4. Consider a discrete-time gambler’s ruin problem where the initial player’s wealth is z, the win probability is p, the loss probability is q, and the player keeps playing until he or she goes broke or accumulates a total wealth of a. Let $D_{z} \in ℤ_{+}$ be the number of turns until the game ends, given an initial wealth of z. An expression for the generating function $E s^{D_{z}}$ was given in (4.11) and (4.12) in section XIV.4 of Feller (1968):

E s^{D_{z}} = \frac{λ_{1}^{a} (s) λ_{2}^{z} (s) - λ_{1}^{z} (s) λ_{2}^{a} (s)}{λ_{1}^{a} (s) - λ_{2}^{a} (s)} + \frac{λ_{1}^{z} (s) - λ_{2}^{z} (s)}{λ_{1}^{a} (s) - λ_{2}^{a} (s)}, s \in (0, 1),

(B.32)

where

λ_{1} (s) = \frac{1 + \sqrt{1 - 4 p q s^{2}}}{2 p s}, and λ_{2} (s) = \frac{1 - \sqrt{1 - 4 p q s^{2}}}{2 p s}, s \in (0, 1) .

Now consider the continuous-time gambler’s ruin problem, where the durations between turns are governed by an i.i.d. sequence ${E_{i}}$ of rate r exponentially distributed random variables. Given initial wealth z, the duration of the continuous game equals $\sum_{i = 1}^{D_{z}} E_{i}$ . Because the E_i are independent of D_z, it follows that

E e^{- \sum_{i = 1}^{D_{z}} E_{i}} = E {(E e^{- E_{1}})}^{D_{z}} = E {(\frac{r}{r + 1})}^{D_{z}},

E e^{- \sum_{i = 1}^{D_{z}} E_{i}}

is related to (B.32). The following result proved in Section B.4.1 is needed to prove Lemma B.4.

Lemma B.7.

Let i and q₂ be integers such that $1 \leq i \leq b + 1$ and $0 \leq q_{2} \leq θ_{2}$ , and define

q^{(B, i)} = n - q_{2} - 1 - ⌊ \sqrt{n} β / 2 ⌋ + ⌊ ⌊ \sqrt{n} β / 2 ⌋ (b + 1) ⌋ .

Consider the continuous-time gambler’s ruin problem with probabilities

p = \frac{n λ}{n λ + q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋}, and q = \frac{q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋}{n λ + q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋},

rate

r = n λ + q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋

, initial wealth z and terminal wealth a given by

z = ⌊ \sqrt{n} β / 2 ⌋, and a = ⌊ \sqrt{n} β / 2 ⌋ + ⌊ ⌊ \sqrt{n} β / 2 ⌋ (b + 1) ⌋,

(B.33)

and game duration

\sum_{i = 1}^{D_{z}} E_{i}

. Then

\lim_{n \to \infty} \max_{0 \leq q_{2} \leq 2 ⌊ γ \sqrt{n} ⌋} E e^{- \sum_{i = 1}^{D_{z}} E_{i}} < 1 .

(B.34)

Proof of Lemma B.4.

As discussed below (B.10), ${τ_{C} < τ_{1} (n)} \supset {Γ_{b + 1} < τ_{1} (n)}$ , where $Γ_{b + 1}$ is the sum of b + 1 unit-mean exponentially distributed random variables. The same discussion says that $Γ_{b + 1}$ represents the time needed by the joint CTMC $(Q (t), \tilde{Q} (t))$ to transition from $Θ_{b + 1}^{Q}$ to $Θ_{1}^{Q}$ and to then couple by spending an exponentially distributed amount of time in $Θ_{1}^{Q}$ . Thus,

\min_{\begin{matrix} 0 \leq q_{1} \leq θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \\ q \in S_{Q} \end{matrix}} ℙ (τ_{C} < τ_{1} (n) | Q (0) = q, (Q (0), \tilde{Q} (0)) \in \cup_{i = 1}^{b + 1} Θ_{i}^{Q}) \geq \min_{\begin{matrix} 0 \leq q_{1} \leq θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \\ q \in S_{Q} \end{matrix}} ℙ (Γ_{b + 1} < τ_{1} (n) | Q (0) = q) .

Let us analyze the probability above. At time t = 0, there are q₂ servers with nonempty buffers and another server containing the extra customer in ${\tilde{Q} (t)}$ . We group these $q_{2} + 1$ servers together into group A and the remaining $n - q_{2} - 1$ servers into group B. Let $Q_{1}^{(A)} (t)$ and $Q_{1}^{(B)} (t)$ be the number of busy group A and B servers, respectively. Because

Q_{1}^{(A)} (0) = q_{2} + 1, and Q_{1}^{(A)} (0) + Q_{1}^{(B)} (0) = Q_{1} (0) \leq n - ⌊ \sqrt{n} β / 2 ⌋,

it follows that

Q_{1}^{(B)} (0) \leq n - q_{2} - 1 - ⌊ \sqrt{n} β / 2 ⌋

. We are implicitly assuming that n is large enough so

n - q_{2} - 1 - ⌊ \sqrt{n} β / 2 ⌋ \geq 0

. Note that the buffer of any group B server is empty for all

t \leq τ_{1} (n)

If a customer arrives when more than one server is idle, we prioritize assigning this customer to servers in group B over group A. Note that this tie-breaking rule is consistent with the tie-breaking rule we imposed in the proof of Lemma 4. Let $τ_{B} = \inf_{t \geq 0} {Q_{1}^{(B)} (t) = n - q_{2} - 1}$ be the first time that all servers in group B are busy. By construction, $τ_{B} \leq τ_{1} (n)$ , so

\begin{array}{l} \min_{\begin{matrix} 0 \leq q_{1} \leq θ_{1} \\ 0 \leq q_{2} \leq 2 θ_{2} \\ q \in S_{Q} \end{matrix}} ℙ (Γ_{b + 1} < τ_{1} (n) | Q (0) = q) \geq \min_{\begin{matrix} 0 \leq q_{2} \leq 2 θ_{2} \\ 0 \leq q^{(B)} \leq n - q_{2} - 1 - ⌊ \sqrt{n} β / 2 ⌋ \end{matrix}} ℙ (Γ_{b + 1} < τ_{B} | Q_{1}^{(B)} (0) = q^{(B)}) \\ \geq \min_{0 \leq q_{2} \leq 2 ⌊ γ \sqrt{n} ⌋} ℙ (Γ_{b + 1} < τ_{B} | Q_{1}^{(B)} (0) = n - q_{2} - 1 - ⌊ \sqrt{n} β / 2 ⌋) . \end{array}

The last inequality is true because increasing the value of the initial condition $Q_{1}^{(B)} (0)$ does not increase the chance that $Γ_{b + 1} < τ_{B}$ . We now relate the right-hand side to the moment-generating function considered in Lemma B.7 and use that lemma to conclude the proof. We can write $Γ_{b + 1} = \sum_{i = 1}^{b + 1} G_{i}$ , where G_i are i.i.d. unit-mean exponentially distributed random variables independent of $Q_{1}^{(B)} (t)$ for $t \in [0, τ_{B}]$ because they correspond to service times of the server containing the additional customer in ${\tilde{Q} (t)}$ , which is a server in group A.

Fixing $0 \leq q_{2} \leq 2 θ_{2}$ and $Q_{1}^{(B)} (0) = n - q_{2} - 1 - ⌊ \sqrt{n} β / 2 ⌋$ , for $0 \leq i \leq b + 1$ , we define

\begin{array}{l} q^{(B, i)} = & n - q_{2} - 1 - ⌊ \sqrt{n} β / 2 ⌋ + ⌊ ⌊ \sqrt{n} β / 2 ⌋ \frac{i}{b + 1} ⌋, and \\ τ_{B, i} = & \inf_{t \geq 0} {Q_{1}^{(B)} (t) - Q_{1}^{(B)} (0) = ⌊ ⌊ \sqrt{n} β / 2 ⌋ \frac{i}{b + 1} ⌋} = \inf_{t \geq 0} {Q_{1}^{(B)} (t) = q^{(B, i)}} \end{array}

and note that

τ_{B} = τ_{B, b + 1}

. We are guaranteed that

Γ_{b + 1} < τ_{B}

if for each

1 \leq i \leq b + 1

, the exponentially distributed G_i is smaller than the time it takes for

Q_{1}^{(B)} (t)

to reach

q^{(B, i)}

if started from

q^{(B, i - 1)}

, so

ℙ (Γ_{b + 1} < τ_{B} | Q_{1}^{(B)} (0) = n - q_{2} - 1 - ⌊ \sqrt{n} β / 2 ⌋) \geq \prod_{i = 1}^{b + 1} ℙ (G_{i} < τ_{B, i} | Q_{1}^{(B)} (0) = q^{(B, i - 1)}) .

We now show that $τ_{B, i}$ can be bounded from below by the duration of a gambler’s ruin game, which allows us to apply Lemma B.7. Fix $1 \leq i \leq b + 1$ , and consider the time interval $t \in [0, τ_{B, i}]$ , on which we construct the coupling ${(Q_{1}^{(B)} (t), {\bar{Q}}_{1}^{(B)} (t))}$ by setting

{\bar{Q}}_{1}^{(B, i)} (0) = Q_{1}^{(B)} (0) = q^{(B, i - 1)}

and defining the transitions of the joint process in Tables B.4 and B.5. We implicitly assume that n is large enough that

q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋ > 0

Note that the only time ${\bar{Q}}_{1}^{(B, i)} (t)$ decreases but $Q_{1}^{(B)} (t)$ does not is when the latter is smaller than $q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋$ , so we are guaranteed that

{\bar{Q}}_{1}^{(B, i)} (t) \geq Q_{1}^{(B)} (t), for all t \leq \min {τ_{B, i}, \inf_{t \geq 0} {{\bar{Q}}_{1}^{(B, i)} (t) = q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋}} .

(B.35)

Recalling the definitions of $τ_{B, i}$ and $q^{(B, i)}$ , we have

\begin{array}{l} \min {τ_{B, i}, \inf_{t \geq 0} {{\bar{Q}}_{1}^{(B, i)} (t) = q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋}} \\ = & \min {\inf_{t \geq 0} {Q_{1}^{(B)} (t) = q^{(B, i)}}, \inf_{t \geq 0} {{\bar{Q}}_{1}^{(B, i)} (t) = q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋}} \\ = & \min {\inf_{t \geq 0} {Q_{1}^{(B)} (t) = q^{(B, i - 1)} + ⌊ ⌊ \sqrt{n} β / 2 ⌋ / (b + 1) ⌋}, \inf_{t \geq 0} {{\bar{Q}}_{1}^{(B, i)} (t) = q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋}} \\ \geq & \min {\inf_{t \geq 0} {{\bar{Q}}_{1}^{(B, i)} (t) = q^{(B, i - 1)} + ⌊ ⌊ \sqrt{n} β / 2 ⌋ / (b + 1) ⌋}, \inf_{t \geq 0} {{\bar{Q}}_{1}^{(B, i)} (t) = q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋}}, \end{array}

where the last inequality follows from (B.35). Let

{\bar{τ}}_{B, i}

equal the right-hand side and note that

{\bar{τ}}_{B, i} = \inf_{t \geq 0} {({\bar{Q}}_{1}^{(B, i)} (t) - {\bar{Q}}_{1}^{(B, i)} (0)) \in {- ⌊ \sqrt{n} β / 2 ⌋, ⌊ ⌊ \sqrt{n} β / 2 ⌋ / (b + 1) ⌋}}

because

{\bar{Q}}_{1}^{(B, i)} (0) = q^{(B, i - 1)}

. Because

{\bar{τ}}_{B, i} \leq τ_{B, i}

, it follows that

\min_{0 \leq q_{2} \leq 2 ⌊ γ \sqrt{n} ⌋} \prod_{i = 1}^{b + 1} ℙ (G_{i} < τ_{B, i} | Q_{1}^{(B)} (0) = q^{(B, i - 1)}) \geq \min_{0 \leq q_{2} \leq 2 ⌊ γ \sqrt{n} ⌋} \prod_{i = 1}^{b + 1} ℙ (G_{i} < {\bar{τ}}_{B, i} | Q_{1}^{(B)} (0) = q^{(B, i - 1)}) .

Recall that G_i corresponds to the service time of a group A server and is therefore independent of ${\bar{τ}}_{B, i}$ . Furthermore, because G_i is exponentially distributed with unit mean, conditioning on the value of ${\bar{τ}}_{B, i}$ yields

\begin{array}{l} \min_{0 \leq q_{2} \leq 2 ⌊ γ \sqrt{n} ⌋} ℙ (G_{i} < {\bar{τ}}_{B, i} | {\bar{Q}}_{1}^{(B, i)} (0) = q^{(B, i - 1)}) = \min_{0 \leq q_{2} \leq 2 ⌊ γ \sqrt{n} ⌋} (1 - E (e^{- {\bar{τ}}_{B, i}} | {\bar{Q}}_{1}^{(B, i)} (0) = q^{(B, i - 1)})) \\ = 1 - \max_{0 \leq q_{2} \leq 2 ⌊ γ \sqrt{n} ⌋} E (e^{- {\bar{τ}}_{B, i}} | {\bar{Q}}_{1}^{(B, i)} (0) = q^{(B, i - 1)}) . \end{array}

Applying (B.34) of Lemma B.7 concludes because our construction of ${{\bar{Q}}_{1}^{(B, i)} (t)}$ implies that ${\bar{τ}}_{B, i}$ is the duration of a gambler’s ruin game with initial wealth $z = ⌊ \sqrt{n} β / 2 ⌋$ , terminal wealth $a = ⌊ \sqrt{n} β / 2 ⌋ + ⌊ ⌊ \sqrt{n} β / 2 ⌋ / (b + 1) ⌋$ , rate $n λ + q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋$ , and up-step and down-step probabilities

\frac{n λ}{n λ + q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋} and \frac{q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋}{n λ + q^{(B, i)} - ⌊ \sqrt{n} β / 2 ⌋} . □

Table B.4. Transition Rates in State $(u, \bar{u})$ with $u \geq q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋$

Table B.4. Transition Rates in State $(u, \bar{u})$ with $u \geq q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋$

Rate	Transition
$n λ$	$(u + 1, \bar{u} + 1)$
$q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋$	$(u - 1, \bar{u} - 1)$
$u - (q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋)$	$(u - 1, \bar{u})$

Table B.5. Transition Rates in State $(u, \bar{u})$ with $u < q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋$

Table B.5. Transition Rates in State $(u, \bar{u})$ with $u < q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋$

Rate	Transition
$u$	$(u - 1, \bar{u} - 1)$
$q^{(B, i - 1)} - ⌊ \sqrt{n} β / 2 ⌋ - u$	$(u, \bar{u} - 1)$

B.4.1 Proving the Gambler’s Ruin Result.

We require the following auxiliary lemma.

Lemma B.8.

Assume ${x_{n} \in R}$ is a sequence that converges to $\bar{x}$ . Then

\lim_{n \to \infty} {(1 + \frac{x_{n}}{n})}^{n} \to e^{\bar{x}} .

Proof of Lemma B.8.

Let $f (x) = e^{x}$ and $f_{n} (x) = {(1 + \frac{x}{n})}^{n}$ ; note that for any $n \geq 0$ ,

| f_{n} (x_{n}) - e^{\bar{x}} | \leq | f_{n} (x_{n}) - f_{n} (\bar{x}) | + | f_{n} (\bar{x}) - e^{\bar{x}} | .

From the mean-value theorem, we know that there exists some c_n between x_n and $\bar{x}$ such that

| f_{n} (x_{n}) - f_{n} (\bar{x}) | \leq | x_{n} - \bar{x} | f_{n}^{'} (c_{n}) = | x_{n} - \bar{x} | {(1 + \frac{c_{n}}{n})}^{n - 1} .

Because $x_{n} \to \bar{x}$ , it follows that ${(1 + c_{n} / n)}^{n - 1} \leq {(1 + 2 | \bar{x} | / n)}^{n - 1}$ for n large enough; therefore,

| f_{n} (x_{n}) - e^{\bar{x}} | \leq | x_{n} - \bar{x} | {(1 + \frac{2 | \bar{x} |}{n})}^{n - 1} + | f_{n} (\bar{x}) - e^{\bar{x}} | .

We can make the right-hand side arbitrarily small by increasing n. □

Proof of Lemma B.7.

Recall that $E e^{- \sum_{i = 1}^{D_{z}} E_{i}} = E {(r / (r + 1))}^{D_{z}}$ and that

E s^{D_{z}} = \frac{λ_{1}^{a} (s) λ_{2}^{z} (s) - λ_{1}^{z} (s) λ_{2}^{a} (s)}{λ_{1}^{a} (s) - λ_{2}^{a} (s)} + \frac{λ_{1}^{z} (s) - λ_{2}^{z} (s)}{λ_{1}^{a} (s) - λ_{2}^{a} (s)} = \frac{λ_{2}^{z} (s) (λ_{1}^{a} (s) - 1) - λ_{1}^{z} (s) (λ_{2}^{a} (s) - 1)}{λ_{1}^{a} (s) - λ_{2}^{a} (s)},

where

λ_{1} (s) = \frac{1 + \sqrt{1 - 4 p q s^{2}}}{2 p s} and λ_{2} (s) = \frac{1 - \sqrt{1 - 4 p q s^{2}}}{2 p s}, s \in (0, 1) .

Fix $s = r / (r + 1)$ . To show that $\lim_{n \to \infty} E s^{D_{z}} < 1$ , we derive expressions for $\lim_{n \to \infty} λ_{j}^{z} (s)$ and $\lim_{n \to \infty} λ_{j}^{a} (s)$ . For notational economy, we let $θ_{3} = ⌊ \sqrt{n} β / 2 ⌋$ . We can write p and q as

p = \frac{n λ}{n λ + q^{(B, i)} - θ_{3}} = \frac{1}{2} + \frac{1}{2} \frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}}, q = \frac{1}{2} - \frac{1}{2} \frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}},

and

p q = \frac{1}{4} - \frac{1}{4} {(\frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}})}^{2} .

Let us first consider $λ_{1} (s)$ , which satisfies

\begin{array}{l} λ_{1} (s) = & (1 + \sqrt{1 - s^{2} + {(\frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}})}^{2} s^{2}}) s^{- 1} {(1 + \frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}})}^{- 1} \\ = & (1 + \frac{1}{\sqrt{n}} [\sqrt{n (1 - s^{2}) + n {(\frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}})}^{2} s^{2}}]) s^{- 1} {(1 + \frac{1}{\sqrt{n}} [\sqrt{n} \frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}}])}^{- 1} . \end{array}

(B.36)

We now show that the terms inside the square brackets have limits $\bar{x}, \bar{y} \in R$ as $n \to \infty$ ; that is,

\lim_{n \to \infty} \sqrt{n (1 - s^{2}) + n {(\frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}})}^{2} s^{2}} = \bar{x} and \lim_{n \to \infty} \sqrt{n} \frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}} = \bar{y} .

(B.37)

Note that $\lim_{n \to \infty} s^{2} = 1$ ; recall the definition of r to see that $\lim_{n \to \infty} r / n = 1 + λ$ , so

\lim_{n \to \infty} n (1 - s^{2}) = \lim_{n \to \infty} \frac{n (2 r + 1)}{1 + 2 r + r^{2}} = \lim_{n \to \infty} \frac{(2 r + 1) / n}{(1 + 2 r + r^{2}) / n^{2}} = \lim_{n \to \infty} \frac{2}{r / n} = \frac{2}{1 + λ} .

Furthermore, recalling the definition of $q^{(B, i)}$ , we have

\begin{array}{l} \lim_{n \to \infty} n {(\frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}})}^{2} = & \lim_{n \to \infty} n {(\frac{- β \sqrt{n} + q_{2} + 1 + 2 ⌊ \sqrt{n} β / 2 ⌋ - ⌊ ⌊ \sqrt{n} β / 2 ⌋ \frac{i - 1}{b + 1} ⌋}{n λ + n - q_{2} - 1 - 2 ⌊ \sqrt{n} β / 2 ⌋ + ⌊ ⌊ \sqrt{n} β / 2 ⌋ \frac{i - 1}{b + 1} ⌋})}^{2} \\ = & {(\frac{\lim_{n \to \infty} q_{2} / \sqrt{n} - \frac{i - 1}{b + 1} β / 2}{λ + 1})}^{2} . \end{array}

(B.38)

We know that $\lim_{n \to \infty} q_{2} / \sqrt{n}$ exists because q₂ is fixed between zero and $2 ⌊ γ \sqrt{n} ⌋$ . This proves (B.37). Recall that $z = ⌊ \sqrt{n} β / 2 ⌋$ and $a = ⌊ \sqrt{n} β / 2 ⌋ + ⌊ ⌊ \sqrt{n} β / 2 ⌋ \frac{1}{b + 1} ⌋$ . Because $r / n \to 1 + λ$ , it follows that

\lim_{n \to \infty} s^{a} = \lim_{n \to \infty} {(1 - 1 / (r + 1))}^{⌊ \sqrt{n} β / 2 ⌋ + ⌊ ⌊ \sqrt{n} β / 2 ⌋ / (b + 1) ⌋} = 1 and \lim_{n \to \infty} s^{z} = \lim_{n \to \infty} s^{⌊ \sqrt{n} β / 2 ⌋} = 1;

combined with (B.36), (B.37), and Lemma B.8, this implies that

\begin{array}{l} \lim_{n \to \infty} λ_{1}^{z} (s) = & \lim_{n \to \infty} λ_{1}^{⌊ \sqrt{n} β / 2 ⌋} (s) = \exp (\frac{\bar{x} β}{2}) \exp (- \frac{\bar{y} β}{2}); \\ \lim_{n \to \infty} λ_{1}^{a} (s) = & \lim_{n \to \infty} λ_{1}^{⌊ \sqrt{n} β / 2 ⌋ + ⌊ ⌊ \sqrt{n} β / 2 ⌋ \frac{1}{b + 1} ⌋} (s) = \exp (\frac{\bar{x} β}{2} \frac{b + 2}{b + 1}) \exp (- \frac{\bar{y} β}{2} \frac{b + 2}{b + 1}) . \end{array}

The expressions for $\lim_{n \to \infty} λ_{2}^{z} (s)$ and $\lim_{n \to \infty} λ_{2}^{a} (s)$ follow similarly. Comparing

λ_{2} (s) = (1 - \sqrt{1 - s^{2} + {(\frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}})}^{2} s^{2}}) s^{- 1} {(1 + \frac{n λ - (q^{(B, i)} - θ_{3})}{n λ + q^{(B, i)} - θ_{3}})}^{- 1}

to the form of

λ_{1} (s)

in (B.36), we see that we can use (B.37) and Lemma B.8 again to conclude that

\begin{array}{l} \lim_{n \to \infty} λ_{2}^{z} (s) = & \exp (- \frac{\bar{x} β}{2}) \exp (- \frac{\bar{y} β}{2}), and \\ \lim_{n \to \infty} λ_{2}^{a} (s) = & \exp (- \frac{\bar{x} β}{2} \frac{b + 2}{b + 1}) \exp (- \frac{\bar{y} β}{2} \frac{b + 2}{b + 1}) . \end{array}

For convenience, we define $x = (\bar{x} - \bar{y}) β / 2$ and $y = (\bar{x} + \bar{y}) β / 2$ , so that

\lim_{n \to \infty} λ_{1}^{z} (s) = e^{x}, \lim_{n \to \infty} λ_{1}^{a} (s) = e^{x (b + 2) / (b + 1)}, \lim_{n \to \infty} λ_{2}^{z} (s) = e^{- y}, \lim_{n \to \infty} λ_{1}^{a} (s) = e^{- y (b + 2) / (b + 1)} .

It is straightforward to check that $x, y > 0$ using (B.37). Let us now prove that $\lim_{n \to \infty} E s^{D_{z}} < 1$ . Using the definition of $E s^{D_{z}}$ , we have

\begin{array}{l} \lim_{n \to \infty} E s^{D_{z}} = & \lim_{n \to \infty} \frac{λ_{2}^{z} (s) (λ_{1}^{a} (s) - 1) - λ_{1}^{z} (s) (λ_{2}^{a} (s) - 1)}{λ_{1}^{a} (s) - λ_{2}^{a} (s)} \\ = & \frac{e^{- y} (e^{x (b + 2) / (b + 1)} - 1) - e^{x} (e^{- y (b + 2) / (b + 1)} - 1)}{e^{x (b + 2) / (b + 1)} - e^{- y (b + 2) / (b + 1)}} . \end{array}

Set $c = (b + 2) / (b + 1)$ . We want to show that for any $x, y > 0$ ,

e^{- y} (e^{x c} - 1) - e^{x} (e^{- y c} - 1) < e^{x c} - e^{- y c}, or e^{- y} e^{xc} {− e}^{x} e^{− y c} < e^{x c} {− e}^{− y c} {+ e}^{− y} {− e}^{x} .

Rearranging terms, this is equivalent to

e^{x c} (e^{- y} - 1) - e^{x} (e^{- y c} - 1) < - e^{- y c} + e^{- y} .

Fix $y > 0$ and treat the left-hand side as a function of x. Both sides are equal when x = 0, so it suffices to show that the derivative of the left-hand side with respect to x is negative. Now

\frac{\partial}{\partial x} (e^{x c} (e^{- y} - 1) - e^{x} (e^{- y c} - 1)) = c e^{x c} (e^{- y} - 1) - e^{x} (e^{- y c} - 1) .

(B.39)

For the right-hand side to be negative, we must have

c e^{x (c - 1)} > \frac{1 - e^{- y c}}{1 - e^{- y}} .

Because $c = (b + 2) / (b + 1) > 1$ , the left-hand side is bounded from below by c provided that $x \geq 0$ . The right-hand side converges to c as $y ↓ 0$ , so we must show that the derivative of the right-hand side is negative. Differentiating yields

\frac{\partial}{\partial y} \frac{1 - e^{- y c}}{1 - e^{- y}} = \frac{c e^{- y c} (1 - e^{- y}) - e^{- y} (1 - e^{- y c})}{{(1 - e^{- y})}^{2}} = e^{- y} \times \frac{c e^{- y (c - 1)} - c e^{- y c} - 1 + e^{- y c}}{{(1 - e^{- y})}^{2}} .

The numerator $c e^{- y (c - 1)} - (c - 1) e^{- y (c - 1)} - 1$ equals zero when y = 0. Its derivative equals

- c (c - 1) e^{- y (c - 1)} + {(c - 1)}^{2} e^{- y c} < - c (c - 1) e^{- y (c - 1)} + {(c - 1)}^{2} e^{- y (c - 1)} \frac{c}{c - 1} = 0, y \geq 0,

where the inequality is due to

e^{- y} \leq 1 < c / (c - 1)

. Therefore, the numerator is strictly negative for y > 0, meaning that (B.39) holds. □

B.5. Proof of Lemma 8

It suffices to show that $E τ^{-} (x_{1}^{q}) \leq C (β) δ$ because

ℙ (V \leq τ^{-} (x_{1}^{q})) = \int_{0}^{\infty} ℙ (V \leq t) d F (t) = \int_{0}^{\infty} (1 - e^{- t}) d F (t) = 1 - E e^{- τ^{-} (x_{1}^{q})} \leq E τ^{-} (x_{1}^{q}),

where F(t) is the distribution function of

τ^{-} (x_{1}^{q})

. Define

τ^{+} (q_{1}) = \inf_{t \geq 0} {Q (t) = (q_{1} + 1, 0, \dots, 0) | Q (0) = (q_{1}, 0, \dots, 0)}, 0 \leq q_{1} \leq n - 1,

and note that

τ^{+} (q_{1}) = τ^{-} (x_{1}^{q})

. If we let

{π_{q}}_{q \in S_{Q}}

be the stationary distribution of the unscaled CTMC, it follows from (2.11) of Brown and Xia (2001) that

E τ^{+} (q_{1}) = \frac{\sum_{i = 0}^{q_{1}} π_{i, 0, \dots, 0}}{n λ π_{q_{1}, 0, \dots, 0}} .

Letting $f (x^{q}) = 1 (x_{1}^{q} \leq i)$ and using $E G_{X} f (X) = 0$ yields $n λ π_{i, 0, \dots, 0} = (i + 1) π_{i + 1, 0, \dots, 0}$ , which implies that $π_{i, 0, \dots, 0} = π_{0, \dots, 0} {(n λ)}^{i} / i!$ , so

E τ^{+} (q_{1}) = \frac{\sum_{k = 0}^{q_{1}} \frac{{(n λ)}^{k}}{k!}}{n λ \frac{{(n λ)}^{q_{1}}}{q_{1}!}} = \frac{q_{1}!}{{(n λ)}^{q_{1} + 1}} \sum_{k = 0}^{q_{1}} \frac{{(n λ)}^{k}}{k!} .

Note that $x_{1}^{q} = β + δ (n λ - ⌊ n λ ⌋)$ is equivalent to $q_{1} = ⌊ n λ ⌋$ . If $⌊ n λ ⌋ = 0$ , we observe that the right-hand side equals $1 / (n λ)$ , which verifies (31) when $x_{1}^{q} = β + δ (n λ - ⌊ n λ ⌋)$ . If, however, $⌊ n λ ⌋ > 0$ , we may use Stirling’s approximation to see that for $q_{1} > 0$ ,

\frac{q_{1}!}{{(n λ)}^{q_{1} + 1}} \sum_{k = 0}^{q_{1}} \frac{{(n λ)}^{k}}{k!} \leq \frac{3 q_{1}^{q_{1} + 1 / 2} e^{- q_{1}}}{{(n λ)}^{q_{1} + 1}} \sum_{k = 0}^{q_{1}} \frac{{(n λ)}^{k}}{k!} \leq \frac{3 q_{1}^{q_{1} + 1 / 2} e^{- q_{1}}}{{(n λ)}^{q_{1} + 1}} e^{n λ} .

Setting $q_{1} = ⌊ n λ ⌋$ proves (31) when $x_{1} = β + δ (n λ - ⌊ n λ ⌋)$ . To prove (31) when $x_{1}^{q} = δ$ and $x_{1}^{q} = 2 δ$ requires just a little more work. Setting $q_{1} = n - 1$ ,

E τ_{n - 1}^{+} \leq \frac{3 {(n - 1)}^{n - 1 / 2} e^{- (n - 1)}}{{(n λ)}^{n}} e^{n λ} \leq \frac{3 e}{\sqrt{n - 1}} \frac{n^{n}}{{(n - β \sqrt{n})}^{n}} e^{- n} e^{n λ} = \frac{3 e}{\sqrt{n - 1}} {(1 - \frac{β}{\sqrt{n}})}^{- n} e^{- β \sqrt{n}} .

To conclude, we need to bound

{({(1 - \frac{β}{\sqrt{n}})}^{- \sqrt{n}} e^{- β})}^{\sqrt{n}} = {(\exp (- \sqrt{n} \log (1 - \frac{β}{\sqrt{n}}) - β))}^{\sqrt{n}} .

Using Taylor expansion,

\log (1 - \frac{β}{\sqrt{n}}) = - \frac{β}{\sqrt{n}} - \frac{1}{2} {(\frac{β}{\sqrt{n}})}^{2} \frac{1}{{(1 + ξ (β / \sqrt{n}))}^{2}},

where

ξ (β / \sqrt{n}) \in [- β / \sqrt{n}, 0]

. Therefore,

{(\exp (- \sqrt{n} \log (1 - \frac{β}{\sqrt{n}}) - β))}^{\sqrt{n}} = \exp (\frac{β^{2} / 2}{{(1 + ξ (β / \sqrt{n}))}^{2}}),

and we conclude that

\sup_{n \geq 0} {({(1 - \frac{β}{\sqrt{n}})}^{- \sqrt{n}} e^{- β})}^{\sqrt{n}} < \infty .

The argument when $q_{1} = n - 2$ is identical. This proves (31) when $x_{1} = δ, 2 δ$ . □

References

Atar R (2012) A diffusion regime with nondegenerate slowdown. Oper. Res. 60(2):490–500.Link, Google Scholar
Banerjee S, Mukherjee D (2019) Join-the-shortest queue diffusion limit in Halfin-Whitt regime: Tail asymptotics and scaling of extrema. Ann. Appl. Probab. 29(2):1262–1309.Google Scholar
Banerjee S, Mukherjee D (2020) Join-the-shortest Queue diffusion limit in Halfin-Whitt regime: Sensitivity on the heavy-traffic parameter. Ann. Appl. Probab. 30(1):80–144.Google Scholar
Barbour AD (1988) Stein’s method and Poisson process convergence. J. Appl. Probab. 25:175–184.Google Scholar
Barbour A (1990) Stein’s method for diffusion approximations. Probab. Theory Related Fields 84(3):297–322.Google Scholar
Braverman A (2020) Steady-state analysis of the join the shortest queue model in the Halfin-Whitt regime. Math. Oper. Res. 45(3):1069–1103.Link, Google Scholar
Braverman A (2022) The prelimit generator comparison approach of Stein’s method. Stochast. Systems 12(2):181–204.Link, Google Scholar
Braverman A, Dai JG (2017) Stein’s method for steady-state diffusion approximations of $M / P h / n + M$ systems. Ann. of Appl. Probab. 27(1):550–581.Google Scholar
Brown TC, Xia A (2001) Stein’s method and birth-death processes. Ann. Probab. 29(3):1373–1403.Google Scholar
Cao P, He S, Huang J, Liu Y (2021) To pool or not to pool: Queueing design for large-scale service systems. Oper. Res. 69(6):1866–1885.Link, Google Scholar
Erdogdu MA, Mackey L, Shamir O (2018) Global non-convex optimization with discretized diffusions. Preprint, submitted October 29, https://arxiv.org/abs/1810.12361v1.Google Scholar
Eryilmaz A, Srikant R (2012) Asymptotically tight steady-state queue length bounds implied by drift conditions. Queueing Systems 72(3-4):311–359.Google Scholar
Eschenfeldt P, Gamarnik D (2018) Join the shortest queue with many servers: The heavy-traffic asymptotics. Math. Oper. Res. 43(3):867–886.Link, Google Scholar
Fang X, Shao QM, Xu L (2018) Multivariate approximations in Wasserstein distance by Stein’s method and Bismut’s formula. Preprint, submitted January 24, https://arxiv.org/abs/1801.07815.Google Scholar
Feller W (1968) An Introduction to Probability Theory and Its Applications, vol. I, 3rd ed. (John Wiley & Sons Inc., New York).Google Scholar
Gast N (2017) Expected values estimated via mean-field approximation are 1/n-accurate. Proc. ACM Measurement Anal. Comput. Syst. 1(1):1–26.Google Scholar
Gast N, Van Houdt B (2017) A refined mean field approximation. Proc. ACM Measurement Anal. Comput. Syst. 1(2):1–28.Google Scholar
Gast N, Bortolussi L, Tribastone M (2019) Size expansions of mean field approximation: Transient and steady-state analysis. Performance Evaluation 129:60–80.Google Scholar
Gaunt RE, Walton N (2020) Stein’s method for the single server queue in heavy traffic. Statist. Probab. Lett. 156:108566.Google Scholar
Götze F (1991) On the rate of convergence in the multivariate CLT. Ann. Probab. 19(2):724–739.Google Scholar
Gupta V, Walton N (2019) Load balancing in the nondegenerate slowdown regime. Oper. Res. 67(1):281–294.Link, Google Scholar
Gurvich I (2014) Diffusion models and steady-state approximations for exponentially ergodic Markovian queues. Ann. Appl. Probab. 24(6):2527–2559.Google Scholar
Hairi LX, Ying L (2021) Beyond scaling: Calculable error bounds of the power-of-two-choices mean-field model in heavy-traffic. Proc. Twenty-Second Internat. Sympos. Theory, Algorithmic Foundations, Protocol Design Mobile Networks Mobile Comput. (Association for Computing Machinery, New York), 1–10.Google Scholar
Halfin S, Whitt W (1981) Heavy-traffic limits for queues with many exponential servers. Oper. Res. 29(3):567–588.Link, Google Scholar
Hurtado-Lange D, Maguluri ST (2022) A load balancing system in the many-server heavy-traffic asymptotics. Queueing Systems Theory Appl. 101(3–4):353–391.Google Scholar
Jin X, Pang G, Xu L, Xu X (2022) An approximation to the invariant measure of the limiting diffusion of G/Ph/n+GI queues in the Halfin-Whitt regime and related asymptotics. Preprint, submitted September 15, https://arxiv.org/abs/2209.07361.Google Scholar
Kallenberg O (2001) Foundations of Modern Probability, Springer Series in Statistics, Probability and Its Applications, 2nd ed. (Springer, New York).Google Scholar
Liu X, Ying L (2019) A simple steady-state analysis of load balancing algorithms in the sub-halfin-whitt regime. SIGMETRICS Perform. Eval. Rev. 46(2):15–17.Google Scholar
Liu X, Ying L (2020) Steady-state analysis of load-balancing algorithms in the Sub-Halfin-Whitt regime. J. Appl. Probab. 57(2):578–596.Google Scholar
Liu X, Gong K, Ying L (2022) Steady-state analysis of load balancing with coxian-2 distributed service times. Naval Res. Logist. 69(1):57–75.Google Scholar
Lu Y (2021) On a stein method based approximation for a two-dimensional Markov chain. Preprint, submitted June 6, https://arxiv.org/abs/2106.03145.Google Scholar
Mackey L, Gorham J (2016) Multivariate Stein factors for a class of strongly log-concave distributions. Electronic Comm. Probab. 21:1–14.Google Scholar
Mitzenmacher M (2001) The power of two choices in randomized load balancing. IEEE Trans. Parallel Distributed Systems 12(10):1094–1104.Google Scholar
Mukherjee D, Borst SC, van Leeuwaarden JSH, Whiting PA (2016) Universality of load balancing schemes on the diffusion scale. J. Appl. Probab. 53(4):1111–1124.Google Scholar
Ross N (2011) Fundamentals of Stein’s method. Probab. Surveys 8:210–293.Google Scholar
Stein C (1972) Probability theory: A bound for the error in the normal approximation to the distribution of a sum of dependent random variables, vol. 2, Proc. Sixth Berkeley Sympos. Math. Statist. Probab. (University of California Press, Berkeley), 583–602.Google Scholar
Stolyar AL (2015) Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems 80(4):341–361.Google Scholar
van der Boor M, Borst SC, van Leeuwaarden JSH, Mukherjee D (2021) Scalable load balancing in networked systems: A survey of recent advances. Preprint, submitted November 4, https://arxiv.org/abs/1806.05444.Google Scholar
Vvedenskaya N, Dobrushin R, Karpelevich F (1996) Queueing system with selection of the shortest of two queues: An asymptotic approach. Problems Inform. Transmission 32(1):15–27.Google Scholar
Weber RR (1978) On the optimal assignment of customers to parallel servers. J. Appl. Probab. 15(2):406–413.Google Scholar
Winston W (1977) Optimality of the shortest line discipline. J. Appl. Probab. 14(1):181–189.Google Scholar
Ying L (2017) Stein’s method for mean field approximations in light and heavy traffic regimes. Proc. ACM Measurement Anal. Comput. Systems 1(1):1–27.Google Scholar
Zhao Z, Banerjee S, Mukherjee D (2021) Many-server asymptotics for join-the-shortest queue in the super-Halfin-Whitt scaling window. Preprint submitted May 31, https://arxiv.org/abs/2106.00121.Google Scholar
Zhou X, Shroff N (2020a) A note on load balancing in many-server heavy-traffic regime. Preprint, submitted April 20, https://arxiv.org/abs/2004.09574v1.Google Scholar
Zhou X, Shroff N (2020b) A note on Stein’s method for heavy-traffic analysis. Preprint, submitted Mar 13, https://arxiv.org/abs/2003.06454v1.Google Scholar

Volume 13, Issue 1

March 2023

Pages 1-180

Article Information

Metrics

Information

Received:February 06, 2022
Accepted:August 26, 2022
Published Online:November 17, 2022

Cite as

Anton Braverman (2022) The Join-the-Shortest-Queue System in the Halfin-Whitt Regime: Rates of Convergence to the Diffusion Limit. Stochastic Systems 13(1):1-39.

https://doi.org/10.1287/stsy.2022.0102

Keywords

PDF download

Available Issues

Available Issues

Available Issues

The Join-the-Shortest-Queue System in the Halfin-Whitt Regime: Rates of Convergence to the Diffusion Limit

Abstract

1. Introduction

1.1. Literature Review

1.2. Notation

2. Main Result

2.1. Proving Theorem 1

3. Stein Factor Bounds

3.1. First-Order Differences

3.2. Higher-Order Bounds

3.2.1. Bounding $\sum_{i = 1}^{b + 1} E X_{i}$ .

3.2.2 Third-Order Bounds.

3.2.3. Proving Lemmas 6 and 7.

4. Conclusion

Appendix A. Supporting Proofs for Section 2

A.1. The Interpolator A

A.2. Proving Proposition 2

A.2.1. Bounding $ε_{2} (Y)$ through $ε_{4} (Y)$ .

A.2.2. Bounding $ε_{1} (Y)$ .

A.2.3. Proving Lemma A.2

Appendix B. Supporting Proofs for Section 3

B.1. Proving Lemma B.1

B.2. Proving Lemma B.12

B.2.1. Proving Lemma B.5.

B.2.2. Proof of Lemma B.6.

B.3. Proof of Lemma B.3

B.4. Proving Lemma B.4

B.4.1 Proving the Gambler’s Ruin Result.

B.5. Proof of Lemma 8

References

Volume 13, Issue 1

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News

Available Issues

Available Issues

The Join-the-Shortest-Queue System in the Halfin-Whitt Regime: Rates of Convergence to the Diffusion Limit

Abstract

1. Introduction

1.1. Literature Review

1.2. Notation

2. Main Result

2.1. Proving Theorem 1

3. Stein Factor Bounds

3.1. First-Order Differences

3.2. Higher-Order Bounds

3.2.1. Bounding ∑i=1b+1EXi.

3.2.2 Third-Order Bounds.

3.2.3. Proving Lemmas 6 and 7.

4. Conclusion

Appendix A. Supporting Proofs for Section 2

A.1. The Interpolator A

A.2. Proving Proposition 2

A.2.1. Bounding ε2(Y) through ε4(Y).

A.2.2. Bounding ε1(Y).

A.2.3. Proving Lemma A.2

Appendix B. Supporting Proofs for Section 3

B.1. Proving Lemma B.1

B.2. Proving Lemma B.12

B.2.1. Proving Lemma B.5.

B.2.2. Proof of Lemma B.6.

B.3. Proof of Lemma B.3

B.4. Proving Lemma B.4

B.4.1 Proving the Gambler’s Ruin Result.

B.5. Proof of Lemma 8

References

Volume 13, Issue 1

Article Information

Metrics

Information

Cite as

Keywords

3.2.1. Bounding $\sum_{i = 1}^{b + 1} E X_{i}$ .

A.2.1. Bounding $ε_{2} (Y)$ through $ε_{4} (Y)$ .

A.2.2. Bounding $ε_{1} (Y)$ .