Open Access

Generalized Max-Weight Policies in Stochastic Matching

Matthieu Jonckheere
Matthieu Jonckheere
[email protected]
https://orcid.org/0000-0003-3576-5866
LAAS-CNRS (Laboratoire d’analyse et d’architecture des systèmes)-(Centre national de la recherche scientifique), 31400 Toulouse, France;
Search for more papers by this author
,
Pascal Moyal
Pascal Moyal
[email protected]
https://orcid.org/0000-0002-6638-5551
Faculté des Sciences et Technologies, IECL/INRIA PASTA (Institut Élie Cartan de Lorraine)/(Institut national de recherche en sciences et technologies du numérique, Processus Aléatoires Spatio-Temporels et Applications), Site de Nancy, Faculté des Sciences et Technologies Campus, 54506 Vandoeuvre-lès-Nancy, France;
Search for more papers by this author
,
Claudia Ramírez
Claudia Ramírez
[email protected]
Aristas S.R.L., CABA (Ciudad Autónoma de Buenos Aires) C1414CLO, Argentina;
Search for more papers by this author
,
Nahuel Soprano-Loto
Corresponding Author
Nahuel Soprano-Loto
[email protected]
https://orcid.org/0000-0002-5120-3903
Aristas S.R.L., CABA (Ciudad Autónoma de Buenos Aires) C1414CLO, Argentina;Departamento de Matemática, Facultad de Ingeniería, Universidad de Buenos Aires, CABA C1063ACV, Argentina
Search for more papers by this author

LAAS-CNRS (Laboratoire d’analyse et d’architecture des systèmes)-(Centre national de la recherche scientifique), 31400 Toulouse, France;

Search for more papers by this author

Pascal Moyal

[email protected]

https://orcid.org/0000-0002-6638-5551

Faculté des Sciences et Technologies, IECL/INRIA PASTA (Institut Élie Cartan de Lorraine)/(Institut national de recherche en sciences et technologies du numérique, Processus Aléatoires Spatio-Temporels et Applications), Site de Nancy, Faculté des Sciences et Technologies Campus, 54506 Vandoeuvre-lès-Nancy, France;

Search for more papers by this author

Claudia Ramírez

[email protected]

Aristas S.R.L., CABA (Ciudad Autónoma de Buenos Aires) C1414CLO, Argentina;

Search for more papers by this author

Nahuel Soprano-Loto

Corresponding Author

Nahuel Soprano-Loto

[email protected]

https://orcid.org/0000-0002-5120-3903

Aristas S.R.L., CABA (Ciudad Autónoma de Buenos Aires) C1414CLO, Argentina;Departamento de Matemática, Facultad de Ingeniería, Universidad de Buenos Aires, CABA C1063ACV, Argentina

Search for more papers by this author

Published Online:1 Aug 2022https://doi.org/10.1287/stsy.2022.0098

Abstract

We consider a matching system where items arrive one by one at each node of a compatibility network according to Poisson processes and depart from it as soon as they are matched to a compatible item. The matching policy considered is a generalized max-weight policy where decisions can be noisy. Additionally, some of the nodes may have impatience, that is, leave the system before being matched. Using specific properties of the max-weight policy, we construct a simple quadratic Lyapunov function. This allows us to establish stability results, to prove exponential convergence speed toward the stationary measure, and to give explicit bounds for the stationary mean of the largest queue size in the system. We finally illustrate some of these results using simulations on toy examples.

Funding: N. Soprano-Loto is partially supported by Aristas, MathAmSud [Grant RSPSM (Random Structures and Processes in Statistical Mechanics)], and ANR (Agence Nationale de la Recherche) [Grant PRC (Projets de Recherche Collaborative) MATCHES (CE40)].

1. Introduction

The so-called bipartite stochastic matching model was introduced in Caldentey et al. (2009) (see also Adan and Weiss (2012) and Adan et al. (2017)) as a variant of skill-based service systems. In this model, two infinite ordered sequences (a sequence of servers and a sequence of customers) are considered, and customer/server are matched in first come, first served order, that is, from left to right, according to compatibilities that are prescribed by a given bipartite graph. In the so-called extended bipartite matching model (Bušić et al. 2013), customer/server couples enter the system at each time point. A matching occurs when an arriving customer (respectively, server) finds a compatible server (respectively, customer) present in the system, in which case both leave the system instantaneously. Otherwise, items remain in the system waiting for compatible arrivals (in particular, there are no service times). The field of applications of such models is very wide, from housing to job applications, from transplant protocols to blood banks, peer-to-peer systems to dating websites, ride sharing, and so on. The mathematical settings of such models are the following: there is a set C of customer classes and a set S of server classes, and a bipartite graph on the bipartition $C \cup S$ indicates the compatibilities: a customer class $c \in C$ and a server class $s \in S$ are compatible if and only if there is an edge between nodes c and s in the compatibility graph. Customer/server couples enter the system in discrete time (or are read one by one from left to right, in the prior models), and the classes of the couples in the order of arrivals (respectively, read from left to right) are independent and identically distributed (i.i.d.) from a measure μ on C × S. In Caldentey et al. (2009), Adan and Weiss (2012), and Adan et al. (2017), it is assumed that $μ ≔ μ_{C} \otimes μ_{S}$ on C × S; that is, the class of the incoming customer and server are independent of one another, whereas this assumption is relaxed in Bušić et al. (2013). In case of multiple compatibilities, it is the role of the matching policy to determine what match is performed. In Adan et al. (2017), it is shown that, under a natural stability condition, the stationary probability of the system under first come, first served has a remarkable product form. Interestingly, this result can then be adapted to various skill-based queueing models as well and in particular those applying (various declinations of) the so-called first come first matched (FCFM)-assign the longest idling server (ALIS) service discipline (Adan and Weiss 2014). For various extensions of bipartite matching models, see also Adan et al. (2018). In Bušić et al. (2013), the stability condition in Adan et al. (2017) is shown to be also sufficient for the stability of extended bipartite models ruled by the “match the longest” policy, consisting of always matching incoming items to the compatible class having the longest queue. In Moyal et al. (2018), a coupling from the past result is obtained, showing the existence of a unique bi-infinite matching in various cases of extended bipartite matching models and for a broader class of matching policies than FCFM, thereby generalizing various results of Adan et al. (2017).

In many practical systems, it is often more realistic to assume that arrivals are simple instead of pairwise. Moreover, the bipartition of the compatibility graph into classes of servers and customers only suits applications in which this bipartition is natural (donor/receiver, house/applicant, job/applicant, etc.). However, in many cases, the context requires that the compatibility graph take a general (i.e., not necessarily bipartite) form. For instance, in dating websites, it is a priori not possible to split items into two sets of classes with no possible matches within those sets. Similarly, in kidney exchange programs, intraincompatible donor/receiver couples enter the system looking for a compatible couple to perform a crossed transplant. Then, it is convenient to represent donor/receiver couples as single items, and compatibility between couples means that a kidney exchange can be performed between the two couples (the donor of the first couple can give to the receiver of the second, and the donor of the second can give to the receiver of the first). In particular, if one considers blood types as a primary compatibility criterion, the compatibility graph between couples is naturally nonbipartite. Motivated by such applications, among others, Mairesse and Moyal (2016) proposed a generalization of the previous matching models, termed general stochastic matching model, in which items enter one by one, and the compatibility graph G is general. For a given compatibility graph G and a given matching policy $Φ$ , the stability region Stab $(G, Φ)$ of this system is defined as the set of those measures μ deciding the class of the incoming items, such that the Markov chain describing the evolution of the system is positive recurrent. Mairesse and Moyal (2016) identified a universal necessary condition for stability, that is, a set Ncond(G) that includes Stab $(G, Φ)$ for any matching policy $Φ$ and any graph G. Following Bušić et al. (2013), Mairesse and Moyal (2016) also showed that match the longest is maximal stable; that is, it has a maximal stability region, as the two sets Ncond(G) and Stab $(G, Φ)$ coincide. Moyal and Perry (2017) then showed that, aside for a particular class of graphs, the random class-uniform policy (choosing the match in a compatible class of the incoming item, uniformly at random among those having a nonempty queue) is never maximal stable and that priority policies are in general not maximal stable. Then, Moyal et al. (2020) showed that the policy first come, first matched is maximal stable and that, similarly to Adan et al. (2017), the stationary probability of the system can be written in a product form. Various extensions of the general stochastic matching model can also be found in Büke and Chen (2015, 2017) regarding systems with bipartite complete compatibility graphs but in which each match can be enacted with a nominal probability, and in Rahmé and Moyal (2019) to stochastic matching systems on hypergraphs, thereby matching items by groups of two or more. In a recent line of study, stochastic matching models have also been studied from the point of view of stochastic optimization (Gurvich and Ward 2014, Bušić and Meyn 2016, Nazari and Stolyar 2019). In particular, the max-weight policy introduced in Tassiulas and Ephremides (1992) is a natural policy allowing a natural tradeoff between efficiency and stability.

Here we consider a matching system where items arrive one by one at each node of a compatibility network according to Poisson processes and depart from it as soon as they are matched to a compatible item. The matching policy considered is a generalized max-weight policy where decisions can be subject to noise. Additionally, some of the nodes may have impatience, that is, leave the system before being matched.

1.1. Motivation

In many applications, reneging of items needs to be taken into account: for instance, applicants may renege from the systems, whereas jobs/houses/university spots may no longer be available. Similarly, in organ transplants, patients may renege from the system because of death or change in their diagnosis, whereas organs have a short life duration outside a body and need to be transplanted within a very short period of time—see a first approach of such systems in Boxma et al. (2011).

The present work is dedicated to an extension of general stochastic matching models to the case where (i) classes of items can be impatient, in that they renege from the system if they do not find a match within a given (random) period of time, and (ii) the matching policy is of the max-weight class, with possible errors in the assessment of the system state.

Specifically, we consider that the policy decisions may be subject to noise that can lead to errors in the estimate of the queue lengths, which is necessary to implement matching policies of the max-weight type.

Indeed, as is well known from the vast literature on the load balancing of parallel service systems, implementing policies that are sensitive to the queue sizes, such as the max-weight type policies considered in the present paper, becomes algorithmically prohibitive as the number of queues grows large. Efficiently sorting the queue sizes may easily lead to inaccuracies, entailing routing errors.

Specifically, in many practical cases the queue sizes of the various servers are not always known: they are either estimated by the incoming customers (as in physical queues in supermarkets, for instance), or by a central dispatcher (e.g., in passport-checking stations). Likewise, information regarding the queues to each of the servers in large web-server farms or call centers require constant communication between the job dispatchers and the servers, slowing down the response time. This information is thus not always available in real time (Lu et al. 2011), and in such cases, it can then be preferable to estimate the queue sizes, based on past or partial information, for instance. Observe that this source of inaccuracies is the main motivation for introducing “Power of d”–type policies to reduce the complexity of routing and the probability of errors by routing the requests only to a subset of the available servers having a more tractable size (see Mitzenmacher (1996) and related references on this matter).

In the context of matching systems, in most of the aforementioned applications, the requests of the various classes are physically gathered in different queues; that is, each node of the graph has its own buffer for storing the corresponding items. In many cases, the size of the graph, that is, the number of classes, is large, leading to routing problems that are similar to the context of parallel service systems. For instance, liver transplants are subject to compatibility constraints that lie far beyond the blood types of givers and receivers, namely, age, the nature of the pathology of the patient (cancer or cirrhosis), and immunological criteria. This leads to an increase of the number of classes of givers and receivers, and in turn, to an increase of the complexity of implementing max-weight type policies and to the potentiality of allocation errors or inaccuracies.

To account for this difficulty, in the spirit of Moyal and Perry (2021) for parallel service systems, we introduce an error parameter in the estimate of the queue sizes in the context of matching systems. In this context, it is then reasonable to consider that these errors are independent of one another but follow distributions that may depend on the specificity of the considered nodes. For instance, an arriving item to node j chooses a match from the neighboring node i, making an error of estimate of the queue length of node i, that may depend on i and j; see our later assumptions.

All in all, by “generalized max-weight policies,” we mean policies that are sensitive to the queue sizes, that incorporate rewards to give specific weights to the various matchings, and that take into account the possibility of errors in assessing the queue sizes of the various nodes of the graph.

1.2. Main Contributions

As we developed previously, the extension of the general stochastic matching model introduced in Mairesse and Moyal (2016), to the paradigm of reneging systems, is prevalent in practice. As will be specified later, adding an impatience parameter (possibly to some, but not all of the item classes) makes the analysis more intricate than in the original model. This constitutes the first main contribution of this work.

Using the specific structure of the max-weight policy, our second main contribution, Theorem 1, is to show that a simple quadratic function is a Lyapunov function whenever the traffic condition “ $λ \in NCOND (G, γ)$ ” (see (4)) that can be understood as a generalization of the classical condition “ $λ \in NCOND (G)$ ” to the case of reneging, is satisfied. This result is interesting and original for its own sake, even in the no-reneging case. This allows us directly to characterize the stability region and, as a corollary, to conclude that the max-weight policy is maximum stable. By doing so, we widely generalize results in Mairesse and Moyal (2016) to the case of generalized max-weight policies (of which the match the longest (ml) policy introduced in Bušić et al. (2013) and then Mairesse and Moyal (2016) is a particular case, see the later discussion) and to matching models with reneging. This quadratic function was already identified in Mairesse and Moyal (2016) and Bušić et al. (2013) as a Lyapunov function by an indirect proof: First, by stating that if the Foster-Lyapunov criterion applies to a quadratic Lyapunov function for some matching policy, then it must be also the case for the policy ml, and second, by showing that the criterion does apply to an ad hoc randomized policy. However, this argument cannot be generalized to the much wider case of max-weight policies, whence the main interest of the direct proof that is presented here.

We say that a couple graph/reneging vector $(G, γ)$ is stabilizable whenever there exists a matching policy and an arrival vector λ making the corresponding matching model stable. As a consequence of Theorem 1, $(G, γ)$ is stabilizable if and only if the set $NCOND (G, γ)$ is nonempty. It is well known (theorem 1 in Mairesse and Moyal 2016) that for a general stochastic matching model without reneging, the graph G is stabilizable if and only if it is not bipartite, because this condition is equivalent to $NCOND (G, γ)$ being nonempty. Our third main contribution is a quite remarkable result: In Proposition 2, we show a very simple sufficient condition for the stabilizabiliy of the system: it is sufficient that G be nonbipartite, or that only one class of elements renege. In particular, for a bipartite G (as is the case for organ transplants model, for instance), the system is stabilizable if, and only if at least one class of elements renege. Said differently, whenever G is bipartite and arrivals are made by single items, it is enough that one node receives impatient elements to make the system stabilizable, rather than artificially assuming that arrivals are made by pairs (as is the case in the (extended) bipartite matching models studied in the literature).

Showing that a given (even simple) function is a Lyapunov function is in general a challenging issue, and our case is not an exception, especially in such general a context. Our proof is quite intricate and relies on carefully exploiting this generalized Ncond condition. Combining these ideas, as our fourth main contribution, we are also able to prove exponential convergence speed toward the stationary measure, and additionally, by combining with stochastic comparisons, to construct bounds for the stationary mean of the largest queue size of customers in the system.

We finally illustrate some of those results using simulations on toy examples.

1.3. Outline of the Paper

In Section 2, we give the formal definition of the model we study. Section 3 is dedicated to the theoretical results. Section 3.1 focuses on stability: we characterize the stability region of the models in Theorem 1; this theorem is based on Proposition 1, a key result in which we control the drift of a quadratic function. Last, Proposition 2 describes precisely whether a given model is stabilizable. In Section 3.2, we then investigate the speed of convergence to equilibrium: a geometric Lyapunov bound is given in Proposition 3, and the exponential convergence to equilibrium follows as a consequence in Theorem 2. In Section 3.3, some bounds are given for moments of the size of the largest queue of the system in equilibrium in Theorem 3. Section 4 is dedicated to simulations: we compare the max-weight policy with a priority policy in Section 4.1, and an analysis of the theoretical bounds obtained in the previous section is given in Section 4.2. All proofs are provided in Section 5.

2. Model

2.1. Notation

The graph $G = (V, E)$ is assumed in all the article to be finite, simple, undirected, and connected. Because the set of nodes V represents customers classes, we use the words node and class in an indistinguishable way during all the article. The nonconnected case can be addressed by decomposing into connected components. Furthermore, to avoid trivialities, we assume $| V | > 1$ . We write ${i, j} \in E$ to denote that there is an edge connecting the nodes $i, j \in V$ in the graph G. We say that a subset $I \subset V$ is an independent set if $i, j \in I$ implies ${i, j} \notin E$ . For any subset $W \subset V$ , we let

I (W) = {I \subset W : I is an independent set} ∖ {\emptyset}

be the family of nonempty independent subsets of W, in a way that

I ≔ I (V)

gathers all nonempty independent sets of the graph G. Let also

E (W) = {i \in V : {i, j} \in E for some j \in W}

. For singletons, we write E(i) for

E ({i})

. Observe that

W \cap E (W)

is the union of the connected components of W that are not singletons; in particular,

W \cap E (W) = \emptyset

if and only if W is an independent set. Let

R, R_{+}, R_{+}^{*}, ℤ, N_{0}

and

N

, respectively, denote the sets of real numbers, nonnegative real numbers, positive real numbers, integers, nonnegative integers, and positive integers. For

p, q \in ℤ

, we let

[​​​ [p, q] ​​​] ≔ [p, q] \cap ℤ

be the set of all integers in the closed interval

[p, q]

. Let also, for

z \in R^{V}

and

W \subset V

| | z | |_{W} = \max_{i \in W} | z (i) |

, writing simply

| | z | |

in the case

W = V

(this is the supremum norm), and letting

| | z | |_{\emptyset} = 0

2.2. Stochastic Matching Model with Reneging

The (continuous-time) stochastic matching model with reneging $(G, MW, λ, γ)$ is formally defined as follows: we give ourselves a set of $| V |$ independent Poisson processes $N (i), i \in V$ , of respective intensities $λ (i) > 0, i \in V$ , and denote by $λ ≔ (λ (i), i \in V)$ the arrival intensity vector of the model. For any $i \in V$ , we denote by ${T_{n}^{i} : n \in N}$ the sequence of successive arrival times in the process N(i). For any subset $W \subset V$ , denote by $λ (W) = \sum_{i \in W} λ (i)$ the intensity of the superposed Poisson processes having indexes in W. The overall superposition of the Poisson process of the N(i)s, $i \in V$ , is denoted by N. It is thus a Poisson process of intensity $λ (V)$ , and its points are denoted by ${T_{n} : n \in N} ≔ \cup_{i \in V} {T_{n}^{i} : n \in N}$ . For all i, on each point $T_{n}^{i}, n \in N$ , an item of class i (or i-item for short) enters the system. Thus, the points of N are the overall arrival times in the system. We say that two items of respective classes i and j in V are compatible if and only if ${i, j} \in E$ . To each class $i \in V$ is also associated a parameter $γ (i) \geq 0$ . Denote $γ ≔ (γ (i), i \in V)$ , the reneging rate vector of the model. Unlike the vector λ, we allow γ to have null entries. We then associate to each i-item an exponential random variable (r.v.) of distribution $E (γ (i))$ , interpreted as its patience time. An r.v. of distribution $E (0)$ is interpreted as almost surely (a.s.) infinite, and we denote by

R ≔ {i \in V : γ (i) > 0} and R^{c} ≔ {i \in V : γ (i) = 0}

the subsets of nodes whose items do (respectively, do not) renege from the system. The corresponding item then reneges from the system at the end of this time period provided that it has not left before because of the matching mechanism we describe later. We assume that the patience times of all items are independent of one another and of the other involved r.v.s.

The dynamics of the model $(G, MW, λ, γ)$ is then defined recursively at the successive points of N, as follows:

We start at time 0 with a state in which all present items are incompatible. In other words, in the system at time 0, for any ${i, j} \in E$ , there are either no i items, or no j items, or neither. Such a buffer content is called admissible. We associate a patience to each of these initial items that is drawn, independently of everything else, from the distribution corresponding to its class, for example, from $E$ ( $γ (j)$ ) for a j-item. As previously mentioned, if $j \in R^{c}$ , that is, $γ (j) = 0$ , the patience time of the corresponding item is set as infinite.
Let $n \in N$ , and suppose that the buffer content of the system was admissible at time $T_{n - 1}$ . We first update the buffer content at each reneging time in $[T_{n - 1}, T_{n})$ , by discarding all items present in the system at time $T_{n - 1}$ and whose patience time has expired during that time interval. Observe that the buffer content remains admissible after this procedure. Then, suppose that T_n is a point of the jth arrival process, that is, $T_{n} = T_{k}^{j}$ for some $k \in [ [1, n] ]$ , or in other words, a j-item enters the system at T_n. This happens with probability
$μ_{λ} (j) ≔ \frac{λ (j)}{λ (V)} \cdot$ (1)
We then face the following alternatives:
- (a) Either the buffer contains no items compatible with the incoming j-item, that is, there are no stored items of classes in E(j), in which case the latter item is stored in the buffer. We then draw the (possibly infinite) patience time of this item from the distribution $E$ ( $γ (j)$ ), independently of everything else.
- (b) Or there is at least one remaining item having class in E(j) in the buffer. Then the incoming j-item must be matched with one of these compatible items, and it is the role of the matching policy to determine its match. We assume that the matching policy is of the generalized max-weight policy type with parameters w and F (mw $_{w, F}$ , or simply mw, for short). To define this, let $w ≔ {w_{(j, i)} : (j, i) \in V^{2} s . t . {i, j} \in E}$ be a family of nonnegative numbers and ${F_{(j, i)} : (j, i) \in V^{2} s . t . {i, j} \in E}$ be a family of cumulative distribution functions (c.d.f.s). We index with ordered pairs $(j, i)$ instead of the unordered ones because the involved parameters are not assumed to be symmetric. The first entry of the pair j will typically represent the class of the incoming item, and the second one i, the class of the item j performs a match with. Denote, for all $i \in V$ , by x(i) the number of stored i-items at this time. Then the match of the incoming j-item is chosen uniformly at random from the set
  $\underset{i \in E (j) : x (i) > 0}{arg max} ({[x (i) + U_{(j, i), n}]}^{+} + w_{(j, i)}),$ (2)
  where $U_{(j, i), n}$ is an r.v. drawn at time n independently of everything else from the c.d.f. $F_{(j, i)}$ , interpreted as the measurement error of the queue length of node i, made on an arrival of a j element at time T_n, and $w_{(j, i)}$ is interpreted as the specific reward of matching an incoming j-item with an i-item. Observe that in (2), we make the simplifying assumption that measurement errors are made only regarding the length of nonempty queues; that is, empty queues are identified correctly. This makes sense in practice, first, because the overestimation of the size of an empty queue might lead to a matching involving an item that is not actually present in the system. Second, it is usually simpler to determine whether a given buffer is empty or not (by sending a ping to the corresponding server) than to precisely measure the size of a (potentially much longer) nonempty queue.
  Then, once the matching is done, the incoming j-item leaves the system right away, together with its match. We emphasize that, unlike the notion of compatibility given by the undirected set of edges E, symmetry between i and j is a priori not assumed in the definitions of the measurement errors and the matching rewards.
It is then easily seen that, in any case, the buffer content at time T_n is also admissible.
We continue the construction inductively.

Remark 1.

Particular cases of generalized max-weight policies are as follows:

The classical max-weight policies, as defined inTassiulas and Ephremides (1992), recovered by taking constant weights and, for every $(j, i) \in V^{2}$ such that ${i, j} \in E$ , the c.d.f. $F_{(j, i)} ≔ 𝟙_{R_{+}}$ as the one corresponding to the Dirac δ₀ distribution, that is, no error.
The match the longest policy introduced inMairesse and Moyal ( 2016), recovered by taking $F_{(j, i)} ≔ 𝟙_{R_{+}}$ and $w_{(j, i)} = 0$ for all $(j, i) \in V^{2}$ such that ${i, j} \in E$ , that is, no measurement errors and no rewards.

Remark 2.

The error measurement model we consider is additive, namely, a scale-free error is made on all queue lengths, and we prevent absurd measurement of negative queue lengths by taking the positive part of the latter (see (2)). Another interesting model would be to assume mutliplicative error measurements; namely, (2) would read

\underset{i \in E (j) : x (i) > 0}{arg max} (x (i) U_{(j, i), n} + w_{(j, i)}) .

We strongly believe that all results to come also hold in that case but leave the proof of this conjecture to the reader.

2.3. Markov Representation

Let, for all $t \geq 0, X_{t} ≔ (X_{t} (i), i \in V)$ be the queue length vector at time t, namely, for any $i \in V, X_{t} (i)$ is the number of i-items in the buffer at time t. It is easily seen that the process $X = {X_{t} : t \geq 0}$ is a regular continuous time Markov chain (CTMC) on the (countable) commutative state-space

X = {x \in N_{0}^{V} : x (i) x (j) = 0 for any {i, j} \in E} .

For any $x \in X$ , let $S_{x} = {i \in V : x (i) > 0}$ be its support. Observe that S_x is an independent set for every $x \in X$ . For all $i \in V$ , denote by δ_i the ith canonical vector: $δ_{i} (j) = 𝟙 {i = j}$ . The infinitesimal generator $L$ of X is then simply given by

L f (x) = \sum_{i \notin E (S_{x})} [f (x + δ_{i}) - f (x)] λ (i) + \sum_{i \in S_{x}} [f (x - δ_{i}) - f (x)] [γ (i) x (i) + \sum_{j \in E (i)} λ (j) ν_{x, j} (i)]

(3)

for any

x \in X

and any function

f : X \to R

, where, for all

i \in V

and

j \in E (i), ν_{x, j} (i)

is the probability of performing the match

{i, j}

, given that a j-item enters the system in state x. From the very definition of mw, the latter is fully determined by the c.d.f.s

F_{(j, i)}, i \in S_{x} \cap E (j)

3. Results

3.1. Stability

The state $x \in X$ is said to be positive recurrent if the expected amount of time until the CTMC revisits x given that $X_{0} = x$ , is finite. The CTMC is said to be positive recurrent if every state $x \in X$ is. Because the CTMC is irreducible, this occurs if and only if some state is positive recurrent. Positive recurrence is equivalent to the existence and uniqueness of an invariant distribution. We say that the matching model $(G, MW, λ, γ)$ is stable if the CTMC ${(X_{t})}_{t \geq 0}$ is positive recurrent. The next result provides a necessary and sufficient condition for stability.

Theorem 1.

For any connected graph G and any $γ \in R_{+}^{V}$ , the model $(G, M W_{w, F}, λ, γ)$ is stable if and only if its intensity vector λ belongs to the set

NCOND (G, γ) = {λ \in {(R_{+}^{*})}^{V} : λ (I) < λ (E (I)) for any I \in I (R^{c})} .

(4)

Observe that if we define

η = η (λ) ≔ {\begin{array}{l} \min {λ (E (I)) - λ (I) : I \in I (R^{c})} & if R^{c} \neq \emptyset \\ \infty & else, \end{array}

then

λ \in NCOND (G, γ)

if and only if

η (λ) > 0

For $(j, i) \in V^{2} such that {i, j} \in E$ , let $U_{(j, i)}$ be an r.v. with c.d.f. given by $F_{(j, i)}$ ; that is, $U_{(j, i)}$ has the same distribution than $U_{(j, i), n}$ for every n. For $κ \in (0, λ (V))$ , define

u_{κ, (j, i)} = \min {u \in R_{+} : ℙ (| U_{(j, i)} | \leq u) \geq {(1 - \frac{κ}{λ (V)})}^{\frac{1}{2 | E |}}},

and let

u_{κ} = \max {u_{κ, (j, i)} : (j, i) \in V^{2} s . t . {i, j} \in E} .

The quantity $u_{κ}$ is tailor-made for the event

B_{κ, n} ≔ \underset{(j, i) \in V^{2} : {i, j} \in E}{\cap} {| U_{(j, i), n} | \leq u_{κ}}

to satisfy

ℙ (B_{κ, n}) \geq 1 - \frac{κ}{λ (V)} for every n \in N .

(5)

Theorem 1 is a corollary of the following key generator bound for a quadratic function.

Proposition 1.

Assume $λ \in NCOND (G, γ)$ and let $κ \in (0, λ (V))$ . Let $f_{2} : X \to R$ be the quadratic function defined by $f_{2} (x) = \sum_{i \in V} x {(i)}^{2}$ , and let $\overset{ˇ}{w} = \max_{(j, i) \in V^{2} : {i, j} \in E} w_{(j, i)}$ .

If $R = \emptyset$ , then
$L f_{2} (x) \leq λ (V) + 2 λ (V) (2 u_{κ} + \overset{ˇ}{w}) | V | + 2 (κ - η) | | x | |$ (6)
for every $x \in X$ .
Otherwise, if $R \neq \emptyset$ , we have
$\begin{array}{l} L f_{2} (x) \leq & λ (V) + \sum_{i \in R} [- 2 γ (i) x {(i)}^{2} + (γ (i) + 2 λ (i)) x (i)] \\ + 2 [| | x | |_{R} + (2 u_{κ} + \overset{ˇ}{w}) | R^{c} |] λ (R^{c}) \\ + 2 [λ (V) (2 u_{κ} + \overset{ˇ}{w}) | R^{c} | + (κ - η) | | x | |_{R^{c}}] 𝟙 {| | x | |_{R^{c}} > | | x | |_{R} + (2 u_{κ} + \overset{ˇ}{w}) | R^{c} |} \end{array}$ (7)
for every $x \in X$ .

As can be easily seen in the proof of the necessity statement in Theorem 1, for a connected graph G and a reneging rate vector γ, λ being an element of $NCOND (G, γ)$ is necessary for the existence of a matching policy that makes the system stable, whatever the matching policy is. The sufficiency statement means that such models always exist whenever λ do belong to $NCOND (G, γ)$ —It suffices to implement a matching policy in the max-weight class. Therefore, it is natural to say that the couple $(G, γ)$ is stabilizable if there exists an intensity vector λ such that the model with reneging $(G, MW, λ, γ)$ is stable, that is, if Ncond $(G, γ)$ is nonempty. The following proposition makes precise the class of stabilizable graphs.

Proposition 2.

Let $G = (V, E)$ be a connected graph and $γ \in R_{+}^{V}$ be a reneging rate vector. Then $NCOND (G, γ)$ is nonempty if and only if G is nonbipartite or $R \neq \emptyset$ .

In other words, if G is bipartite, it is enough that one class of elements renege to make the system stabilizable. As will be made clearer in the proof of Proposition 2, the reason behind this (apparently surprising) result is as follows: if $G = (V, E)$ is bipartite, say of bipartition $V = V_{1} \cup V_{2}$ , and no elements renege, then the only independent set I that cannot satisfy $λ (I) < λ (E (I))$ for some intensity vector λ is either I = V₁ or I = V₂ because, as G is bipartite and connected, ${V_{1}, V_{2}}$ is the only pair of sets such that $V_{2} = E (V_{1})$ and $V_{1} = E (V_{2})$ , and we cannot have $λ (V_{1}) < λ (V_{2})$ together with $λ (V_{2}) < λ (V_{1})$ . Then, adding reneging to a single node of V₁ (if I = V₁), or to V₂ (if I = V₁) allows to simply break this paradoxal symmetrical condition. See the details in Section 5.3.

To summarize, gathering Theorem 1 and Proposition 2, we obtain the following panorama regarding the stability of the model:

Any couple $(G, γ)$ is stabilizable provided that G is nonbipartite or that some classes of elements renege, that is, $R \neq \emptyset$ .
Under these conditions, any model $(G, MW, λ, γ)$ with $λ \in NCOND (G, γ)$ , is stable.

Remark 3

(Full Reneging Case). In the case where $R^{c} = \emptyset$ , that is, all items may renege, Condition (4) is empty, and we retrieve the good-sense result that any such matching model is stable, whatever the intensity vector λ.

Remark 4

(No-Reneging Case). In the case where $R = \emptyset$ (no reneging), the model amounts to a continuous-time version of that introduced inMairesse and Moyal (2016). From Theorem 1, any model $(G, MW, λ, γ)$ is stable whenever λ belongs to the set

NCOND (G) = {λ \in {(R_{+}^{*})}^{V} : λ (I) < λ (E (I)), for all I \in I} .

(8)

In other words, any policy of the mw type is then maximal stable, in the sense that it has the largest stability region. This result generalizes those ofMairesse and Moyal (2016). Indeed, setting $F_{(j, i)} ≔ 𝟙_{R_{+}}$ for all (j, i) (no errors) and $w_{(j, i)} ≔ w_{j}$ for all $j \in V$ and all $i \in E (j)$ (equal rewards for all matches involving an incoming j-item, $j \in V$ ), the matching policy amounts to match the longest (priority is given to the node having the largest queue), and we retrieve assertion (15) in theorem 2 ofMairesse and Moyal (2016). Theorem 1 is thus a wide generalization of the latter result, to the case of weighted mw policies with measurement errors.

3.2. Speed of Convergence to Equilibrium

In this and the following section, we assume $R = \emptyset$ for simplicity. The involved techniques can be used to obtain analogous results in the general case (see Remark 5).

Building on the previous results and following classical approaches for Markov processes with bounded jumps, we now show how to define a geometric Lyapunov function that is very useful to quantify the speed of convergence to the steady state. We first have the following result.

Proposition 3.

Let $(G, M W_{w, F}, λ, γ)$ be a max-weight stochastic matching model with $γ \equiv 0$ and λ belonging to the set $NCOND (G, γ)$ defined by (4). Then, if $α > 0$ is small enough, there exist $c, k > 0$ such that

L e^{α | | x | |_{2}} \leq - c e^{α | | x | |_{2}}

for any

x \in X

such that

| | x | |_{2} \geq k

, being

| | x | |_{2} = {(\sum_{i \in V} x {(i)}^{2})}^{1 / 2}

and

L

is the generator of the process X defined by (3).

The previous proposition implies that the distribution of the Markov process X converges to its stationary distribution exponentially fast. More precisely, the following result follows immediately from Proposition 3, using, for example, the results in Hairer (2014).

Theorem 2.

As in Proposition 3, let $(G, M W_{w, F}, λ, γ)$ be a max-weight stochastic matching model with $γ \equiv 0$ and λ belonging to $NCOND (G, γ)$ , and let π be the unique invariant distribution of the process X (granted by Theorem 1). For any $α > 0$ there exist constants c > 0 and $0 < ρ < 1$ such that

| | ℙ_{x} (X_{t} = \cdot) - π (\cdot) | |_{TV} \leq c ρ^{t} e^{α | | x | |}

for all

x \in X

. Here

ℙ_{x}

is the law of the process conditioned to

[X_{0} = x]

, and

| | . | |_{TV}

denotes the total variation distance.

3.3. Moments Bounds at Equilibrium

We control below the first moment of the maximum queue size under the steady state of the system.

Theorem 3.

Assume that we are under hypotheses of Proposition 3, namely $R = \emptyset$ and $λ \in NCOND (G, γ)$ , and let π be the unique invariant distribution. Let $U_{(j, i)}$ have distribution $F_{(j, i)}$ (this r.v. has been defined in Section 3.1), and assume that there exists $B \geq 0$ such that $ℙ [- B \leq U_{(j, i)} \leq B] = 1$ for every $(j, i) \in V^{2}$ such that ${i, j} \in E$ . Then we have that

\max_{i \in V} \frac{ρ_{i}}{1 - ρ_{i}} \leq \int | | x | |_{\infty} π (d​ x) \leq \frac{λ (V)}{η} (\frac{1}{2} + (\overset{ˇ}{w} + 2 B) | V |),

(9)

where

ρ_{i} = λ (i) / λ (E (i))

for all

i \in V

Remark 5.

With exactly the same procedures, the content of Sections 3.2 and 3.3 can be extended in the following two directions: on the one hand, one may weaken hypothesis $R = \emptyset$ ; on the other, bounds for higher order moments under π may be obtained. We prefer to avoid developing this direction since the computations become rapidly cumbersome.

4. Simulations

In this section, we present a few simulation results. In the first set of simulations, we illustrate the gain of performance of max-weight policies with respect to other, more static, policies, for systems with reneging. To do so, as an illustrative example, in Section 4.1, a max-weight policy is compared with policies of the priority type, namely, policies that only take into consideration the rewards of the various matches but not the queue sizes. Then, in Section 4.2, we compare the theoretical bounds obtained by Lyapunov techniques in Section 3.3 with simulated values. These bounds are not meant to be very tight, and we leave their optimization for future research.

4.1. Comparison with Priority Policies

In this section, we assume $F_{(j, i)} = 𝟙_{R_{+}}$ for every $i, j \in V$ such that ${i, j} \in E$ , namely no measurement errors are considered. A policy is said to be of the priority type if, whenever the system is in state $x \in X$ and a j-item enters the system, the match of the incoming j-item (if any) is chosen uniformly at random from the set

\underset{i \in E (j) : x (i) > 0}{arg max} w_{(j, i)} .

This amounts to saying that each j-item has a fixed order of priorities between the neighboring nodes of i, for choosing its match. (Observe that a priority policy can be seen as a max-weight policy where we set the errors to $U_{(j, i), n} = - \infty$ a.s. for all n, (j, i).) As it is illustrated in the two examples in section 5 of Mairesse and Moyal (2016), priority policies can be maximal stable or not. Conversely, as is shown in theorem 3 of Moyal and Perry (2017), for any graph G outside a particular class of graphs, there always exists a nonmaximal priority policy.

To compare the asymptotic behavior of max-weight and priority policies for a variety of graphs, arrival and departure rates, and rewards, we randomize all these parameters. We perform 100 simulations. In each one, we first sample the graph G, the arrival rates λ, departure rates γ, and rewards w, as follows:

The graph $G = (V, E)$ is an Erdős-Rényi one with parameters $| V | = 30$ and $p = 0.1$ ;
The arrival rates ${(λ (i))}_{i \in V}$ are i.i.d. with common distribution $Unif [0, 10]$ ;
The reneging rates are i.i.d. with common distribution $Ber (0.5)$ (on average, half of the nodes do not renege); and
We consider symmetric rewards, namely $w_{(j, i)} = w_{(i, j)}$ for every i, j such that ${i, j} \in E$ , and the family ${w_{(j, i)} : {i, j} \in E}$ is i.i.d. with common law $Unif [0, 10]$ .

In each case, the condition Ncond is forced; that is, if λ is not an element of Ncond(G), the simulation is rerun. Then, for each simulation we compare a system ruled by max-weight to the same system ruled by the corresponding priority policy, for the same sampled parameters. In other words, we perform paired replications in the sense that a random network is generated by items 1–4, and the same network is used for comparing the corresponding policies. Two level of randomness are involved: the first one to define the network (formed by the graph, and the values of the rates and rewards), and the second one to draw, conditioned to the already sampled network, the effective dynamic of the process (consisting on the arrival and departure episodes, and the randomness to decide which item class to pick in case of a tie in the value of the weights). Observe that, from Theorem 1, the max-weight system is necessarily stable, whereas in view of the previous observations, the system ruled by the priority policy is not necessarily so.

Figure 1 represents, in vertical order, the size of the largest queue, the cumulative reward (each time a $(j, i)$ -match occurs, a reward $w_{j, i}$ accumulates) and the cumulative number of departures, the three quantities versus time, in the time interval $[0, 100]$ . Figure 2 is the boxplot for the size of the largest queue at time t = 100.

Figure 1. From Top to Bottom, Largest Queue Size, Cumulative Reward, and Cumulative Number of Departures, in the Time Interval $[0, 100]$
*Notes.* (a) Priority. (b) Max-weight.

**Figure 2. Boxplots for 100 Iterations at Time t = 100 of the Largest Queue Size**

These results put in evidence an important and striking qualitative difference between the two policies, in the behavior of the buffer size process. Specifically, max-weight keeps far fewer items in the system, whereas both policies have essentially the same performance in terms of cumulative reward and departures. In many cases, the priority policy seems to fail in stabilizing the system, whereas the corresponding max-weight system is always stable. Finally, it is worth mentioning that the same picture was obtained in unpublished simulations ran in much larger time intervals.

4.2. Bounds in the Stationary Regime

Unlike Section 4.1, in this section, impatience is excluded. We first restrict our attention to the classical match the longest policy; that is, we take $w_{j, i} = 0$ for every $i, j \in V$ such that ${i, j} \in E$ , and we work without measurement errors. An important difference with respect to the previous subsection is that here the parameters $(G, λ)$ are drawn only once at the beginning. We clarify that no comparison with a priority policy is made here. The distribution of the pair $(G, λ)$ is maintained: Erdős-Rényi distribution with parameters $| V | = 30$ and $p = 0.1$ for G, and common distribution $Unif [0, 10]$ for the i.i.d. family ${(λ (i))}_{i \in V}$ . Figure 3 represents a histogram at time t = 100 of 500 simulations. The vertical lines represent the lower and upper bounds for the expected size of the largest queue given by Theorem 3.

**Figure 3. Histogram for 500 Simulations at Time t = 100 of the Largest Queue Size for Match the Longest (i.e., MW with *w_ij* = 0)**

Finally, we study the case with measurement errors and rewards. The random variables ${U_{(j, i)} : {i, j} \in E}$ giving place to the noises are chosen to be i.i.d. with common distribution $Unif [- 1, 1]$ , and the rewards are again symmetric and i.i.d. with common distribution $Unif [0, 10]$ . The distributions of G and λ do not change, and the tuple $(G, λ, U, w)$ is drawn only once at the beginning. Figure 4 is a histogram also for 500 iterations at time t = 100, and the vertical line represents the lower bound for the expectation. The numerical value for the upper bound for the expectation is 26,867.96, and it is not included in the graphic.

**Figure 4. Histogram for 500 Simulations at Time t = 100 of the Largest Queue Size for Max-Weight with Rewards and Noise**

For match the longest, we observe tight bounds for the expected value. All the upper bounds become very loose when we include the presence of rewards or noises. From unpublished simulations, we observe that this is a general picture and not only a particular situation for this choice of the parameters. In the presence of asymmetric rewards and noise, the bounds are not informative. We leave for future work to obtain better tailored made bounds in this case. A typical histogram is very close to the lower bound for the expectation, and the upper bound is more accurate if larger values of p in the distribution of G are considered.

5. Proofs

5.1. Proof of Proposition 1

Fix $x \in X$ and $κ \in (0, λ (V))$ . To shorten notation, we write

τ_{i} (x) = \sum_{j \in E (i)} λ (j) ν_{x, j} (i) .

We assume during the proof that we are in the $(n - 1)$ th step, namely, the law of $ν_{x, j}$ is ruled by the family of r.v.s ${U_{(j, i), n} : {i, j} \in E}$ ; of course, by homogeneity, this can be done without loss of generality. For the quadratic function f₂, identity

f_{2} (x \pm δ_{i}) - f_{2} (x) = 1 \pm 2 x (i)

holds as long as

x \pm δ_{i} \in X

. Hence, in view of the expression defining the generator (3), we get

L f_{2} (x) = \sum_{i \notin E (S_{x})} [1 + 2 x (i)] λ (i) + \sum_{i \in S_{x}} [1 - 2 x (i)] [γ (i) x (i) + τ_{i} (x)] .

(10)

Because $x (j) = 0$ for all $j \in E (S_{x})$ , the first sum equals

λ (E {(S_{x})}^{c}) + 2 \sum_{i \in V} λ (i) x (i) .

Observe that

\begin{array}{l} \sum_{i \in S_{x}} τ_{i} (x) & = \sum_{i \in S_{x}} \sum_{j \in E (i)} λ (j) ν_{x, j} (i) \\ = \sum_{i \in V} \sum_{j \in V} 𝟙 {i \in S_{x}} 𝟙 {{i, j} \in E} λ (j) ν_{x, j} (i) \\ = \sum_{i \in V} \sum_{j \in V} 𝟙 {i \in S_{x}} 𝟙 {j \in E (S_{x})} λ (j) ν_{x, j} (i) \\ = \sum_{i \in V} \sum_{j \in V} 𝟙 {j \in E (S_{x})} λ (j) ν_{x, j} (i) \\ = \sum_{j \in E (S_{x})} λ (j) \sum_{i \in V} ν_{x, j} (i) \\ = \sum_{j \in E (S_{x})} λ (j) \\ = λ (E (S_{x})), \end{array}

(11)

the third and the forth identities following because

ν_{x, j} (i)

is zero if

{i, j} \notin E

i \notin S_{x}

. Hence, the second sum in (10) can be rewritten as

\sum_{i \in V} γ (i) x (i) + λ (E (S_{x})) - 2 \sum_{i \in V} γ (i) x {(i)}^{2} - 2 \sum_{i \in V} x (i) τ_{i} (x) .

Putting these observations together, we get

L f_{2} (x) = λ (V) + \sum_{i \in V} [- 2 γ (i) x {(i)}^{2} + x (i) γ (i)] + 2 \sum_{i \in V} x (i) [λ (i) - τ_{i} (x)] .

(12)

From now on the proof is devoted to controlling the last term; an objective that will be carried out through the use of the following technical result.

Lemma 1.

Fix $x \in X$ and $κ \in (0, η]$ , and assume that the nonempty subset $W \subset S_{x}$ satisfies the following hypotheses:

For every $j \in E (W), \sum_{i \in W} ν_{x, j} (i | B_{κ, n}) = 1$ .
For every $A \subset W, λ (A) < λ (E (A))$ .

Then

\sum_{i \in W} x (i) [λ (i) - τ_{i} (x)] \leq λ (E (W)) (2 u_{κ} + \overset{ˇ}{w}) | W | + (κ - η) | | x | |_{W} .

(13)

In the assumptions of Lemma 1, hypothesis 1 states that, conditioned to $B_{κ, n}$ , an arriving j-item necessarily matches with an item whose class belongs to W. On another hand, hypothesis 2 states that the inequalities defining Ncond are satisfied inside the set W. The connection between the two hypotheses can be informally understood as follows: the second one gives stability inside W, and the first one tells you that this stability is not being taken advantage of by classes outside $W \cup E (W)$ .

Before providing the proof of Lemma 1, we show how to conclude from it. To avoid trivialities, we assume $S_{x} \neq \emptyset$ from now on.

Assume $R = \emptyset$ (no reneging). It is easy to see that the hypotheses of Lemma 1 are satisfied for $W = S_{x}$ . Hence, from (13) it follows that
$\sum_{i \in V} x (i) [λ (i) - τ_{i} (x)] \leq λ (V) (2 u_{κ} + \overset{ˇ}{w}) | V | + (κ - η) | | x | | .$ (14)
Then simply replace in (12) to obtain the desired inequality (6).
We now turn to the case where $R \neq \emptyset$ . First, the result in the subcase $R^{c} = \emptyset$ follows immediately from (12). Therefore, let us finally assume that $R \neq \emptyset$ and $R^{c} \neq \emptyset$ . Coming back to (12), we have
$\begin{array}{l} L f_{2} (x) & \leq λ (V) + \sum_{i \in V} [- 2 γ (i) x {(i)}^{2} + x (i) γ (i)] + 2 \sum_{i \in R} x (i) λ (i) + 2 \sum_{i \in R^{c}} x (i) [λ (i) - τ_{i} (x)] \\ = λ (V) + \sum_{i \in R} [- 2 γ (i) x {(i)}^{2} + x (i) (γ (i) + 2 λ (i))] + 2 \sum_{i \in R^{c}} x (i) [λ (i) - τ_{i} (x)] . \end{array}$ (15)

If $| | x | |_{R^{c}} \leq | | x | |_{R} + | R^{c} | (2 u_{κ} + \overset{ˇ}{w})$ , we can easily conclude because

\sum_{i \in R^{c}} x (i) [λ (i) - τ_{i} (x)] \leq \sum_{i \in R^{c}} x (i) λ (i) \leq [| | x | |_{R} + | R^{c} | (2 u_{κ} + \overset{ˇ}{w})] λ (R^{c}) .

(16)

From now on we assume that $| | x | |_{R^{c}} > | | x | |_{R} + | R^{c} | (2 u_{κ} + \overset{ˇ}{w})$ , and we recall that we are in the situation in which $R \neq \emptyset$ and $R^{c} \neq \emptyset$ . One is tempted to apply Lemma 1 with $W = R^{c} \cap S_{x}$ , especially because, for this particular choice of W, hypothesis 2 of this lemma is guaranteed by the hypothesis $NCOND (G, γ)$ . Unfortunately, hypothesis 1 of Lemma 1 fails. The rest of the proof is devoted to finding a proper subset $W \subset R^{c} \cap S_{x}$ fitting this missing requirement.

Here we introduce a crucial notion of connectivity that depends on the number of stored items, notion that is also used in the proof of Lemma 1. We define an auxiliary graph $(V^{'}, E^{'})$ by setting $V^{'} = S_{x} \cap R^{c}$ and ${i, j} \in E^{'} \Leftrightarrow i \neq j and | x (i) - x (j) | \leq {2 u}_{κ} + \overset{ˇ}{w}$ . Let $C$ be the partition of $V^{'}$ induced by the connected components of the graph $(V^{'}, E^{'})$ : for every $C \in C$ , C is a subset of $V^{'}$ , and every pair $i, j \in C$ is connected by a path of edges in $E^{'}$ . The set

W = \cup {C \in C : x (i) > | | x | |_{R} + 2 u_{κ} + \overset{ˇ}{w} for every i \in C}

is the one that does the trick as we show here:

We first see that $W \neq \emptyset$ . Let $i_{0} \in V^{'}$ be such that $| | x | |_{R^{c}} = x (i_{0})$ , and let $C_{0} \in C$ be the subset to which i₀ belongs. We will prove that $C_{0} \subset W$ . We have to see that $x (i) > | | x | |_{R} + 2 u_{κ} + \overset{ˇ}{w}$ for every $i \in C_{0}$ . If $i = i_{0}$ , this follows because
$x (i_{0}) = | | x | |_{R^{c}} > | | x | |_{R} + | R^{c} | (2 u_{κ} + \overset{ˇ}{w}) \geq | | x | |_{R} + 2 u_{κ} + \overset{ˇ}{w} .$ (17)
If $i \neq i_{0}$ , consider a path in $(V^{'}, E^{'})$ connecting i₀ with i, namely, take $i_{1}, \dots, i_{L} \in C_{0}$ (L is simply the length of the path) such that $i_{L} = i, {i_{l - 1}, i_{l}} \in E^{'}$ for every $1 \leq l \leq L$ , and $i_{l} \neq i_{l^{'}}$ for every $l, l^{'} \in {0, \dots, L}, l \neq l^{'}$ . The definition of C₀ implies that
$x (i_{0}) - x (i) \leq \sum_{l = 1}^{L} | x (i_{l - 1}) - x (i_{l}) | \leq L (2 u_{κ} + \overset{ˇ}{w}) .$ (18)
Using that $L + 1 \leq | R^{c} |$ , we have
$\begin{array}{l} x (i) & \geq x (i_{0}) - L (2 u_{κ} + \overset{ˇ}{w}) \\ > | | x | |_{R} + | R^{c} | (2 u_{κ} + \overset{ˇ}{w}) - L (2 u_{κ} + \overset{ˇ}{w}) \geq | | x | |_{R} + 2 u_{κ} + \overset{ˇ}{w} \end{array}$
as desired.
We prove now that hypothesis 1 of Lemma 1 holds. For $j \in E (W)$ fixed, we have to see that, conditioned to $B_{κ, n}$ , the policy necessarily performs a match with an item class in W. This follows if we prove that
${[x (i) + U_{(j, i), n}]}^{+} + w_{(j, i)} > {[x (i^{'}) + U_{n, (j, i^{'})}]}^{+} + w_{(j, i^{'})}$ (19)
for every $i \in W$ and $i^{'} \in S_{x} ∖ W$ . We split into the following two subcases: $i^{'} \in S_{x} \cap R$ and $i^{'} \in (S_{x} \cap R^{c}) ∖ W$ .
- (a) If $i^{'} \in S_{x} \cap R$ , we have
  $\begin{array}{l} {[x (i) + U_{(j, i), n}]}^{+} + w_{(j, i)} & > | | x | |_{R} + 2 u_{κ} + \overset{ˇ}{w} + U_{(j, i), n} + w_{(j, i)} \\ \geq x (i^{'}) + u_{κ} + \overset{ˇ}{w} \\ \geq {[x (i^{'}) + U_{n, (j, i^{'})}]}^{+} + w_{(j, i^{'})} . \end{array}$
- (b) For the subcase $i^{'} \in (S_{x} \cap R^{c}) ∖ W$ , we will use the following obvious property: if $i_{1}, i_{2} \in V^{'}$ both belong to the same set of connected nodes $C \in C$ , and $i_{3} \in V^{'}$ is such that $x (i_{1}) \land x (i_{2}) - (2 u_{κ} + \overset{ˇ}{w}) \leq x (i_{3}) \leq x (i_{1}) \lor x (i_{2}) + 2 u_{κ} + \overset{ˇ}{w}$ , then also i₃ belongs to C. We next show that
  $x (i) > x (i^{'}) + 2 u_{κ} + \overset{ˇ}{w} .$ (20)
Suppose by contradiction that the inequality is false. Let $C^{'} \in C$ be the subset to which $i^{'}$ belongs, and let $i^{'}^{'} \in C^{'}$ be such that $x (i^{'}^{'}) \leq | | x | |_{R} + 2 u_{κ} + \overset{ˇ}{w}$ (such an $i^{'}^{'}$ exists because otherwise $i^{'}$ would belong to W). In particular,
$x (i^{'}) \land x (i^{'}^{'}) - (2 u_{κ} + \overset{ˇ}{w}) \leq | | x | |_{R} + 2 u_{κ} + \overset{ˇ}{w} < x (i) \leq x (i^{'}) \lor x (i^{'}^{'}) + 2 u_{κ} + \overset{ˇ}{w} .$
The mentioned property implies that $i \in C^{'}$ , contradictorily implying that $i \notin W$ . Once we have Inequality (20), we can conclude because
$\begin{array}{l} {[x (i) + U_{(j, i), n}]}^{+} + w_{(j, i)} & > x (i^{'}) + 2 u_{κ} + \overset{ˇ}{w} + U_{(j, i), n} + w_{(j, i)} \\ \geq {[x (i^{'}) + U_{n, (j, i^{'})}]}^{+} + w_{(j, i^{'})} . \end{array}$ (21)
As mentioned before, hypothesis 2 of Lemma 1 holds for $V^{'}$ because of the $NCOND (G, γ)$ assumption, and it extends to W simply for being a subset.

We can now apply Lemma 1 to W, use that $x (i) \leq | | x | |_{R} + (2 u_{κ} + \overset{ˇ}{w}) | R^{c} |$ for every $i \in R^{c} ∖ W$ , and use that $| | x | |_{W} = | | x | |_{R^{c}}$ , to obtain

\begin{array}{l} \sum_{i \in R^{c}} x (i) [λ (i) - τ_{i} (x)] & \leq \sum_{i \in R^{c} ∖ W} x (i) λ (i) + \sum_{i \in W} x (i) [λ (i) - τ_{i} (x)] \\ \leq [| | x | |_{R} + | R^{c} | (2 u_{κ} + \overset{ˇ}{w})] λ (R^{c}) + λ (W^{c}) (2 u_{κ} + \overset{ˇ}{w}) | W | + (κ - η) | | x | |_{W} \\ \leq [| | x | |_{R} + | R^{c} | (2 u_{κ} + \overset{ˇ}{w})] λ (R^{c}) + λ (V) (2 u_{κ} + \overset{ˇ}{w}) | R^{c} | + (κ - η) | | x | |_{R^{c}} . \end{array}

Replace in (15) to conclude.

Proof of Lemma 1.

We divide the proof into three steps.

Step 1: Separation into connected components. We define an auxiliary graph $(W, \tilde{E})$ by ${i, j} \in \tilde{E} \Leftrightarrow | x (i) - x (j) | \leq 2 u_{κ} + \overset{ˇ}{w}$ , and we denote by ${C_{1}, \dots, C_{L}}$ the partition of W into connected subsets of nodes. We suppose that they are ordered in a decreasing way with respect to the number of stored items: if $l < l^{'}$ , and if $i \in C_{l}$ and $i^{'} \in C_{l^{'}}$ , then $x (i) > x (i^{'}) + 2 u_{κ} + \overset{ˇ}{w}$ . More precisely, if $i_{1}, \dots, i_{| W |}$ is an enumeration of the nodes in W such that $x (i_{k}) \leq x (i_{k + 1})$ for every $k \in {1, \dots, | W | - 1}$ , there are exactly $L - 1$ of them $i_{k_{1}}, \dots, i_{k_{L - 1}}$ such that $x (i_{k_{l} + 1}) > x (i_{k_{l}}) + 2 u_{κ} + \overset{ˇ}{w}$ for every $l \in {1, \dots, L - 1}$ , and they satisfy $x (i_{k_{l}}) = \max {x (i) : i \in C_{l}}$ and $x (i_{k_{l} + 1}) = \min {x (i) : i \in C_{l + 1}}$ . The idea behind these definitions is to distinguish nodes whose weights are too far apart. In particular, we will use the following property:

for every l \in [​​​ [1, L] ​​​] and every j \in E (C_{1} \cup \dots \cup C_{l}), \sum_{i \in C_{1} \cup \dots \cup C_{l}} ν_{x, j} (i | B_{κ, n}) = 1 .

(22)

In other words, conditioned to $B_{κ, n}$ , an incoming item whose class belongs to $E (C_{1} \cup \dots \cup C_{l})$ necessarily matches with an item whose class belongs to $C_{1} \cup \dots, C_{l}$ . To see why this is true, take $j \in E (C_{1} \cup \dots \cup C_{l})$ . In particular, we have $j \in E (W)$ , so $\sum_{i \in W} ν_{x, j} (i | B_{κ, n}) = 1$ by hypothesis; hence, we are done if $l = L$ . The case $l < L$ follows because, for every $i \in C_{1} \cup \dots \cup C_{l}$ and every $i^{'} \in C_{l + 1} \cup \dots \cup C_{L}$ , we have

\begin{array}{l} {[x (i) + U_{(j, i)}]}^{+} + w_{(j, i)} = x (i) + U_{(j, i)} + w_{(j, i)} & > x (i^{'}) + U_{(j, i)} + 2 u_{κ} + \overset{ˇ}{w} + w_{(j, i)} \\ \geq {[x (i^{'}) + U_{(j, i^{'})}]}^{+} + w_{(j, i^{'})} . \end{array}

(23)

We will also use that

| | x | |_{C_{l}} - (2 u_{κ} + \overset{ˇ}{w}) | W | \leq x (i) \leq | | x | |_{C_{l}}

(24)

for any

l \in [​​​ [1, L] ​​​]

and any

i \in C_{l}

Step 2: Control of the number of stored items in each C_l. In this step, we reduce the problem by controlling the maximum and minimum number of stored items belonging to the same connected subsets. First observe that

\sum_{i \in W} x (i) [λ (i) - τ_{i} (x)] \leq \sum_{i \in W} x (i) [λ (i) - \sum_{j \in E (i)} λ (j) ν_{x, j} (i | B_{κ, n}) ℙ (B_{κ, n})]

(25)

= \sum_{i \in W} x (i) h_{i} (x)

(26)

= \sum_{l = 1}^{L} \sum_{i \in C_{l}} x (i) h_{i} (x),

the first identity being simply a definition of

h_{i} (x)

as the expression in square brackets. Using (24), we have

\begin{array}{l} \sum_{i \in C_{l}} x (i) h_{i} (x) = \sum_{i \in C_{l}}^{+} x (i) h_{i} (x) + \sum_{i \in C_{l}}^{-} x (i) h_{i} (x) & \leq | | x | |_{C_{l}} \sum_{i \in C_{l}}^{+} h_{i} (x) + [| | x | |_{C_{l}} - (2 u_{κ} + \overset{ˇ}{w}) | W |] \sum_{i \in C_{l}}^{-} h_{i} (x) \\ = | | x | |_{C_{l}} \sum_{i \in C_{l}} h_{i} (x) - (2 u_{κ} + \overset{ˇ}{w}) | W | \sum_{i \in C_{l}}^{-} h_{i} (x), \end{array}

(27)

where the superscripts + (respectively, –) in the sums mean that we are summing over the indexes is for which

h_{i} (x) \geq 0

(respectively,

h_{i} (x) < 0

). Hence, calling

H_{l} (x) = \sum_{i \in C_{l}} h_{i} (x)

, we have

\sum_{l = 1}^{L} \sum_{i \in C_{l}} x (i) h_{i} (x) \leq \sum_{l = 1}^{L} | | x | |_{C_{l}} H_{l} (x) - (2 u_{κ} + \overset{ˇ}{w}) | W | \sum_{l = 1}^{L} \sum_{i \in C_{l}}^{-} h_{i} (x) .

(28)

However, observe that

- h_{i} (x) = \sum_{j \in E (i)} λ (j) ν_{x, j} (i | B_{κ, n}) ℙ (B_{κ, n}) - λ (i) \leq \sum_{j \in E (i)} λ (j) ν_{x, j} (i | B_{κ, n}),

implying that

\begin{array}{l} - \sum_{l = 1}^{L} \sum_{i \in C_{l}}^{-} h_{i} (x) & \leq \sum_{l = 1}^{L} \sum_{i \in C_{l}}^{-} \sum_{j \in E (i)} λ (j) ν_{x, j} (i | B_{κ, n}) \\ \leq \sum_{l = 1}^{L} \sum_{i \in C_{l}} \sum_{j \in E (i)} λ (j) ν_{x, j} (i | B_{κ, n}) \\ = \sum_{i \in W} \sum_{j \in E (i)} λ (j) ν_{x, j} (i | B_{κ, n}) \\ = \sum_{j \in E (W)} λ (j) \sum_{i \in W \cap E (j)} ν_{x, j} (i | B_{κ, n}) \\ = \sum_{j \in E (W)} λ (j) \\ \leq λ (E (W)), \end{array}

the second identity following as in (11), the last identity following from hypothesis 1, and the last inequality from the fact that

W \subseteq S_{x}

is an independent set. An application of this bound to the right-hand side (r.h.s.) of (28) reduces the proof to proving that

\sum_{l = 1}^{L} | | x | |_{C_{l}} H_{l} (x) \leq (κ - η) | | x | |_{W},

and for this we use an iterative procedure.

Step 3: Iterative procedure. First, recalling (5), observe that

\begin{array}{l} H_{1} (x) & = \sum_{i \in C_{1}} λ (i) - ℙ (B_{κ, n}) \sum_{j \in E (C_{1})} λ (j) \sum_{i \in C_{1}} ν_{x, j} (i | B_{κ, n}) \\ = λ (C_{1}) - ℙ (B_{κ, n}) \sum_{j \in E (C_{1})} λ (j) \leq λ (C_{1}) - (1 - \frac{κ}{λ (V)}) λ (E (C_{1})) \leq κ - η . \end{array}

(29)

In the last inequality, we used hypothesis 2; in the second identity, we use Property (22). We are in the following alternative: if it holds that

H_{l} (x) \leq 0 \forall l \in {2, \dots, L},

(30)

we are done because, in view of (29),

\sum_{l = 1}^{L} | | x | |_{C_{l}} H_{l} (x) \leq | | x | |_{C_{1}} H_{1} (x) \leq (κ - η) | | x | |_{W}

(observe that

| | x | |_{C_{1}} = | | x | |_{W}

). If (30) does not hold, we define

L^{'} = \max {l \in [​​​ [1, L] ​​​] : H_{l} (x) > 0},

which, from (29), is necessarily strictly larger than one. Then we have that

\sum_{l = 1}^{L} | | x | |_{C_{l}} H_{l} (x) \leq \sum_{l = 1}^{L^{'}} | | x | |_{C_{l}} H_{l} (x) \leq \sum_{l = 1}^{L^{'} - 2} | | x | |_{C_{l}} H_{l} (x) + | | x | |_{C_{L^{'} - 1}} [H_{L^{'} - 1} (x) + H_{L^{'}} (x)],

(31)

where sums over empty sets are set to zero. If

H_{L^{'} - 1} (x) + H_{L^{'}} (x) \leq 0

, we bound the last quantity by

\sum_{l = 1}^{L^{'} - 2} | | x | |_{C_{l}} H_{l} (x)

and restart the procedure; that is, we look for

\max {l \in [​​​ [1, L^{'} - 2] ​​​] : H_{l} (x) > 0},

and so on. If

H_{L^{'} - 1} (x) + H_{L^{'}} (x) > 0

, we bound the r.h.s. of (31) by

\sum_{l = 1}^{L^{'} - 3} | | x | |_{C_{l}} H_{l} (x) + | | x | |_{C_{L^{'} - 2}} [H_{L^{'} - 2} (x) + H_{L^{'} - 1} (x) + H_{L^{'}} (x)],

separate between the cases

H_{L^{'} - 2} (x) + H_{L^{'} - 1} (x) + H_{L^{'}} (x) \leq 0 and H_{L^{'} - 2} (x) {+ H}_{L^{'} - 1} (x) {+ H}_{L^{'}} (x) > 0,

and proceed as previously shown. It only remains to address the case where all partial sums are positive; that is,

\sum_{l = l_{0}}^{L^{'}} H_{l} (x) > 0

for any

l_{0} \in [​​​ [2, L^{'} - 2] ​​​]

. In this case, the final step of the iteration gives

\sum_{l = 1}^{L^{'}} | | x | |_{C_{l}} H_{l} (x) \leq | | x | |_{C_{1}} \sum_{l = 1}^{L^{'}} H_{l} (x) .

(32)

However, as in (29), we have that

\begin{array}{l} \sum_{l = 1}^{L^{'}} H_{l} (x) & = \sum_{i \in C_{1} \cup \dots \cup C_{L^{'}}} λ (i) - ℙ (B_{κ, n}) \sum_{j \in E (C_{1} \cup \dots \cup C_{L^{'}})} λ (j) \sum_{i \in C_{1} \cup \dots \cup C_{L^{'}}} ν_{x, j} (i | B_{κ, n}) \\ = λ (C_{1} \cup \dots \cup C_{L^{'}}) - ℙ (B_{κ, n}) λ [E (C_{1} \cup \dots \cup C_{L^{'}})] \leq κ - η, \end{array}

where we are again using Property (22). Plugging this into (32), we get that

\sum_{l = 1}^{L^{'}} | | x | |_{C_{l}} H_{l} (x) \leq (κ - η) | | x | |_{W},

which concludes the proof. □

5.2. Proof of Theorem 1

Necessity of the condition Ncond can be shown similarly to proposition 2 in Mairesse and Moyal (2016): For any $I \in I (R^{c})$ and any $t \geq 0$ , all items of class in I that have departed the system before t have necessarily been matched with an element of E(I) arrived before t; thus, we have that

\begin{array}{l} X_{t} (I) ≔ \sum_{i \in I} X_{t} (i) & \geq \sum_{i \in I} X_{0} (i) - \sum_{i \in E (I)} X_{0} (i) + \sum_{i \in I} N {(i)}_{t} - \sum_{i \in E (I)} N {(i)}_{t} \\ \geq \sum_{i \in I} X_{0} (i) + M_{t}^{I} + t (λ (I) - λ (E (I))), \end{array}

where for all i,

N {(i)}_{t}

denotes the number of points of the process N(i) up to t, and M^I is a martingale with respect to the natural filtration of N. Therefore, it is routine to check that the above process tends a.s. to infinity as

t \to \infty

λ (I) - λ (E (I)) > 0

, and is at best null recurrent if

λ (I) - λ (E (I)) = 0

We now prove sufficiency. By the continuous-time version of the Foster-Lyapunov theorem (Meyn and Tweedie 1993), it suffices to show that

L f_{2} (x) \leq - 1

for every x outside a finite set. Take

κ = η / 2

. If

R = \emptyset

, Inequality (6) becomes

L f_{2} (x) \leq c_{1} - η | | x | |

(33)

for a constant c₁ that does not depend on x; this clearly implies the assertion. If

R \neq \emptyset

, Inequality (7) becomes

- 2 \hat{γ} | | x | |_{R}^{2} + c_{2} | | x | |_{R} + c_{3} - η | | x | |_{R^{c}} 𝟙 {| | x | |_{R^{c}} > | | x | |_{R} + c_{4}},

(34)

being

\hat{γ} = \min_{i \in R} γ (i)

. The r.h.s. of (34) is larger than

- 1

only when

| | x | |_{R}

and

| | x | |_{R^{c}}

are simultaneously small enough, a fact that occurs only for finitely many xs. □

5.3. Proof of Proposition 2

To expedite the proof, first observe that, for any G and γ, $λ \in {(R_{+}^{*})}^{V}$ is an element of $NCOND (G, γ)$ defined by (4) if and only if $μ_{λ}$ defined by (1) is an element of

{NCOND}^{'} (G, γ) = {μ \in P_{V}^{*} : μ (I) < μ (E (I)) for any I \in I (R^{c})},

being

P_{V}^{*}

the set of strictly positive probabilities defined on V; similarly, λ belongs to

NCOND (G)

defined by (8) if and only if

μ_{λ}

belongs to

{NCOND}^{'} (G) = {μ \in P_{V}^{*} : μ (I) < μ (E (I)) for any I \in I} .

First, if G is bipartite and $R = \emptyset$ , then theorem 1 of Mairesse and Moyal (2016) shows that the set $NCON D^{'} (G, γ) = NCON D^{'} (G)$ is empty. From the previous remark, so is the set $NCOND (G, γ) = NCOND (G)$ .

Regarding the converse, let us first assume that G is nonbipartite. Then, again from theorem 1 in Mairesse and Moyal (2016), the set $NCON D^{'} (G)$ is nonempty. However, plainly, for any γ, we have $NCON D^{'} (G) \subset NCON D^{'} (G, γ)$ because any element of $I (R)$ (if any) is also an element of $I$ . This shows that $NCON D^{'} (G, γ)$ is nonempty and therefore is $NCOND (G, γ)$ .

It only remains to consider the case where G is bipartite (in which case $NCON D^{'} (G)$ is empty again by theorem 1 of Mairesse and Moyal (2016)), and γ is such that $R \neq \emptyset$ . Let $V_{1} \cup V_{2}$ be a bipartition of V into two independent sets. Then, it follows from theorem 4.2 in Bušić et al. (2013) that there exists a probability measure $μ \in P_{V}^{*}$ such that

μ (V_{1}) = μ (V_{2}) = \frac{1}{2} and μ (I) < μ (E (I)) for all I \in I ∖ {V_{1}, V_{2}} .

(35)

Suppose that $R \cap V_{1} \neq \emptyset$ (the case where $R \cap V_{2} \neq \emptyset$ is symmetric). Our idea is to modify the measure μ by transferring a small enough amount of mass from a node of V₂ to a node of $R \cap V_{1}$ , so as to belong to $I (R^{c})$ . For this, we first set a node $i \in V_{2}$ and a positive real number ε as follows:

(i) If $V_{1} \subset R$ , i is arbitrary in V₂ and
$ε = \frac{1}{2} (μ (i) \land \min {1 - μ (j) : j \in R \cap V_{1}}) .$
(ii) Else, i is an arbitrary element of $E (R^{c} \cap V_{1})$ and we set
$ε = \frac{1}{3} (\min {μ (E (I)) - μ (I) : I \in I (R^{c} \cap V_{1})} \land μ (i) \land \min {1 - μ (j) : j \in R \cap V_{1}}),$ (36)
which is strictly positive because V₁ is not an element of $I (R^{c} \cap V_{1})$ .

We then construct the measure $\hat{μ}$ , as follows:

{\begin{array}{l} \hat{μ} (i) = μ (i) - ε; \\ \hat{μ} (j) = μ (j) + ε for some arbitrary j \in R \cap V_{1}; \\ \hat{μ} (k) = μ (k) for all other k \in V . \end{array}

Then, $\hat{μ}$ clearly is a probability measure on V, such that

μ (A) \leq \hat{μ} (A) \leq μ (A) + ε and μ (B) - ε \leq \hat{μ} (B) \leq μ (B), for all A \subset V_{1} and B \subset V_{2} .

(37)

We aim at showing that

\hat{μ} (I) < \hat{E} (I), for all I \in I (R^{c}) .

(38)

For this, first observe that because V₁ intersects with R it is not an independent set of $I (R^{c})$ . Second, if $V_{2} \in I (R^{c})$ (which is true if and only if $R \subset V_{1}$ ), we get

\hat{μ} (V_{2}) = μ (V_{2}) - ε = μ (V_{1}) - ε = \hat{μ} (V_{1}) - 2 ε = \hat{μ} (E (V_{2})) - 2 ε < \hat{μ} (E (V_{2})) .

Now take an independent set $I \in I (R^{c}) ∖ {V_{2}}$ such that $I \cap V_{1} = \emptyset$ . Then we have that

\hat{μ} (E (I)) - \hat{μ} (I) \geq μ (E (I)) - μ (I) > 0,

(39)

where the first inequality follows from (37) together with the fact that

E (I) \subset V_{1}, I \subset V_{2}

, and the second, from the facts that

I (R^{c}) \subset I, I \notin {V_{1}, V_{2}}

and (35).

Last, take an independent set $I \in I (R^{c}) ∖ {V_{2}}$ such that $I \cap V_{1} \neq \emptyset$ . (Such an element exists only if V₁ is not included in R, so we are in case (ii).) Then we have

\begin{array}{l} \hat{μ} (E (I)) - \hat{μ} (I) & = \hat{μ} (E (I) \cap V_{1}) + \hat{μ} (E (I) \cap V_{2}) - \hat{μ} (I \cap V_{2}) - \hat{μ} (I \cap V_{1}) \\ = \hat{μ} (E (I \cap V_{2})) - \hat{μ} (I \cap V_{2}) + \hat{μ} (E (I \cap V_{1})) - \hat{μ} (I \cap V_{1}) \\ \geq \hat{μ} (E (I \cap V_{1})) - \hat{μ} (I \cap V_{1}) \\ \geq μ (E (I \cap V_{1})) - ε - (μ (I \cap V_{1}) + ε) > 0, \end{array}

where in the second equality, we use the fact that the graph is bipartite, implying that

E (I) \cap V_{1} = E (I \cap V_{2})

and

E (I) \cap V_{2} = E (I \cap V_{1})

; the first inequality follows from (39) in the case where

I \cap V_{2} \neq \emptyset

, or else is an equality; and where we use (37) in the second inequality and (36) is the third one.

We have thus proven that $\hat{μ} (I) < \hat{E} (I)$ for any $I \in I (R^{c})$ . This implies that $\hat{μ}$ is an element of $NCON D^{'} (G, γ)$ and thereby an element of $NCOND (G, γ)$ that let us conclude. □

5.4. Proof of Proposition 3

Neglecting the reneging part, we get for all x,

L e^{α | | x | |_{2}} = \sum_{i \notin E (S_{x})} λ (i) (e^{α | | x + δ_{i} | |_{2}} - e^{α | | x | |_{2}}) + \sum_{i \in S_{x}} τ_{i} (x) (e^{α | | x - δ_{i} | |_{2}} - e^{α | | x | |_{2}})

(recall that

τ_{i} (x) = \sum_{j \in E (i)} λ (j) ν_{x, j} (i)

). Taking common factor

e^{α | | x | |_{2}}

, using that

e^{a} - 1 \leq a + a^{2}

| a |

is small enough, and that

{(| | x \pm δ_{i} | |_{2} - | | x | |_{2})}^{2} \leq 1

(this is simply a triangular inequality), last expression can be bounded by

\begin{array}{l} e^{α | | x | |_{2}} [\sum_{i \notin E (S_{x})} λ (i) [α (| | x + δ_{i} | |_{2} - | | x | |_{2}) + α^{2}] + \sum_{i \in S_{x}} (α (| | x - δ_{i} | |_{2} - | | x | |_{2}) + α^{2}) τ (i)] \\ = e^{α | | x | |_{2}} α [\sum_{i \notin E (S_{x})} λ (i) (| | x + δ_{i} | |_{2} - | | x | |_{2}) + \sum_{i \in S_{x}} (| | x - δ_{i} | |_{2} - | | x | |_{2}) τ (i) + α λ (V)] . \end{array}

From a Taylor development,

| | x \pm δ_{i} | |_{2} - | | x | |_{2} \leq \frac{\pm x (i)}{| | x | |_{2}} + \frac{1}{2 | | x | |_{2}} .

Using this inequality, we get

\begin{array}{l} \sum_{i \notin E (S_{x})} λ (i) (| | x + δ_{i} | |_{2} - | | x | |_{2}) + \sum_{i \in S_{x}} (| | x - δ_{i} | |_{2} - | | x | |_{2}) τ (i) \\ \leq \frac{1}{| | x | |_{2}} (\sum_{i \in S_{x}} x (i) [λ (i) - τ (i)] + \frac{1}{2} λ (V)) \\ \leq \frac{1}{| | x | |_{2}} (λ (V) (2 u_{κ} + \overset{ˇ}{w}) | V | + (κ - η) | | x | |_{\infty} + \frac{1}{2} λ (V)) \\ \leq \frac{λ (V) (2 u_{κ} + \overset{ˇ}{w}) | V |}{| | x | |_{2}} + (κ - η) \sqrt{| V |} + \frac{λ (V)}{2 | | x | |_{2}} \end{array}

for every

κ \in (0, λ (V))

. In the second inequality, we used (6). We have obtained

L e^{α | | x | |_{2}} \leq e^{α | | x | |_{2}} (\frac{α λ (V) (2 u_{κ} + \overset{ˇ}{w}) | V |}{| | x | |_{2}} + \frac{α λ (V)}{2 | | x | |_{2}} + α (κ - η) \sqrt{| V |} + α^{2} λ (V)) .

We can conclude by choosing $κ < η$ and α such that $α (κ - η) \sqrt{| V |} + α^{2} λ (V) < 0$ . □

5.5. Proof of Theorem 3

Under our assumptions, Inequality (6) takes the simpler form

L f_{2} (x) \leq λ (V) + 2 λ (V) (2 B + \overset{ˇ}{w}) | V | - 2 η | | x | | .

The upper bound in (9) follows after integrating with respect to π and using that

\int (L f) (x) π (d​ x) = 0 for every π -integrable f : X \to R .

We now turn to the lower bound of (9). For this, we introduce the following stochastic lower bound for X: Consider a fictional matching model such that, for any $i \in V$ , as soon as an item of a class $j \in E (i)$ enters the system, one class-i item leaves the system. Let $\tilde{X}$ be the $N^{| V |}$ -valued process counting the number of items of each class in the system at any time. It is easy to prove by an immediate coupling argument, that ${\tilde{X}}_{t} (i) \leq_{st} X_{t} (i)$ for all $i \in V$ , where “ $\leq_{st}$ ” stands for the strong stochastic ordering. Moreover, it is also immediate to observe that the ith marginal of $\tilde{X}$ is just a birth and death process with up rate $λ (i)$ and down rate $λ (E (i))$ in any state. Hence, the marginal of the stationary measure $\tilde{π}$ of $\tilde{X}$ satisfies

\int x (i) \tilde{π} (d​ x) = \frac{ρ_{i}}{1 - ρ_{i}}, for ρ_{i} = \frac{λ (i)}{λ (E (i))} \cdot

Then, we get that

\frac{ρ_{i}}{1 - ρ_{i}} \leq \int x (i) π (d​ x) \leq \int | | x | |_{\infty} π (d​ x),

which completes the proof. □

6. Conclusion

By proving that a quadratic function and its exponential are Lyapunov functions under the generalized N-COND condition, we showed that the generalized max-weight policy has very desirable properties: it is maximal stable even in the presence of noisy observations and has exponential convergence toward its stationary measure. In an ongoing work, we show using systematic and extensive simulations that it also compares very favorably to state-of-the-art matching policies both in terms of long-term variance and convergence.

Acknowledgments

We are grateful to the anonymous referees for several comments that helped us give a clear focus to the article.

References

Adan I, Weiss G (2012) Exact FCFS matching rates for two infinite multitype sequences. Oper. Res. 60(2):475–489.Link, Google Scholar
Adan I, Weiss G (2014) A skill based parallel service system under FCFS-ALIS—Steady state, overloads, and abandonments. Stochastic Systems 4(1):250–299.Link, Google Scholar
Adan I, Busic A, Mairesse J, Weiss G (2017) Reversibility and further properties of FCFS infinite bipartite matching. Math. Oper. Res. 43(2):598–621.Link, Google Scholar
Adan I, Kleiner I, Righter R, Weiss G (2018) FCFS parallel service systems and matching models. Performance Evaluation 127–128:253–272.Google Scholar
Boxma OJ, David I, Perry D, Stadje W (2011) A new look at organ transplantation models and double matching queues. Probability Engrg. Inform. Sci. 25(2):135–155.Google Scholar
Büke B, Chen H (2015) Stabilizing policies for probabilistic matching systems. Queueing Systems 80(1–2):35–69.Google Scholar
Büke B, Chen H (2017) Fluid and diffusion approximations of probabilistic matching systems. Queueing Systems 86(1–2):1–33.Google Scholar
Bušić A, Meyn S (2016) Approximate optimality with bounded regret in dynamic matching models. Preprint, submitted June 25, https://arxiv.org/abs/1411.1044.Google Scholar
Bušić A, Gupta V, Mairesse J (2013) Stability of the bipartite matching model. Adv. Appl. Probability 45(2):351–378.Google Scholar
Caldentey R, Kaplan E, Weiss G (2009) FCFS infinite bipartite matching of servers and customers. Adv. Appl. Probability 41(3):695–730.Google Scholar
Gurvich I, Ward A (2014) On the dynamic control of matching queues. Stochastic Systems 4(2):479–523.Link, Google Scholar
Hairer M (2014) Convergence of Markov Processes. Lecture Notes available in the web-page of the author, https://www.hairer.org/notes/Convergence.pdf.Google Scholar
Lu Y, Xie Q, Kliot G, Geller A, Larus JR, Greenberg A (2011) Join-Idle-Queue: A novel load balancing algorithm for dynamically scalable web services. Performance Evaluation 68(11):1056–1071.Google Scholar
Mairesse J, Moyal P (2016) Stability of the stochastic matching model. J. Appl. Probability 53(4):1064–1077.Google Scholar
Meyn SP, Tweedie RL (1993) Stability of Markovian processes. III. Foster-Lyapunov criteria for continuous-time processes. Adv. Appl. Probability 3(25):518–548.Google Scholar
Mitzenmacher M (1996) The power of two choices in randomized load balancing. PhD thesis, University of California, Berkeley, CA.Google Scholar
Moyal P, Perry O (2017) On the instability of matching queues. Ann. Appl. Probability 27(6):3385–3434.Google Scholar
Moyal P, Perry O (2021) Stability of parallel server systems. Oper. Res., ePub ahead of print August 16, https://doi.org/10.1287/opre.2021.2125.Google Scholar
Moyal P, Busic A, Mairesse J (2018) Loynes construction for the extended bipartite matching. Preprint, submitted March 7, https://arxiv.org/abs/1803.02788.Google Scholar
Moyal P, Busic A, Mairesse J (2020) A product form for the general stochastic matching model. Preprint, submitted February 5, https://arxiv.org/abs/1711.02620.Google Scholar
Nazari M, Stolyar AL (2019) A stochastic matching model on hypergraphs. Queueing Systems 91(1–2):143–170.Google Scholar
Rahmé Y, Moyal P (2019) Reward maximization in general dynamic matching systems. Preprint, submitted July 30, https://arxiv.org/abs/1907.12711.Google Scholar
Tassiulas L, Ephremides A (1992) Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks. IEEE Trans. Automated Control 37(12):1936–1948.Google Scholar

Volume 13, Issue 1

March 2023

Pages 1-180

Article Information

Metrics

Information

Received:July 14, 2021
Accepted:June 06, 2022
Published Online:August 01, 2022

Cite as

Matthieu Jonckheere, Pascal Moyal, Claudia Ramírez, Nahuel Soprano-Loto (2022) Generalized Max-Weight Policies in Stochastic Matching. Stochastic Systems 13(1):40-58.

https://doi.org/10.1287/stsy.2022.0098

Keywords

Acknowledgments

We are grateful to the anonymous referees for several comments that helped us give a clear focus to the article.

PDF download

Available Issues

Available Issues

Available Issues

Generalized Max-Weight Policies in Stochastic Matching

Abstract

1. Introduction

1.1. Motivation

1.2. Main Contributions

1.3. Outline of the Paper

2. Model

2.1. Notation

2.2. Stochastic Matching Model with Reneging

2.3. Markov Representation

3. Results

3.1. Stability

3.2. Speed of Convergence to Equilibrium

3.3. Moments Bounds at Equilibrium

4. Simulations

4.1. Comparison with Priority Policies

4.2. Bounds in the Stationary Regime

5. Proofs

5.1. Proof of Proposition 1

5.2. Proof of Theorem 1

5.3. Proof of Proposition 2

5.4. Proof of Proposition 3

5.5. Proof of Theorem 3

6. Conclusion

References

Volume 13, Issue 1

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News