Optimal Server Assignment in Queues with Flexible Servers and Abandonments

Published Online:https://doi.org/10.1287/stsy.2023.0109

Abstract

We study a Markovian system of two stations in tandem and two flexible servers who are capable of working at both stations. The novelty of our work is that jobs waiting in line to be processed by the second station may leave the system (without being processed at the second station) after an exponential amount of time. We show that the dynamic server assignment policy that maximizes the long-run average throughput assigns one server to each station unless station 2 is starved (i.e., has no work) or the number of jobs in the buffer reaches a certain threshold (both servers work together at station 1 when station 2 is starved and at station 2 when the threshold is reached). We specify the criterion for the assignment of the servers to the stations and characterize the threshold. Finally, we investigate how the threshold depends on the service and abandonment rates.

1. Introduction

We consider a Markovian system of two tandem stations, a finite buffer of size B between the stations, and two servers. Suppose there is an infinite supply of raw materials in front of the first station, infinite storage space after the second station, and the system is operating under manufacturing blocking. The servers are flexible in that they are crosstrained and allowed to switch between stations with negligible travel time. We assume that each server can work on only one job at a time and that if both servers are assigned to the same station, they work together on one job. The jobs waiting in line to be processed by the second station (i.e., after being processed by the first station) may abandon the system (without receiving service at the second station) after an exponential amount of time with rate θ>0. The service rate of server i = 1, 2 at station j = 1, 2 is denoted by μij. When two servers work together on a job, their service rates are assumed to be additive. The service requirements for the jobs at the two stations are independently and exponentially distributed, and without loss of generality, we assume the mean service requirement of each job at each station is one. For example, such a queueing system can be used to model a manufacturing system producing time-sensitive goods (such as food items, pharmaceutical products, etc.) (as an example, see Aazami and Saidi-Mehrabad 2021).

For the queueing system described, our objective is to determine the dynamic server assignment policy that maximizes the long-run average throughput. The trade-off between assigning servers to station 1 versus station 2 is that one may either risk losing jobs that have been already processed at station 1 or not take advantage of the servers’ abilities at the station that they are most effective at. By dynamic assignment, we mean that servers are allocated to stations according to the state of the underlying system. We use Π to denote the set of all dynamic server assignment policies and Dπ(t) to denote the number of departures from the tandem line (after completing service at station 2) under policy πΠ by time t0. Define gπ=lim suptE[Dπ(t)]/t as the long-run average throughput under policy π. Our objective is to solve the optimization problem

maxπΠgπ(1)
and identify the optimal throughput g* and the optimal policy π* such that gπ*=g*. For simplicity, set Σj=i=12μij for j{1,2}. For any j, Σj=0 would imply the throughput of the tandem line will always be zero and any server assignment policy is optimal. Similarly, for each server i{1,2},j=12μij=0 would imply the server will always be idle, and we can simply exclude the server from the system, in which case one can easily show that the expedite policy (that assigns all servers to the same job until completion) would be optimal. To avoid trivial cases, we assume Σj>0 for all j = 1, 2 and j=12μij>0 for all i = 1, 2.

The rest of the paper is organized as follows. In Section 2, we discuss the relevant literature. Section 3 describes the Markov decision process problem that will be used to determine the optimal assignment of the servers. In Section 4, we present our optimal server allocation policy. Finally, Section 5 discusses the properties of the optimal policy. Proofs of some technical results are given in the appendix.

2. Literature Review

Tandem queues with flexible server(s) have drawn a lot of attention in the queueing theory literature. Many authors have considered the dynamic allocation of flexible server(s) to the stations in order to maximize various performance measures. In the interest of space, rather than presenting the entire body of literature in this area, we will mention only some sample papers. In one of the earliest papers in the area, Iravani et al. (1997) considered a two-station tandem queue with a single flexible server. Andradóttir et al. (2001) is the first paper that focused on throughput maximization in tandem queues where the number of servers is equal to the number of stations and the servers are heterogenous. Ahn et al. (2002) considered minimizing the holding cost in a two-station and two-server setting with homogeneous servers. In a more recent paper, Işik et al. (2016) focused on tandem lines with equal numbers of servers and stations where servers do not collaborate. van Leeuwen and Núñez-Queija (2017) investigated a Markovian tandem queueing model, in which service to the first queue is provided in batches, but rather than determining the assignment of servers, they focused on choosing the batch sizes so as to minimize a linear cost function of the mean queue lengths.

On the other hand, queueing systems with customer abandonments arise in many applications, such as manufacturing systems, call centers, hospital emergency rooms, etc. As a result, queues with abandonments have been of interest for a long time (see, for example, Barrer 1957, Ancker and Gafarian 1963, and Baccelli et al. 1984 for earlier works and Jouini and Dallery 2007, Boxma et al. 2011, Brandt and Brandt 2013, Das et al. 2013, Fralix 2013, and Moyal 2013 for more recent results). However, abandonment is a difficult phenomenon to analyze exactly. Because of the interest in call centers, many authors have considered asymptotic analysis of parallel queues with abandonments (see, for example, Whitt 2006 and Dai and He 2013) as well as the determination of asymptotically optimal staffing decisions in parallel queues with abandonments (see, for example, Dai and Tezcan 2008, Atar et al. 2010, Puha and Ward 2019, and Long et al. 2020).

There are a handful of papers that focus on obtaining exact optimal server assignment policies in parallel queues with abandonments. Koole and Pot (2011) consider an inbound call center with a fixed reward per call and communication and agent costs. In the presence of abandonments, they maximize the profit by controlling the number of lines and the number of agents. Down et al. (2011) consider the optimal dynamic assignment of a single server in a two-class service system with abandonments. They formulate the problem as a continuous-time Markov decision process to obtain partial results. Ansari et al. (2019) study a multiclass queueing system with a single server and customer abandonment. They show that the optimal scheduling policy of the server (to minimize the long-run average customer abandonment cost) is a static priority policy. They then derive conditions under which the asymptotically optimal policy of Atar et al. (2010) is optimal in this setting. On the other hand, Batt and Terwiesch (2015) provide an empirical study of parallel queues with impatient customers in the context of an emergency department.

However, the literature on the analysis and control of tandem queues with customer abandonments is scarce. Toward this end, Ayhan (2022) draws attention to this void in the literature, identifies a class of tandem queueing systems with customers abandonments for which exact analysis and control could be possible, and lists some of the open problems in this area that might be of interest to the queueing community. Armony et al. (2017) use fluid and diffusion approximations to determine the allocation of nurses in a tandem queue with abandonments that arises in a healthcare setting. Kim et al. (2013) derive the Laplace–Stieltjes transform of the sojourn time distribution in a tandem queue with Markovian arrivals and customer abandonments that can model a call center with interactive voice response. Reed and Yechiali (2013) study a two-station tandem queue with deadlines and retrials (i.e., when the deadline of a job expires, it is fed back to the end of the line rather than abandoning) and derive its stability condition. To the best of our knowledge, there are only four papers that provide exact analysis of tandem queues with flexible servers and abandonments. We review these next.

Wang et al. (2019) consider a Markovian queuing system of two stations in tandem where the first station has infinite buffer capacity and the second station has finite buffer capacity. Customers can get impatient and leave while waiting for service at the first or second station. Furthermore, they can also balk if there is no place available in the buffer of the second station when they complete service at the first station. The authors develop a recursive algorithm to compute the throughput of such systems and then, use their methodology to compute the system throughput under various staffing strategies. Zayas-Cabán et al. (2016) consider a similar Markovian system but with infinite buffer capacity at both stations. The objective is to determine the dynamic allocation of multiple homogenous servers at the two stations in order to maximize the discounted infinite-horizon expected reward and long-run average reward. The main novelty of their paper is to analyze the continuous-time problem directly because uniformization is not possible because of infinite buffer sizes and abandonments. They provide some sufficient conditions that guarantee that simple priority policies are optimal. In a follow-up paper, Zayas-Cabán et al. (2019) consider emergency departments with two stages where the patients can abandon without being treated in the second stage. They introduce threshold policies that prioritize the second stage unless the number of patients in the first stage exceeds the threshold and provide heuristics on how to choose the threshold. In a recent paper, Yu et al. (2023) consider a two-stage service system with two types of servers, namely subordinates who perform the first-stage service and supervisors who have their own responsibilities in addition to collaborating with the subordinates on the second-stage service. Rewards are earned when first- or second-stage service is completed and when supervisors finish one of their own responsibilities. Costs are incurred when impatient customers abandon without completing the second-stage service. They determine how the supervisors should distribute their time between their joint work with the subordinates and their own responsibilities to maximize the long-run average reward.

We now explain the positioning of this work relative to other papers (i.e., Zayas-Cabán et al. 2016, 2019; Wang et al. 2019; Yu et al. 2023) that also provide exact analysis of tandem systems with flexible servers and abandonments. In particular, unlike Zayas-Cabán et al. (2016, 2019) and Wang et al. (2019), we completely characterize the server allocation policy that maximizes the long-run average throughput for the two-station and two-server tandem network with abandonments described in Section 1. Furthermore, we allow the servers to be heterogenous in that their service rates not only depend on the station but also, depend on the server. Finally, unlike Yu et al. (2023), we consider the optimal allocation of all heterogenous servers, whereas Yu et al. (2023) focuses on dynamic assignment of one server type in a different setting.

3. Problem Formulation

For all πΠ and t0, let Xπ(t) denote the number of jobs that have been processed at station 1 but are either waiting to be processed or being processed by station 2 at time t, where Xπ(t)S={0,1,,B+2}. From now on, we assume that Π is the set of all Markovian stationary deterministic policies corresponding to the state space S. For any given πΠ,{Xπ(t):t0} is clearly a birth-death process with state space S. Because B< and the sum of all transition rates out of each state sS is bounded by i=12max1j2μij+(B+2)θ<+, {Xπ(t)} is uniformizable with a finite uniformization constant q. Thus, the continuous-time optimization Problem (1) can be translated into a discrete-time Markov decision problem by uniformization (i.e., modeling the state transitions at the event times of a Poisson process with rate q) (see, for example, Andradóttir et al. 2001). Note that q must satisfy i=12max1j2μij+(B+2)θq<, and the chosen value of q has no impact on our results.

We next provide a formal description of the discrete-time Markov decision process problem. As mentioned, the state space is S={0,1,,B+2}, and we define the action space A{aσ1σ2:σi{0,1,2},i=1,2}, where σi=j{1,2} means that server i is assigned to station j and σi=0 means server i is idle.

We next specify for all sS and aA, r(s, a) and p(j|s,a), which are the immediate reward and the probability of going to state j in one step when action a is chosen in state s when action a is chosen in state s, which is. We have for all aA,

r(0,a)=0
and for all s=1,,B+2,
r(s,a00)=r(s,a10)=r(s,a01)=r(s,a11)=0.

Finally, for all s=1,,B+2,

r(s,a20)=r(s,a21)=μ12,r(s,a02)=r(s,a12)=μ22, andr(s,a22)=Σ2.

Furthermore,

p(j|s,a00)={sθqfor s=1,,B+2 and j=s1,1sθqfor s=0,,B+2 and j=s,0otherwise;
p(j|s,a10)={μ11qfor s=0,,B+1 and j=s+1,sθqfor s=1,,B+2 and j=s1,1μ11+sθqfor s=0,,B+1 and j=s,1sθqfor s=B+2 and j=s,0otherwise;
p(j|s,a20)={μ12+(s1)θqfor s=1,,B+2 and j=s1,1μ12+(s1)θqfor s=1,,B+2 and j=s,1for s=0 and j=s,0otherwise;
p(j|s,a12)={μ11qfor s=0,,B+1,j=s+1,μ22+(s1)θqfor s=1,,B+2 and j=s1,1μ11+μ22+(s1)θqfor s=1,,B+1 and j=s,1μ22+(s1)θqfor s=B+2 and j=s,1μ11qfor s=0 and j=s,0otherwise;
p(j|s,a11)={Σ1qfor s=0,,B+1,j=s+1,sθqfor s=1,,B+2 and j=s1,1Σ1+sθqfor s=0,,B+1 and j=s,1sθqfor s=B+2 and j=s,0otherwise;
and
p(j|s,a22)={Σ2+(s1)θqfor s=1,,B+2,j=s1,1Σ2+(s1)θqfor s=1,,B+2 and j=s,1for s=0 and j=s,0otherwise,;
where in the interest of space, we omitted p(j|s,a01),p(j|s,a02), and p(j|s,a21) because they are similar to p(j|s,a10),p(j|s,a20), and p(j|s,a12), respectively, with the roles of servers 1 and 2 switched.

Note that because of the abandonments, the Markov decision process problem is unichain, and hence, the long-run average reward optimality equations for all s=0,,B+2 are given as

maxaA{r(s,a)g+j=0B+2p(j|s,a)h(j)h(s)}=0
where g is the long-run average throughput and h is the bias vector (see sections 8.2.1 and 8.4.1 in Puterman 1994). Furthermore, we know from theorem 8.4.5 in Puterman (1994) that there exists an optimal Markovian stationary policy π*. Note that a Markovian deterministic decision rule d:SA specifies which action d(s)A to choose in each state sS. Thus, a stationary policy π can be defined using the corresponding decision rule d and will be denoted as π=d.

4. Optimal Server Assignment Policy

In this section, we completely characterize the dynamic server assignment policy that maximizes the long-run average throughput. Without loss of generality, we number the servers such that μ11μ22μ21μ12. From our assumptions in Section 1, this means that μ11>0 and μ22>0. For n=1,,B+2, define the decision rule dn as follows:

dn(s)={a11for s=0,a12for s=1,,n1,a22for s=n,,B+2.

Note that (d1) is the expedite policy, whereas (dB+2) is the optimal policy when there are no abandonments (as shown in Andradóttir et al. 2001). On the other hand, (dn), for 1<n<B+2, is equivalent to the optimal policy of systems without abandonments with a buffer size of n – 2.

We next state the main result of the paper, which completely characterizes the optimal server assignment policy. Note that the uniqueness of the optimal policy is subject to the interpretation that two actions could be equivalent if the service rate of a server at a particular station is zero.

Theorem 4.1.

When the servers are numbered such that μ11μ22μ21μ12, the policy (dN) is optimal with throughput g*=gN where N and gn (for n=1,,B+2) are defined in (8) and (2), respectively. Furthermore, this is the unique optimal policy in the class of Markovian stationary deterministic policies when τ(N)>0, where τ(n) (for n=1,,B+2) is defined in (4) and (5).

We next define the terminology used in the statement of Theorem 4.1 and provide several results that will be used in its proof. One can easily compute the long-run average throughput gn under policy (dn), for n=1,,B+2, as

gn=β(n)Δ(n),(2)
where
β(n)=[Σ2+(n1)θ]Σ1μ22α(n)+Σ1Σ2μ11n1 andΔ(n)=[Σ2+(n1)θ][f(1,n)+Σ1α(n)]+Σ1μ11n1,
with
α(n)=k=2nμ11k2f(k,n),f(k,n)=j=kn1[μ22+(j1)θ] for k=1,,n,
and the convention that summation over an empty set is zero and the multiplication over an empty set is one. We then characterize
g1g0=τ(1)Σ1+Σ2,
and for n=2,,B+2,
gngn1=β(n)Δ(n1)β(n1)Δ(n)Δ(n)Δ(n1)=Σ1μ11n2τ(n)Δ(n)Δ(n1),(3)
where g0=0,
τ(1)=Σ1Σ2>0,(4)
and for n=2,,B+2,
τ(n)=[Σ2+(n1)θ][Σ2+(n2)θ]μ22f(1,n1)[Σ2+(n1)θ][Σ1μ12α(n)+Σ2f(1,n)]+[Σ2+(n2)θ]μ11[Σ1μ12α(n1)+Σ2f(1,n1)].(5)

Thus, the sign of τ(n) is the same as the sign of gngn1 for 1nB+2. The next proposition provides a useful property of the τ(·) function.

Proposition 4.1.

For n{2,,B+2}, if τ(n)0, then τ(k)0 for all 1kn1. Similarly, if τ(n)0, then τ(k)0 for all n+1kB+2.

Proof.

For n2, we have

τ(n+1)=[μ22+(n1)θ]τ(n)μ12θε(n1),(6)
where for n1,
ε(n)=[f(1,n)+Σ1α(n)][n(n1)θ2+(2n1)μ22θ+μ11μ12]+Σ1μ222α(n)+f(1,n)[μ22θ+μ22Σ2+μ11μ22]+Σ1μ11n1[nθ+μ11+μ22].(7)

The relationship in (6) with some algebra follows from the equalities

f(k,n+1)=[μ22+(n1)θ]f(k,n)=[μ22+(n1)θ][μ22+(n2)θ]f(k,n1) andα(n+1)=[μ22+(n1)θ]α(n)+μ11n1=[μ22+(n1)θ][μ22+(n2)θ]α(n1)+[μ22+(n1)θ]μ11n2+μ11n1,
with n2 and k=1,,n1.

Because ε(n)>0, we have τ(n)0 if τ(n+1)0. Note that if n = 1, the assertion still holds because τ(1)=Σ1Σ2>0. Similarly, if τ(n)0, then τ(n+1)0. 

We then define

N=max{n:τ(n)0 for n=1,,B+2}.(8)

Note that Proposition 4.1 implies that gNgN1g1 and gN>gN+1gB+2. It also follows from (6) that N=B+2 when μ12=0. This makes sense because when μ12=0,a12=a22.

Before proving the theorem, we will present several properties of the bias vector hN under policy (dN). Let PdN be the (B+2)×(B+2)-dimensional transition probability matrix corresponding to the decision rule dN, and let rdN be the (B+2)-dimensional reward vector corresponding to dN, with rdN(s) denoting the reward earned in state s under decision rule dN, for all sS (see Section 3). Then,

rdN(s)={0for s=0,μ22for s=1,,N1,Σ2for s=N,,B+2,
and
PdN(s,s)={Σ1qfor s=0,s=1,μ11qfor s=1,,N1,s=s+1μ22+(s1)θqfor s=1,,N1,s=s1,Σ2+(s1)θqfor s=N,,B+2,s=s1,1Σ1qfor s=0,s=s,1μ11+μ22+(s1)θqfor s=1,,N1,s=s,1Σ2+(s1)θqfor s=N,,B+2,s=s,0otherwise.

Because the Markov decision process problem is unichain, we employ the unichain policy iteration algorithm (see section 8.6.1 in Puterman 1994) and find a scalar gN and a vector hN solving

rdNgNe+(PdNI)hN=0(9)
with hN(0)=0. Note that e is a column vector of ones and I is the identity matrix. Furthermore, gN is the unique scalar that satisfies (9), whereas hN is unique and equal to the true bias vector plus a constant multiple of the vector e. In the next two lemmas, we provide some properties of the bias vector hN that will be useful in the proof of Theorem 4.1. The proofs of these lemmas are given in the appendix.

Lemma 4.1.

We have

hN(1)=gNqΣ1,(10)
and for s=2,,N,
hN(s)hN(s1)=q[i=0s2gNμ22μ11i+1j=si1s2(μ22+jθ)+gNΣ1μ11s1f(1,s)],(11)
with the convention that product over an empty set is one and for s=N,,B+2,
hN(s)hN(s1)=qΣ2+(s1)θ[Σ2gN].(12)

The next lemma provides closed-form expressions for the differences in (11) and (12). Note that because hN(0)=0, (10), (13), and (14) show that, for all s=1,,B+2,hN(s)hN(s1)0.

Lemma 4.2.

For s=2,,N,

hN(s)hN(s1)=qΔ(N)[μ11Nsμ12Σ1α(s)+[Σ2+(N1)θ]μ22k=s+1Nμ11ks1f(k,N)f(1,s)+μ11NsΣ2f(1,s)],(13)
with the convention that summation over an empty set is zero and for s=N,,B+2,
hN(s)hN(s1)=q[Σ2+(N1)θ]Σ1μ12α(N)+Σ2f(1,N)[Σ2+(s1)θ]Δ(N).(14)

Proof of Theorem 4.1.

We use policy iteration for unichain models (see section 8.6.1 in Puterman 1994) to prove the optimality of (dN). In particular, we set the initial decision rule ν0 of the policy iteration algorithm equal to dN and then, show that the decision rule ν1 computed as

ν1(s)argmaxaAs{r(s,a)+jSp(j|s,a)hN(j)},sS,
is equal to dN(s) for all sS. Toward this end, for all sS and aAs, we will compute the differences
Γ(s,a)=r(s,dN(s))+jSp(j|s,dN(s))hN(j)(r(s,a)+jSp(j|s,a)hN(j))
and show that the differences are nonnegative.

For s = 0, we have dN(s)=a11. Then, (10) yields

Γ(0,a12)=Σ1qhN(1)μ11qhN(1)=μ21qhN(1)=μ21Σ1gN0.

Note that Γ(0,a12)=0 if and only if μ21=0, in which case actions a11 and a12 are the same. Similarly, for a=a21,

Γ(0,a21)=Σ1qhN(1)μ21qhN(1)=μ11qhN(1)=μ11Σ1gN>0.

Finally, when a=a22,Γ(0,a22)=Σ1qhN(1)=gN>0. Because in state s = 0, a12=a10,a21=a01, and a22=a02=a20=a00, we have shown that for all aA{a11},Γ(0,a)0. In the interest of space, for the rest of the proof, we do not specify the differences for idling actions that are all nonnegative (in particular, the differences are strictly positive as long as the actions are not equivalent).

Next, we consider s=1,,N1 for which dN(s)=a12. For a=a22,

Γ(s,a22)=μ22Σ2+Σ2μ22q[hN(s)hN(s1)]+μ11q[hN(s+1)hN(s)]=μ22Σ2+Σ2μ22q[hN(s)hN(s1)]+gNμ22+μ22+(s1)θq[hN(s)hN(s1)]=gNΣ2+Σ2+(s1)θq[hN(s)hN(s1)],
where the second equality follows from (A.3). Now, (11) yields
Γ(s,a22)=gN(1+Σ2+(s1)θΣ1μ11s1f(1,s)+[Σ2+(s1)θ]i=0s21μ11i+1j=s1is2(μ22+jθ))(Σ2+μ22[Σ2+(s1)θ]i=0s21μ11i+1j=s1is2(μ22+jθ))=gN(Σ1μ11s1+[Σ2+(s1)θ][f(1,s)+Σ1α(s)]Σ1μ11s1)Σ2Σ1μ11s1+[Σ2+(s1)θ]Σ1μ22α(s)Σ1μ11s1gs+1(Σ1μ11s1+[Σ2+(s1)θ][f(1,s)+Σ1α(s)]Σ1μ11s1)Σ2Σ1μ11s1+[Σ2+(s1)θ]Σ1μ22α(s)Σ1μ11s1=β(s+1)Δ(s+1)×Σ1μ11s1+[Σ2+(s1)θ][f(1,s)+Σ1α(s)]Σ1μ11s1Σ2Σ1μ11s1+[Σ2+(s1)θ]Σ1μ22α(s)Σ1μ11s1×Δ(s+1)Δ(s+1)=β(s+1)Δ(s)β(s)Δ(s+1)Σ1μ11s1Δ(s+1)=Σ1μ11s1τ(s+1)Σ1μ11s1Δ(s+1)=τ(s+1)Δ(s+1)0,
where the second equality follows from a change of variables with some algebra, the first inequality follows from the definition of N and Proposition 4.1, the second to last equality follows from (3), and the second inequality follows the definition of N and Proposition 4.1. Note that it follows from (6) that τ(s+1)=0 if and only if τ(N)=0 and s=N1. We next consider a=a21 for which
Γ(s,a21)=μ22Σ2+Σ2μ22q[hN(s)hN(s1)]+μ11q[hN(s+1)hN(s)]+μ22+Σ1μ11q[hN(s)hN(s+1)]+μ22q[hN(s1)hN(s)]=Γ(s,a22)+Γ(s,a21),
where we denote the expression in the second line as Γ(s,a21). Because Γ(s,a22)0, it suffices to show that Γ(s,a21)0. From (A.3), we have
Γ(s,a21)=Σ1μ22+(μ11Σ1)gNμ11+Σ1μ22+(μ11Σ1)(s1)θqμ11[hN(s)hN(s1)]=Σ1μ22+(μ11Σ1)gNμ11+Σ1μ22+(μ11Σ1)(s1)θΔ(N)μ11[μ11Ns(Σ2μ22)Σ1α(s)+[Σ2+(N1)θ]μ22k=s+1Nμ11ks1f(k,N)f(1,s)+μ11NsΣ2f(1,s)],
where the second equality follows from (13). Plugging in the closed-form Expression (2) of gN and combining the terms with similar coefficients, with some algebra, we can write μ11Δ(N)Γ(s,a21) as the following sum:
(Σ1μ22+μ11Σ2Σ1Σ2)(Σ1μ11N1+(s1)θμ11NsΣ1α(s))+(15)
μ22[Σ2+(N1)θ]Σ1k=s+1Nμ11ks1[f(ks,N)f(k,N)f(1,s+1)]+(16)
μ22[Σ2+(N1)θ]μ11(s1)θk=s+1Nf(k,N)f(1,s)+(17)
μ11NsΣ1μ22[Σ2μ11k=2sμ11k2f(k+Ns,N)+Σ1(μ22Σ2)α(s)]+(18)
μ11Ns+1Σ1μ22[(N1)θk=2sμ11k2f(k+Ns,N)(s1)θα(s)]+(19)
μ11Nsf(1,s)θ[Σ2(μ11Σ1)(s1)+Σ1μ22(N1)]+(20)
μ22Σ1Σ2μ11N1[f(N+1s,N)f(1,s)].(21)

In order to prove that Γ(s,a21)0, we will show that the expressions in (15)–(21) are nonnegative. Note that

Σ1μ22+μ11Σ2Σ1Σ2=μ11μ22μ21μ120,(22)
which implies that (15) is nonnegative. Similarly, for s+1kN, we have
f(ks,N)f(1,s+1)f(k,N)=f(ks,k)f(k,N)f(1,s+1)f(k,N)0,
where the inequality follows from the fact f(ks,k)f(1,s+1). Thus, the expression in (16) is nonnegative. Clearly, (17) is nonnegative. Note that
Σ2μ11k=2sμ11k2f(k+Ns,N)+Σ1(μ22Σ2)α(s)Σ2μ11k=2sμ11k2f(k,s)+Σ1(μ22Σ2)α(s)=Σ2μ11α(s)+Σ1(μ22Σ2)α(s)=(Σ1μ22+μ11Σ2Σ1Σ2)α(s)0,
where the last inequality follows from (22). This shows that (18) is nonnegative. Similarly, the nonnegativity of (19) follows because k=2sμ11k2f(k+Ns,N)α(s) for sN1. Note that
Σ2(μ11Σ1)(s1)+Σ1μ22(N1)(s1)[Σ2(μ11Σ1)+Σ1μ22]0,(23)
where the last inequality follows from (22), which implies that (20) is nonnegative. Finally, f(N+1s,N)f(1,s)0 for s=1,,N1, and hence, (21) is also nonnegative. This shows that Γ(s,a21)>0. Note that we cannot have Γ(s,a21)=0 because this would require s = N in (23), which is not possible because we are considering the case s=1,,N1. Thus, Γ(s,a21)=Γ(s,a22)+Γ(s,a21)>0. We next consider Γ(s,a11) for s=1,,N1. We have
Γ(s,a11)=μ22+Σ1μ11q[hN(s)hN(s+1)]+μ22θq[hN(s1)hN(s))]=Γ(s,a12)+θq[hN(s)hN(s1)]>0,
where the inequality follows because Γ(s,a12)>0, and we have from Lemma 4.1 that θq[hN(s)hN(s1)]>0. Thus, we have for all aA and s=1,,N1,Γ(s,a12)0.

Next, we consider s=N,,B+2 for which dN(s)=a22. For a=a12,

Γ(s,a12)=μ12+μ12q[hN(s1)hN(s)]+μ11q[hN(s)hN(s+1)]=μ12+μ12Σ2+(s1)θ(gNΣ2)+μ11Σ2+sθ(gNΣ2),
where the second equality follows from (12). In particular, for s = N,
Γ(N,a12)=μ12+μ12Σ2+(N1)θ(gNΣ2)+μ11Σ2+Nθ(gNΣ2)=τ(N+1)(Σ2+Nθ)Δ(N)>0,
where the second equality follows because
gNΣ2=[Σ2+(N1)θ](Σ2f(1,N)+Σ1μ12α(N))Δ(N)(24)
and the inequality follows from the definition of N. Furthermore, for s=N,,B,
Γ(s+1,a12)Γ(s,a12)=(gNΣ2)[μ12(1Σ2+sθ1Σ2+(s1)θ)+μ11(1Σ2+(s+1)θ1Σ2+sθ)]>0,
where the inequality follows because gNΣ2<0 as shown in (24). Similarly,
Γ(B+2,a12)Γ(B+1,a12)=(gNΣ2)[μ12(1Σ2+(B+1)θ1Σ2+Bθ)μ11Σ2+(B+1)θ]>0.

Hence, Γ(s,a12)>0 for all s=N,,B+2. Next, we consider Γ(s,a21) for s=N,,B+2. We have for s=N,,B+1,

Γ(s,a21)=μ22+μ22q[hN(s1)hN(s)]+μ21q[hN(s)hN(s+1)]=μ22+μ22Σ2+(s1)θ(gNΣ2)+μ21Σ2+sθ(gNΣ2).

Thus,

Γ(N,a21)=μ22+μ22Σ2+(N1)θ(gNΣ2)+μ21Σ2+Nθ(gNΣ2)=(Σ2+Nθ)(N1)μ22θf(1,N)+Σ1μ22θ((Σ2+Nθ)(N2)α(N)+f(1,N))(Σ2+Nθ)Δ(N)++(Σ1μ22+μ11Σ2Σ1Σ2)[Σ2+(N1)θ]f(1,N)+Σ1α(N)(Σ2+Nθ)Δ(N)>0,
where the inequality follows from (22). Furthermore,
Γ(s+1,a21)Γ(s,a21)=(gNΣ2)[μ21(1Σ2+(s+1)θ1Σ2+sθ)+μ22(1Σ2+sθ1Σ2+(s1)θ)]>0,
where the inequality follows because gNΣ2<0. Note that
Γ(B+2,a21)=μ22+μ22q[hN(B+1)hN(B+2)]=μ22+μ22Σ2+(B+1)θ(gNΣ2)=μ22(gN+(B+1)θ)Σ2+(B+1)θ>0.

Thus, Γ(s,a21)>0 for all s=N,,B+2. Finally, for s=N,,B+2,

Γ(s,a11)=Σ2+Σ2q[hN(s1)hN(s)]+Σ1q[hN(s)hN(s+1)]+θq[hN(s)hN(s1)]=Γ(s,a12)+Γ(s,a21)+θq[hN(s)hN(s1)])>0.

This proves the optimality of (dN) (see theorem 8.6.6 in Puterman 1994). Because Γ(s,a)>0 for all s=0,,B+2, aA{dN(s)} when τ(N)>0, and a and dN(s) are not equivalent, the uniqueness of the optimal policy follows from proposition 8.5.10 in Puterman 1994. 

5. Properties of the Optimal Policy

Theorem 4.1 demonstrates that as in the optimal server assignment policy without abandonments (characterized in Andradóttir et al. 2001), the primary assignment of the servers to the stations is determined by maximizing the product of the service rates. Furthermore, as in Andradóttir et al. (2001), the servers work together at station 1 only when there is no work to be done at station 2. However, unlike the optimal policy without abandonments, servers work together at station 2 as long as there are N or more customers in the system where NB+2. Thus, the optimal policy does not necessarily use the entire buffer space. Hence, Theorem 4.1 suggests that there is a trade-off between using servers where they are effective (i.e., by letting N be large) and avoiding abandonments (i.e., by choosing N small). Theorem 4.1 also indicates that in the presence of abandonments, the buffer allocation should be done accordingly as one can incur unnecessary costs by assigning buffer space, which will not be needed. We now identify conditions under which n = 1 (so that the expedite policy is optimal).

Remark 5.1.

It is easy to see that n = 1 if and only if μ12>0 and θ>Σ2(μ11μ22μ21μ12)Σ1μ12. This follows because τ(1)>0, and from (5), we have

τ(2)=Σ2(μ11μ22μ21μ12)θΣ1μ12,(25)
which is negative when μ12>0 and θ>Σ2(μ11μ22μ21μ12)Σ1μ12. It then immediately follows that n = 1 when μ11μ22=μ21μ12 (because μ11μ22>0 under our assumptions).

It follows from Remark 5.1 and Theorem 4.1 that (d1) (i.e., the expedite policy) is the unique optimal policy when μ11μ22=μ21μ12 for any θ>0. Thus, when there are abandonments, the optimal policy utilizes no buffer space. This is different from the case when θ = 0. Indeed, we know from Andradóttir et al. (2001) that when there are no abandonments and μ11μ22=μ21μ12,(d1) is an optimal policy, but it is not unique. In fact, in this case, (dB+2) is also optimal. Continuing in this vein, the next corollary (which follows from Remark 5.1 and Theorem 4.1) shows that (d1) (i.e., the expedite policy) is the unique optimal policy when the servers are generalists (i.e., μij=μiγj for i = 1, 2 and j = 1, 2; observe that τ(1) is always positive).

Corollary 5.1.

Suppose that μij=μiγj for i = 1, 2 and j = 1, 2. Then, (d1) is the unique optimal policy.

Note that when the servers are generalists, they are equally effective at both stations. As a result, there is no trade-off between using the servers at stations that they are effective at versus avoiding abandonments. Thus, the optimal policy avoids abandonments completely and utilizes no buffer space.

The next proposition implies that for large-enough B, there exists N<B+2 as long as μ12>0.

Proposition 5.1.

We have for θ>0 and μ12>0,

limnτ(n)=.

Proof.

Note that combining similar terms, with some algebra the expression of τ(n) given in (5) can be written as

τ(n)=[Σ2+(n1)θ][μ12μ22f(1,n1)+μ22f(1,n)Σ1μ12α(n)Σ2f(1,n)]+[Σ2+(n2)θ]μ11[Σ1μ12α(n1)+Σ2f(1,n1)]=[Σ2+(n1)θ][μ12f(1,n1)(n2)θ+Σ1μ12α(n)]+[Σ2+(n2)θ]μ11[Σ1μ12α(n1)+Σ2f(1,n1)]=[Σ2+(n2)θ][Σ1μ12(k=2n1μ11k1[f(k,n1)f(k+1,n)]f(2,n))f(1,n1)((n2)θμ12μ11Σ2)]μ12θ[Σ1α(n)+(n2)θf(1,n1)],(26)
for n2. The result then follows because f(k,n1)<f(k+1,n) for k=2,,n1 and both (n2)θμ12 and f(2, n) tend to infinity as n gets large. 

Remark 5.2.

Note that N is nondecreasing in the buffer size B. This immediately follows from the fact that τ(n) does not depend on B (see Equations (4) and (5)). Furthermore, we have from Proposition 5.1 that if μ12>0, then N will remain the same after some finite B.

In the next proposition, we characterize how N changes with respect to the abandonment rate θ and the four service rates μ22,μ11,μ12, and μ21. In what follows, increasing means nondecreasing, and decreasing means nonincreasing.

Proposition 5.2.

We have

  1. N is decreasing in θ,

  2. N is increasing in μ22,

  3. N is either increasing or is first increasing and then decreasing in μ11,

  4. N is decreasing in μ12, and

  5. N is decreasing in μ21.

Proof.

First assume that μ12=0. Then, N=B+2, and (i)–(iii) and (v) hold trivially. Moreover, (iv) also holds because N cannot be larger than B + 2. For the rest of the proof, we assume that μ12>0. Furthermore, with a slight abuse of notation, in this proof, we will have gn, τ, and ε not only as functions of n but also, the corresponding rate considered in each case.

  1. We have from Remark 5.1 that n = 1 if μ11μ22=μ21μ12. Thus, the result holds for this case, and we assume that μ11μ22μ21μ12>0 for the rest of the proof. Note that it follows from (6) and (25) that, for n2,

    τ(n,0)=μ22n2τ(2,0)=μ22n2Σ2(μ11μ22μ21μ12)>0.(27)

    Furthermore, it follows from (6), (7), and (25) that limθτ(n,θ)= (because the largest power θn1 has a negative coefficient). Thus, for each n2,τ(n,θ) has a positive root. We will show that τ(n,θ) has a single positive root θ(n).

    We start by showing that gn is nonincreasing in θ for all n1. Note that g1 does not depend on θ and hence, is constant in θ. Now, assume that systems I and II are both operating under (dn) for n2 but that system I has a higher abandonment rate θ than system II. Consider sample paths of both systems starting in the same state and generated using (the same) independent Poisson processes with rates μ11+μ21,μ12+μ22,(n1)θ, and thinning. As time evolves, the number of customers in system II will be always greater than or equal to the number of customers in system I because of the higher abandonment rate of system I. In particular, when both systems are in the same state, every abandonment from (service completion at station 1 in) system II is associated with a simultaneous abandonment from (service completion at station 1 in) system I. Moreover, every service completion at station 2 in system I is accompanied by a simultaneous service completion at station 2 in system II. This shows that more customers complete service at station 2 in system II and that gn is nonincreasing in θ.

    Now, suppose that τ(n,θ) has two roots 0<θ1(n)<θ2(n). Then, τ(n,θ1(n))=τ(n,θ2(n))=0, which implies that gn(θ1(n))=gn1(θ1(n)) and gn(θ2(n))=gn1(θ2(n)). Again, consider two systems but now, with arbitrary abandonment rate θ: system I operating under (dn) and system II operating under (dn1). Assume that both systems start at an arbitrary state sn1, and consider their sample paths generated using (the same) independent Poisson processes with rates μ11+μ21,μ12+μ22,(n1)θ, and thinning. Then, both systems will evolve the same until state n – 1 is reached. If there is an abandonment in state n – 1, both systems experience the abandonment, but it is possible that in state n – 1, system I (but not system II) can transition to state n because of a service completion at station 1 or system II (but not necessarily system I) will transition to state n – 2 because of a service completion at station 2. If this happens, then from this point onward, the number of customers in system I will dominate the number of customers in system II. Because the number of customers in system I is always greater than or equal to the number of customers in system II, all abandonments in system II will be experienced in system I as well. However, when the two systems are not in the same state, there may be abandonments in system I that do not occur in system II. Some of these additional abandonments in system I may be replaced by additional service completions at station 1 in system I but not in system II (if both systems are in state n ‒ 1); others may not be replaced. Because all service completions at station 1 lead to either abandonments or service completions at station 2, this implies that as θ increases, gn(θ) decreases at least as rapidly as gn1(θ). So, if gn(θ1(n))=gn1(θ1(n)), then gn(θ2(n))gn1(θ2(n)). Furthermore, the inequality must be strict because otherwise, we would have gn(θ)=gn1(θ) for all θ1(n)θθ2(n), which would imply that τ(n,θ)=0 for all θ1(n)θθ2(n). This is not possible because τ(n,θ) can have at most n – 1 roots as it is a polynomial of order n – 1 in θ. Hence, there is a single θ(n)>0 such that τ(n,θ(n))=0.

    We then have from (6) and (7) that

    τ(n+1,θ(n))=[μ22+(n1)θ(n)]τ(n,θ(n))μ12θ(n)ε(n1,θ(n))<0,
    which together with (27), implies that θ(n+1)<θ(n). Thus,
    N={B+2if θθ(B+2),n{2,,B+1}if θ(n+1)<θθ(n),1for θ>θ(2),
    and the result follows.

  2. The proof is similar to the proof of part (i) and is given in the appendix.

  3. Note that μ11μ21μ12μ22, and it immediately follows from (6) and (7) that if for some μ11μ21μ12μ22,τ(n,μ11)0, and then, τ(n+1,μ11)<0. From (25), observe that

    τ(2,μ11)=μ11(Σ2μ22θμ12)μ21μ12(Σ2+θ).(28)

    Then, n = 1 if Σ2μ22θμ12<0, n = 1 if μ21>0 and Σ2μ22θμ12=0, and n = 2 if μ21=0 and Σ2μ22θμ12=0. Thus, the result holds when Σ2μ22θμ120.

    Now, suppose that Σ2μ22θμ12>0. It follows from (6), (7), and (25) that τ(n,μ21μ12μ22)<0 for all n2 because when μ11=μ21μ12μ22, we have τ(2,μ21μ12μ22)=θΣ1μ12<0 (because μ12>0). Furthermore, limμ11τ(2,μ11)= (from (28)) and limμ11τ(n,μ11)= for all n3 (because (6), (7), and (28) imply that the highest-power (μ11)n1 has a negative coefficient). It follows from (25) and (28) that τ(2,μ11) has a single root μ11(2) greater than μ21μ12μ22. Finally, one can immediately see from (26) that for n3,2τ(n,μ11)(μ11)2<0, and hence, τ(n,μ11) is a concave function of μ11. We can then conclude that for n3,τ(n,μ11) has two roots, a single root, or no roots. Note that it follows from (6) and (7) that if τ(n,μ11) has a single root or no roots, then τ(n+1,μ11) has no roots. First, assume that there exists 3nB+2 such that τ(n,μ11) has two roots, and let n* be the largest such n. Then, τ(n,μ11) for 3n<n* must all have two roots. We consider two cases.

    1. n*B+1, and τ(n*+1,μ11) has a single root. Let μ111(n) and μ112(n) be the two roots of τ(n,μ11) for 3nn* and μ11(n*+1) be the root of τ(n*+1,μ11). Then, these roots should satisfy μ11(2)<μ111(3)<μ111(4)<<μ111(n*)<μ11(n*+1)<μ112(n*)<μ112(n*1)<<μ112(3) because from (6) and (7), we have τ(n,μ11)0, which implies that τ(n+1,μ11)<0 and τ(n,μ11) is concave in μ11. We will have n = 1 if μ11<μ11(2) because τ(n,μ11)<0 for all n2; n = 2 if μ11(2)μ11<μ111(3) because τ(n,μ11)<0 for all n3; n = 3 if μ111(3)μ11<μ111(4) because τ(n,μ11)<0 for all n4; ; N=n* if μ111(n*)μ11<μ11(n*+1) because τ(n,μ11)<0 for all nn*+1; N=n*+1 if μ11=μ11(n*+1) because τ(n,μ11)<0 for all nn*+1; N=n* if μ11(n*+1)<μ11μ112(n*) because τ(n,μ11)<0 for all nn*+1; ; n = 3 if μ112(4)<μ11μ112(3) because τ(n,μ11)<0 for all n4; and n = 2 if μ11>μ112(3) because τ(n,μ11)<0 for all n3. Thus, in this case, N is first nondecreasing and then nonincreasing in μ11.

    2. n*=B+2, or τ(n*+1,μ11) has no roots. Using the argument in (a), one can conclude that N will be first nondecreasing and then nonincreasing in μ11 but in this case, will never attain n*+1.

      On the other hand, if there is no 3nB+2 such that τ(n,μ11) has two roots, we also consider two cases.

    3. τ(3,μ11) has one root. Then, we will have μ11(2)<μ11(3), and n = 1 if μ11<μ11(2); n = 2 if μ11(2)μ11<μ11(3); n = 3 if μ11=μ11(3); and n = 2 if μ11>μ11(3). Thus, N is again first nondecreasing and then nonincreasing in μ11.

    4. τ(3,n) has no roots. Then, n = 1 if μ11<μ11(2) and n = 2 if μ11>μ11(2). Hence, in this case, N is nondecreasing in μ11.

    This concludes the proof that when Σ2μ22θμ12>0, N is either nondecreasing or is first nondecreasing and then nonincreasing in μ11.

  4. Proofs are given in the appendix. 

As expected, Proposition 5.2(i) shows that as the abandonment rate θ increases, the optimal policy utilizes smaller buffer space in order to avoid abandonments. On the other hand, we have from Proposition 5.2(ii) that as the service rate μ22 of the server at the second station increases, the optimal policy uses a larger buffer space and takes advantage of the effectiveness of the server at station 2. Moreover, Proposition 5.2(iii) indicates that when μ11 is sufficiently large, the jobs accumulate quickly in the buffer, and the optimal policy uses a smaller buffer capacity as the service rate μ11 at the first station increases. However, for small μ11, it is possible that the optimal policy may utilize the specialty of the servers and that the optimal (utilized) buffer capacity may increase as μ11 increases. As an example, consider a system with μ12=1,μ21=1,μ22=8, and θ = 4. It is easy to verify that when μ11=3, n = 4; when μ11=4, n = 5; and when μ11=30, n = 4. Thus, N first increases and then decreases as μ11 increases. Proposition 5.2(iv) shows that the optimal (utilized) buffer capacity decreases as the service rate μ12 of the first server at the second station increases (provided that it is less than or equal to μ11μ22μ21). Similarly, Proposition 5.2(v) indicates that if the service rate μ21 of the second server at the first station increases (provided that it is less than or equal to μ11μ22μ12), then the optimal policy uses a smaller buffer space. These results are reasonable because as the service rates μ12, μ21 increase, the servers become less specialized, and hence, the optimal policy shifts emphasis from utilizing server specialty toward reducing abandonments.

Appendix

Proof of Lemma 4.1.

In component notation, the equations in (9) can be written as

gNΣ1qhN(1)=0,(A.1)
gN+μ11+μ22qhN(1)μ11qhN(2)=μ22,(A.2)
gNμ22+(s1)θqhN(s1)+μ11+μ22+(s1)θqhN(s)μ11qhN(s+1)=μ22,(A.3)
for s=2,,N1,
gNΣ2+(s1)θqhN(s1)+Σ2+(s1)θqhN(s)=Σ2,(A.4)
for s=N,,B+2.

The equality in (10) follows immediately from (A.1). We use induction to prove the equality in (11). Note that for s = 2, the result follows immediately from (A.2) and (10). Now, assume that (11) holds for k=2,,s, and we will show that it holds for k=s+1. We have from (A.3) that

hN(s+1)hN(s)=qμ11[gNμ22+μ22+(s1)θq[hN(s)hN(s1)]]=qμ11[gNμ22+1μ11[i=0s2gNμ22μ11ij=si1s1(μ22+jθ)+(μ22+(s1)θ)gNΣ1μ11s2f(1,s)]]=q[i=0s1gNμ22μ11i+1j=sis1(μ22+jθ)+gNΣ1μ11sf(1,s+1)],
where the second equality follows from the induction hypothesis and the third equality follows from the fact that f(1,s+1)=f(1,s)(μ22+(s1)θ). Thus, the result (11) holds. The equality in (12) follows immediately from (A.4). 

Proof of Lemma 4.2.

In order to prove (13), we plug in Expression (2) for gN in (11). Note that with some simple algebra, we have

gNμ22=[Σ2+(N1)θ]μ22f(1,N)+μ11N1Σ1μ12Δ(N)
and
gNΣ1=[Σ2+(N1)θ]μ22k=2Nμ11k2f(k,N)+μ11N1Σ2Δ(N).

Then,

i=0s2gNμ22μ11i+1j=si1s2(μ22+jθ)=μ11Nsμ12Σ1Δ(N)α(s)[Σ2+(N1)θ]μ22f(1,N)Δ(N)i=0s21μ11i+1j=si1s2(μ22+jθ),(A.5)
where we used a change of variables to obtain the first part of the equality. Furthermore,
gNΣ1μ11s1f(1,s)=[Σ2+(N1)θ]μ22Δ(N)k=2sμ11k1sf(k,N)f(1,s)+[Σ2+(N1)θ]μ22k=s+1Nμ11k2f(k,N)+μ11N1Σ2Δ(N)μ11s1f(1,s)=[Σ2+(N1)θ]μ22Δ(N)k=0s21μ11k+1f(sk,N)f(1,s)+[Σ2+(N1)θ]μ22k=s+1Nμ11k2f(k,N)+μ11N1Σ2Δ(N)μ11s1f(1,s)=[Σ2+(N1)θ]μ22f(1,N)Δ(N)k=0s21μ11k+1j=sk1s2(μ22+jθ)(A.6)
+[Σ2+(N1)θ]μ22k=s+1Nμ11ks1f(k,N)+μ11NsΣ2Δ(N)f(1,s),(A.7)
where the second equality follows from a change of variables and the third equality follows from the fact that f(sk,N)f(1,s)=f(1,N)j=sk1s2(μ22+jθ) for k=0,,s2. Summing (A.5), (A.6), and (A.6) and using (11) yields (13). Similarly, with some simple algebra, (2) yields
Σ2gN=[Σ2+(N1)θ](Σ2f(1,N)+Σ1μ12α(N))Δ(N),
and (14) now follows directly from (12). 

Proof of Proposition 5.2(ii).

Note that μ22μ21μ12μ11. It then follows from (6), (7), and (25) that τ(n,μ21μ12μ11)<0 for all n2 because when μ22=μ21μ12μ11, we have τ(2,μ21μ12μ11)=θΣ1μ12. Furthermore, it also follows from (6), (7), and (25) that limμ22τ(n,μ22)= (because the largest power μ22n has a positive coefficient). We will show that for each n2,τ(n,μ22) has a single root μ22(n) greater than μ21μ12μ11.

We start by showing that gn is nondecreasing in μ22 for all n1. Assume that systems I and II are both operating under (dn) but that system I has a higher service rate μ22 than system II. Consider sample paths of both systems starting in the same state and generated using (the same) independent Poisson processes with rates μ11+μ21,μ12+μ22,(n1)θ, and thinning. As time evolves, the number of customers in system II will be always greater than or equal to the number of customers in system I because of the higher service rate of system I. In particular, when both systems are in the same state, every service completion at station 2 in system II is associated with a service completion at station 2 in system I. Moreover, every service completion at station 1 in system II is experienced in system I, and every abandonment from system I is associated with an abandonment from system II. Because system I has more service completions at station 1 and fewer abandonments than system II, it must have more service completions at station 2, and gn is nondecreasing in μ22.

Now, suppose that τ(n,μ22) has two roots μ21μ12μ11<μ221(n)<μ222(n). Then, τ(n,μ221(n))=τ(n,μ222(n))=0, which implies that gn(μ221(n))=gn1(μ221(n)) and gn(μ222(n))=gn1(μ222(n)). Again, consider two systems but now, with arbitrary service rate μ22.: system I operating under (dn) and system II operating under (dn1). Assume that both systems start at an arbitrary state sn1, and consider their sample paths generated using (the same) independent Poisson processes with rates μ11+μ21,μ12+μ22,(n1)θ, and thinning. As in part (i), the number of customers in system I will always be greater than or equal to the number of customers in system II. Thus, every service completion by server 2 at station 2 in system II is associated with a service completion by server 2 at station 2 in system I. This implies that as μ22 increases, gn(μ22) increases at least as rapidly as gn1(μ22). So, if gn(μ221(n))=gn1(μ221(n)), then gn(μ222(n))gn1(μ222(n)). Furthermore, the inequality must be strict because otherwise, we would have gn(μ22)=gn1(μ22) for all μ221(n)μ22μ222(n), which would imply that τ(n,μ22)=0 for all μ221(n)μ22μ222(n). This is not possible because τ(n,μ22) can have at most n roots as it is a polynomial of order n in μ22. Hence, there is a single μ22(n)>μ21μ12μ11 such that τ(n,μ22(n))=0.

We then have from (6) and (7) that

τ(n+1,μ22(n))=[μ22(n)+(n1)θ]τ(n,μ22(n))μ12θε(n1,μ22(n))<0,
which implies that μ22(n+1)>μ22(n). Thus,
N={B+2if μ22μ22(B+2),n{2,,B+1}if μ22(n)μ22<μ22(n+1),1for μ22<μ22(2),
and the result follows. 

Proof of Proposition 5.2(iv).

Note that it follows from (6) and (25) that for all 2nB+2,

τ(n,0)=f(2,n)τ(2,0)=f(2,n)Σ2μ11μ22>0.

First, assume μ21>0; then, we have 0<μ12μ11μ22μ21, and it follows from (6), (7), and (25) that for n2,τ(n,μ11μ22μ21)<0. Then, we can conclude that τ(n,μ12) has a single root 0<μ12(n)μ11μ22μ21 because we know from (6), (7), and (25) that τ(n,μ12) is a quadratic function of μ12 for n2. We then have from (6) that

τ(n+1,μ12(n))=[μ22+(n1)θ]τ(n,μ12(n))μ12(n)θε(n1)<0,
which implies that μ12(n+1)<μ12(n). Thus,
N={B+2if μ12μ12(B+2),n{2,,B+2}if μ12(n+1)<μ12μ12(n),1for μ12>μ12(2).

On the other hand, if μ21=0, then μ12(0,), but we have from (6), (7), and (25) that for n2,limμ12τ(n,μ12)= (as τ(2,μ12) is a linear function of μ12, τ(n,μ12) is a quadratic function of μ12 for n3, and all powers of μ12 have negative coefficients in both cases); the result follows similarly. 

Proof of Proposition 5.2(v).

It is easy to see from (5) that for all n2,τ(n,μ21) is a linear function of μ21 with coefficient

μ12([Σ2+(n2)θ]μ11α(n1)[Σ2+(n1)θ]α(n))<0,
where the inequality follows because (n2)θ<(n1)θ and μ11α(n1)=k=3nμ11k2f(k1,n1)<k=2nμ11k2f(k,n)=α(n). Hence, τ(n,μ21) is nonincreasing in μ21, and the result follows from (6) and (7). 

References

  • Aazami A, Saidi-Mehrabad M (2021) A production and distribution planning of perishable products with a fixed lifetime under vertical competition in the seller-buyer systems: A real-world application. J. Manufacturing Systems 58(Part A):223–247.Google Scholar
  • Ahn HS, Duenyas I, Lewis ME (2002) Optimal control of a two-stage tandem queuing system with flexible servers. Probab. Engrg. Inform. Sci. 16:453–469.Google Scholar
  • Ancker CJ, Gafarian AV (1963) Some queuing problems with balking and reneging. Oper. Res. 11(1):88–100.Google Scholar
  • Andradóttir S, Ayhan H, Down D (2001) Server assignment policies for maximizing the steady-state throughput of finite queueing systems. Management Sci. 47(10):1421–1439.Google Scholar
  • Armony M, Chan CW, Zhu B (2017) Critical care capacity management: Understanding the role of a step down unit. Production Oper. Management 27:859–883.Google Scholar
  • Arsani S, Debo L, Iravani S (2019) Optimal policy in single-server multi-class queuing systems with abandonment. Preprint, submitted June 1, http://dx.doi.org/10.2139/ssrn.3453227.Google Scholar
  • Atar R, Giat C, Shimkin N (2010) The cμ/θ rule for many-server queues with abandonment. Oper. Res. 58(5):1427–1439.Google Scholar
  • Ayhan H (2022) Server assignment policies in queues with customer abandonments. Queueing Systems 100:393–395.Google Scholar
  • Baccelli F, Boyer F, Hebuterne G (1984) Single server queues with impatient customers. Adv. Appl. Probab. 16:887–905.Google Scholar
  • Barrer DY (1957) Queuing with impatient customers and ordered service. Oper. Res. 5(5):650–656.LinkGoogle Scholar
  • Batt RJ, Terwiesch C (2015) Waiting patiently: An empirical study of queue abandonment in an emergency department. Management Sci. 61(1):39–59.Google Scholar
  • Boxma O, Perry D, Stadje W (2011) The M/G/1+G queue revisited. Queueing Systems 67:207–220.Google Scholar
  • Brandt A, Brandt M (2013) Workload and busy period for M/GI/1 with a general impatience mechanism. Queueing Systems 75:189–209.Google Scholar
  • Dai JG, He S (2013) Many-server queues with customer abandonment: Numerical analysis of their diffusion models. Stochastic Systems 3(1):96–146.Google Scholar
  • Dai JG, Tezcan T (2008) Optimal control of parallel server systems with many servers in heavy traffic. Queueing Systems 59:95–134.Google Scholar
  • Das S, Jenkins L, Sengupta D (2013) Analysis of an M/M/1+G queue operated under the FCFS policy with exact admission control. Queueing Systems 75:169–178.Google Scholar
  • Down DG, Koole G, Lewis ME (2011) Dynamic control of a single-server system with abandonments. Queueing Systems 67:63–90.Google Scholar
  • Fralix BH (2013) On the time-dependent moments of Markovian queues with reneging. Queueing Systems 75:149–168.Google Scholar
  • Iravani S, Posner MJM, Buzacott JA (1997) A two-stage tandem queue attended by a moving server with holding and switching costs. Queueing Systems 26:203–228.Google Scholar
  • Işik T, Andradóttir S, Ayhan H (2016) Optimal control of queueing systems with non-collaborating servers. Queueing Systems 84:79–110.Google Scholar
  • Jouini O, Dallery Y (2007) Monotonicity properties for multiserver queues with reneging and finite waiting times. Probab. Engrg. Inform. Sci. 21:335–360.Google Scholar
  • Kim C, Dudin A, Dudin S, Dudina O (2013) Tandem queueing system with impatient customers as a model of call center with interactive voice response. Performance Evaluation 70:440–453.Google Scholar
  • Koole G, Pot A (2011) Technical Note—A note on profit maximization and monotonicity for inbound call centers. Oper. Res. 59(5):1304–1308.LinkGoogle Scholar
  • Long Z, Shimkin N, Zhang H, Zhang J (2020) Dynamic scheduling of multiclass many-server queues with abandonment: The generalized cμ/h rule. Oper. Res. 68(4):1218–1230.LinkGoogle Scholar
  • Moyal P (2013) On queues with impatience: Stability, and the optimality of earliest deadline first. Queueing Systems 75:211–242.Google Scholar
  • Puha AL, Ward AR (2019) Scheduling an overloaded multiclass many-server queue with impatient customers. Netessine S, ed. INFORMS TutORials in Operations Research: Operations Research & Management Science in the Age of Analytics (INFORMS, Catonsville, MD), 189–217.Google Scholar
  • Puterman ML (1994) Markov Decision Processes (John Wiley & Sons, New York).Google Scholar
  • Reed J, Yechiali U (2013) Queues in tandem with customer deadlines and retrials. Queueing Systems 73:1–34.Google Scholar
  • van Leeuwen D, Núñez-Queija R (2017) Optimal dispatching in a tandem queue. Queueing Systems 87:269–281.Google Scholar
  • Wang J, Abouee Mehrizi H, Baron O, Berman O (2019) Tandem queues with impatient customers. Performance Evaluation 135(C):102011.Google Scholar
  • Whitt W (2006) Fluid models for multiserver queues with abandonments. Oper. Res. 54(1):37–54.LinkGoogle Scholar
  • Yu Z, Andradóttir S, Ayhan H (2023) Optimal control of supervisors balancing individual and joint responsibilities. Probab. Engrg. Inform. Sci., ePub ahead of print January 20, https://doi.org/10.1017/S0269964823000013.Google Scholar
  • Zayas-Cabán G, Xie J, Green LV, Lewis ME (2016) Dynamic control of a tandem system with abandonments. Queueing Systems 84:279–293.Google Scholar
  • Zayas-Cabán G, Xie J, Green LV, Lewis ME (2019) Policies for physician allocation to triage and treatment in emergency departments. IISE Trans. Healthcare Systems Engrg. 9:342–356.Google Scholar