Optimal Server Assignment in Queues with Flexible Servers and Abandonments
Abstract
We study a Markovian system of two stations in tandem and two flexible servers who are capable of working at both stations. The novelty of our work is that jobs waiting in line to be processed by the second station may leave the system (without being processed at the second station) after an exponential amount of time. We show that the dynamic server assignment policy that maximizes the long-run average throughput assigns one server to each station unless station 2 is starved (i.e., has no work) or the number of jobs in the buffer reaches a certain threshold (both servers work together at station 1 when station 2 is starved and at station 2 when the threshold is reached). We specify the criterion for the assignment of the servers to the stations and characterize the threshold. Finally, we investigate how the threshold depends on the service and abandonment rates.
1. Introduction
We consider a Markovian system of two tandem stations, a finite buffer of size B between the stations, and two servers. Suppose there is an infinite supply of raw materials in front of the first station, infinite storage space after the second station, and the system is operating under manufacturing blocking. The servers are flexible in that they are crosstrained and allowed to switch between stations with negligible travel time. We assume that each server can work on only one job at a time and that if both servers are assigned to the same station, they work together on one job. The jobs waiting in line to be processed by the second station (i.e., after being processed by the first station) may abandon the system (without receiving service at the second station) after an exponential amount of time with rate . The service rate of server i = 1, 2 at station j = 1, 2 is denoted by μij. When two servers work together on a job, their service rates are assumed to be additive. The service requirements for the jobs at the two stations are independently and exponentially distributed, and without loss of generality, we assume the mean service requirement of each job at each station is one. For example, such a queueing system can be used to model a manufacturing system producing time-sensitive goods (such as food items, pharmaceutical products, etc.) (as an example, see Aazami and Saidi-Mehrabad 2021).
For the queueing system described, our objective is to determine the dynamic server assignment policy that maximizes the long-run average throughput. The trade-off between assigning servers to station 1 versus station 2 is that one may either risk losing jobs that have been already processed at station 1 or not take advantage of the servers’ abilities at the station that they are most effective at. By dynamic assignment, we mean that servers are allocated to stations according to the state of the underlying system. We use Π to denote the set of all dynamic server assignment policies and to denote the number of departures from the tandem line (after completing service at station 2) under policy by time . Define as the long-run average throughput under policy π. Our objective is to solve the optimization problem
The rest of the paper is organized as follows. In Section 2, we discuss the relevant literature. Section 3 describes the Markov decision process problem that will be used to determine the optimal assignment of the servers. In Section 4, we present our optimal server allocation policy. Finally, Section 5 discusses the properties of the optimal policy. Proofs of some technical results are given in the appendix.
2. Literature Review
Tandem queues with flexible server(s) have drawn a lot of attention in the queueing theory literature. Many authors have considered the dynamic allocation of flexible server(s) to the stations in order to maximize various performance measures. In the interest of space, rather than presenting the entire body of literature in this area, we will mention only some sample papers. In one of the earliest papers in the area, Iravani et al. (1997) considered a two-station tandem queue with a single flexible server. Andradóttir et al. (2001) is the first paper that focused on throughput maximization in tandem queues where the number of servers is equal to the number of stations and the servers are heterogenous. Ahn et al. (2002) considered minimizing the holding cost in a two-station and two-server setting with homogeneous servers. In a more recent paper, Işik et al. (2016) focused on tandem lines with equal numbers of servers and stations where servers do not collaborate. van Leeuwen and Núñez-Queija (2017) investigated a Markovian tandem queueing model, in which service to the first queue is provided in batches, but rather than determining the assignment of servers, they focused on choosing the batch sizes so as to minimize a linear cost function of the mean queue lengths.
On the other hand, queueing systems with customer abandonments arise in many applications, such as manufacturing systems, call centers, hospital emergency rooms, etc. As a result, queues with abandonments have been of interest for a long time (see, for example, Barrer 1957, Ancker and Gafarian 1963, and Baccelli et al. 1984 for earlier works and Jouini and Dallery 2007, Boxma et al. 2011, Brandt and Brandt 2013, Das et al. 2013, Fralix 2013, and Moyal 2013 for more recent results). However, abandonment is a difficult phenomenon to analyze exactly. Because of the interest in call centers, many authors have considered asymptotic analysis of parallel queues with abandonments (see, for example, Whitt 2006 and Dai and He 2013) as well as the determination of asymptotically optimal staffing decisions in parallel queues with abandonments (see, for example, Dai and Tezcan 2008, Atar et al. 2010, Puha and Ward 2019, and Long et al. 2020).
There are a handful of papers that focus on obtaining exact optimal server assignment policies in parallel queues with abandonments. Koole and Pot (2011) consider an inbound call center with a fixed reward per call and communication and agent costs. In the presence of abandonments, they maximize the profit by controlling the number of lines and the number of agents. Down et al. (2011) consider the optimal dynamic assignment of a single server in a two-class service system with abandonments. They formulate the problem as a continuous-time Markov decision process to obtain partial results. Ansari et al. (2019) study a multiclass queueing system with a single server and customer abandonment. They show that the optimal scheduling policy of the server (to minimize the long-run average customer abandonment cost) is a static priority policy. They then derive conditions under which the asymptotically optimal policy of Atar et al. (2010) is optimal in this setting. On the other hand, Batt and Terwiesch (2015) provide an empirical study of parallel queues with impatient customers in the context of an emergency department.
However, the literature on the analysis and control of tandem queues with customer abandonments is scarce. Toward this end, Ayhan (2022) draws attention to this void in the literature, identifies a class of tandem queueing systems with customers abandonments for which exact analysis and control could be possible, and lists some of the open problems in this area that might be of interest to the queueing community. Armony et al. (2017) use fluid and diffusion approximations to determine the allocation of nurses in a tandem queue with abandonments that arises in a healthcare setting. Kim et al. (2013) derive the Laplace–Stieltjes transform of the sojourn time distribution in a tandem queue with Markovian arrivals and customer abandonments that can model a call center with interactive voice response. Reed and Yechiali (2013) study a two-station tandem queue with deadlines and retrials (i.e., when the deadline of a job expires, it is fed back to the end of the line rather than abandoning) and derive its stability condition. To the best of our knowledge, there are only four papers that provide exact analysis of tandem queues with flexible servers and abandonments. We review these next.
Wang et al. (2019) consider a Markovian queuing system of two stations in tandem where the first station has infinite buffer capacity and the second station has finite buffer capacity. Customers can get impatient and leave while waiting for service at the first or second station. Furthermore, they can also balk if there is no place available in the buffer of the second station when they complete service at the first station. The authors develop a recursive algorithm to compute the throughput of such systems and then, use their methodology to compute the system throughput under various staffing strategies. Zayas-Cabán et al. (2016) consider a similar Markovian system but with infinite buffer capacity at both stations. The objective is to determine the dynamic allocation of multiple homogenous servers at the two stations in order to maximize the discounted infinite-horizon expected reward and long-run average reward. The main novelty of their paper is to analyze the continuous-time problem directly because uniformization is not possible because of infinite buffer sizes and abandonments. They provide some sufficient conditions that guarantee that simple priority policies are optimal. In a follow-up paper, Zayas-Cabán et al. (2019) consider emergency departments with two stages where the patients can abandon without being treated in the second stage. They introduce threshold policies that prioritize the second stage unless the number of patients in the first stage exceeds the threshold and provide heuristics on how to choose the threshold. In a recent paper, Yu et al. (2023) consider a two-stage service system with two types of servers, namely subordinates who perform the first-stage service and supervisors who have their own responsibilities in addition to collaborating with the subordinates on the second-stage service. Rewards are earned when first- or second-stage service is completed and when supervisors finish one of their own responsibilities. Costs are incurred when impatient customers abandon without completing the second-stage service. They determine how the supervisors should distribute their time between their joint work with the subordinates and their own responsibilities to maximize the long-run average reward.
We now explain the positioning of this work relative to other papers (i.e., Zayas-Cabán et al. 2016, 2019; Wang et al. 2019; Yu et al. 2023) that also provide exact analysis of tandem systems with flexible servers and abandonments. In particular, unlike Zayas-Cabán et al. (2016, 2019) and Wang et al. (2019), we completely characterize the server allocation policy that maximizes the long-run average throughput for the two-station and two-server tandem network with abandonments described in Section 1. Furthermore, we allow the servers to be heterogenous in that their service rates not only depend on the station but also, depend on the server. Finally, unlike Yu et al. (2023), we consider the optimal allocation of all heterogenous servers, whereas Yu et al. (2023) focuses on dynamic assignment of one server type in a different setting.
3. Problem Formulation
For all and , let denote the number of jobs that have been processed at station 1 but are either waiting to be processed or being processed by station 2 at time t, where . From now on, we assume that Π is the set of all Markovian stationary deterministic policies corresponding to the state space S. For any given is clearly a birth-death process with state space S. Because and the sum of all transition rates out of each state is bounded by , is uniformizable with a finite uniformization constant q. Thus, the continuous-time optimization Problem (1) can be translated into a discrete-time Markov decision problem by uniformization (i.e., modeling the state transitions at the event times of a Poisson process with rate q) (see, for example, Andradóttir et al. 2001). Note that q must satisfy , and the chosen value of q has no impact on our results.
We next provide a formal description of the discrete-time Markov decision process problem. As mentioned, the state space is , and we define the action space , where means that server i is assigned to station j and means server i is idle.
We next specify for all and , r(s, a) and , which are the immediate reward and the probability of going to state j in one step when action a is chosen in state s when action a is chosen in state s, which is. We have for all ,
Finally, for all ,
Furthermore,
Note that because of the abandonments, the Markov decision process problem is unichain, and hence, the long-run average reward optimality equations for all are given as
4. Optimal Server Assignment Policy
In this section, we completely characterize the dynamic server assignment policy that maximizes the long-run average throughput. Without loss of generality, we number the servers such that . From our assumptions in Section 1, this means that and . For , define the decision rule dn as follows:
Note that is the expedite policy, whereas is the optimal policy when there are no abandonments (as shown in Andradóttir et al. 2001). On the other hand, , for , is equivalent to the optimal policy of systems without abandonments with a buffer size of n – 2.
We next state the main result of the paper, which completely characterizes the optimal server assignment policy. Note that the uniqueness of the optimal policy is subject to the interpretation that two actions could be equivalent if the service rate of a server at a particular station is zero.
When the servers are numbered such that , the policy is optimal with throughput where N and gn (for ) are defined in (8) and (2), respectively. Furthermore, this is the unique optimal policy in the class of Markovian stationary deterministic policies when , where (for ) is defined in (4) and (5).
We next define the terminology used in the statement of Theorem 4.1 and provide several results that will be used in its proof. One can easily compute the long-run average throughput gn under policy , for , as
Thus, the sign of is the same as the sign of for . The next proposition provides a useful property of the function.
For , if , then for all . Similarly, if , then for all .
For , we have
The relationship in (6) with some algebra follows from the equalities
Because , we have if . Note that if n = 1, the assertion still holds because . Similarly, if , then .
We then define
Note that Proposition 4.1 implies that and . It also follows from (6) that when . This makes sense because when .
Before proving the theorem, we will present several properties of the bias vector hN under policy . Let be the -dimensional transition probability matrix corresponding to the decision rule dN, and let be the -dimensional reward vector corresponding to dN, with denoting the reward earned in state s under decision rule dN, for all (see Section 3). Then,
Because the Markov decision process problem is unichain, we employ the unichain policy iteration algorithm (see section 8.6.1 in Puterman 1994) and find a scalar gN and a vector hN solving
We have
The next lemma provides closed-form expressions for the differences in (11) and (12). Note that because , (10), (13), and (14) show that, for all .
For ,
We use policy iteration for unichain models (see section 8.6.1 in Puterman 1994) to prove the optimality of . In particular, we set the initial decision rule ν0 of the policy iteration algorithm equal to dN and then, show that the decision rule ν1 computed as
For s = 0, we have . Then, (10) yields
Note that if and only if , in which case actions a11 and a12 are the same. Similarly, for ,
Finally, when . Because in state s = 0, , and , we have shown that for all . In the interest of space, for the rest of the proof, we do not specify the differences for idling actions that are all nonnegative (in particular, the differences are strictly positive as long as the actions are not equivalent).
Next, we consider for which . For ,
In order to prove that , we will show that the expressions in (15)–(21) are nonnegative. Note that
Next, we consider for which . For ,
Hence, for all . Next, we consider for . We have for ,
Thus,
Thus, for all . Finally, for ,
This proves the optimality of (see theorem 8.6.6 in Puterman 1994). Because for all , when , and a and are not equivalent, the uniqueness of the optimal policy follows from proposition 8.5.10 in Puterman 1994.
5. Properties of the Optimal Policy
Theorem 4.1 demonstrates that as in the optimal server assignment policy without abandonments (characterized in Andradóttir et al. 2001), the primary assignment of the servers to the stations is determined by maximizing the product of the service rates. Furthermore, as in Andradóttir et al. (2001), the servers work together at station 1 only when there is no work to be done at station 2. However, unlike the optimal policy without abandonments, servers work together at station 2 as long as there are N or more customers in the system where . Thus, the optimal policy does not necessarily use the entire buffer space. Hence, Theorem 4.1 suggests that there is a trade-off between using servers where they are effective (i.e., by letting N be large) and avoiding abandonments (i.e., by choosing N small). Theorem 4.1 also indicates that in the presence of abandonments, the buffer allocation should be done accordingly as one can incur unnecessary costs by assigning buffer space, which will not be needed. We now identify conditions under which n = 1 (so that the expedite policy is optimal).
It is easy to see that n = 1 if and only if and . This follows because , and from (5), we have
It follows from Remark 5.1 and Theorem 4.1 that (i.e., the expedite policy) is the unique optimal policy when for any . Thus, when there are abandonments, the optimal policy utilizes no buffer space. This is different from the case when θ = 0. Indeed, we know from Andradóttir et al. (2001) that when there are no abandonments and is an optimal policy, but it is not unique. In fact, in this case, is also optimal. Continuing in this vein, the next corollary (which follows from Remark 5.1 and Theorem 4.1) shows that (i.e., the expedite policy) is the unique optimal policy when the servers are generalists (i.e., for i = 1, 2 and j = 1, 2; observe that is always positive).
Suppose that for i = 1, 2 and j = 1, 2. Then, is the unique optimal policy.
Note that when the servers are generalists, they are equally effective at both stations. As a result, there is no trade-off between using the servers at stations that they are effective at versus avoiding abandonments. Thus, the optimal policy avoids abandonments completely and utilizes no buffer space.
The next proposition implies that for large-enough B, there exists as long as .
We have for and ,
Note that combining similar terms, with some algebra the expression of given in (5) can be written as
Note that N is nondecreasing in the buffer size B. This immediately follows from the fact that does not depend on B (see Equations (4) and (5)). Furthermore, we have from Proposition 5.1 that if , then N will remain the same after some finite B.
In the next proposition, we characterize how N changes with respect to the abandonment rate θ and the four service rates , and μ21. In what follows, increasing means nondecreasing, and decreasing means nonincreasing.
We have
N is decreasing in θ,
N is increasing in μ22,
N is either increasing or is first increasing and then decreasing in μ11,
N is decreasing in μ12, and
N is decreasing in μ21.
First assume that . Then, , and (i)–(iii) and (v) hold trivially. Moreover, (iv) also holds because N cannot be larger than B + 2. For the rest of the proof, we assume that . Furthermore, with a slight abuse of notation, in this proof, we will have gn, τ, and ε not only as functions of n but also, the corresponding rate considered in each case.
We have from Remark 5.1 that n = 1 if . Thus, the result holds for this case, and we assume that for the rest of the proof. Note that it follows from (6) and (25) that, for ,
(27)Furthermore, it follows from (6), (7), and (25) that (because the largest power has a negative coefficient). Thus, for each has a positive root. We will show that has a single positive root .
We start by showing that gn is nonincreasing in θ for all . Note that g1 does not depend on θ and hence, is constant in θ. Now, assume that systems I and II are both operating under for but that system I has a higher abandonment rate than system II. Consider sample paths of both systems starting in the same state and generated using (the same) independent Poisson processes with rates , and thinning. As time evolves, the number of customers in system II will be always greater than or equal to the number of customers in system I because of the higher abandonment rate of system I. In particular, when both systems are in the same state, every abandonment from (service completion at station 1 in) system II is associated with a simultaneous abandonment from (service completion at station 1 in) system I. Moreover, every service completion at station 2 in system I is accompanied by a simultaneous service completion at station 2 in system II. This shows that more customers complete service at station 2 in system II and that gn is nonincreasing in θ.
Now, suppose that has two roots . Then, , which implies that and . Again, consider two systems but now, with arbitrary abandonment rate θ: system I operating under and system II operating under . Assume that both systems start at an arbitrary state , and consider their sample paths generated using (the same) independent Poisson processes with rates , and thinning. Then, both systems will evolve the same until state n – 1 is reached. If there is an abandonment in state n – 1, both systems experience the abandonment, but it is possible that in state n – 1, system I (but not system II) can transition to state n because of a service completion at station 1 or system II (but not necessarily system I) will transition to state n – 2 because of a service completion at station 2. If this happens, then from this point onward, the number of customers in system I will dominate the number of customers in system II. Because the number of customers in system I is always greater than or equal to the number of customers in system II, all abandonments in system II will be experienced in system I as well. However, when the two systems are not in the same state, there may be abandonments in system I that do not occur in system II. Some of these additional abandonments in system I may be replaced by additional service completions at station 1 in system I but not in system II (if both systems are in state n ‒ 1); others may not be replaced. Because all service completions at station 1 lead to either abandonments or service completions at station 2, this implies that as θ increases, decreases at least as rapidly as . So, if , then . Furthermore, the inequality must be strict because otherwise, we would have for all , which would imply that for all . This is not possible because can have at most n – 1 roots as it is a polynomial of order n – 1 in θ. Hence, there is a single such that .
We then have from (6) and (7) that
which together with (27), implies that . Thus,and the result follows.The proof is similar to the proof of part (i) and is given in the appendix.
Note that , and it immediately follows from (6) and (7) that if for some , and then, . From (25), observe that
(28)Then, n = 1 if , n = 1 if and , and n = 2 if and . Thus, the result holds when .
Now, suppose that . It follows from (6), (7), and (25) that for all because when , we have (because ). Furthermore, (from (28)) and for all (because (6), (7), and (28) imply that the highest-power has a negative coefficient). It follows from (25) and (28) that has a single root greater than . Finally, one can immediately see from (26) that for , and hence, is a concave function of μ11. We can then conclude that for has two roots, a single root, or no roots. Note that it follows from (6) and (7) that if has a single root or no roots, then has no roots. First, assume that there exists such that has two roots, and let be the largest such n. Then, for must all have two roots. We consider two cases.
, and has a single root. Let and be the two roots of for and be the root of . Then, these roots should satisfy because from (6) and (7), we have , which implies that and is concave in μ11. We will have n = 1 if because for all ; n = 2 if because for all ; n = 3 if because for all ; ; if because for all ; if because for all ; if because for all ; ; n = 3 if because for all ; and n = 2 if because for all . Thus, in this case, N is first nondecreasing and then nonincreasing in μ11.
, or has no roots. Using the argument in (a), one can conclude that N will be first nondecreasing and then nonincreasing in μ11 but in this case, will never attain .
On the other hand, if there is no such that has two roots, we also consider two cases.
has one root. Then, we will have , and n = 1 if ; n = 2 if ; n = 3 if ; and n = 2 if . Thus, N is again first nondecreasing and then nonincreasing in μ11.
has no roots. Then, n = 1 if and n = 2 if . Hence, in this case, N is nondecreasing in μ11.
This concludes the proof that when , N is either nondecreasing or is first nondecreasing and then nonincreasing in μ11.
Proofs are given in the appendix.
As expected, Proposition 5.2(i) shows that as the abandonment rate θ increases, the optimal policy utilizes smaller buffer space in order to avoid abandonments. On the other hand, we have from Proposition 5.2(ii) that as the service rate μ22 of the server at the second station increases, the optimal policy uses a larger buffer space and takes advantage of the effectiveness of the server at station 2. Moreover, Proposition 5.2(iii) indicates that when μ11 is sufficiently large, the jobs accumulate quickly in the buffer, and the optimal policy uses a smaller buffer capacity as the service rate μ11 at the first station increases. However, for small μ11, it is possible that the optimal policy may utilize the specialty of the servers and that the optimal (utilized) buffer capacity may increase as μ11 increases. As an example, consider a system with , and θ = 4. It is easy to verify that when , n = 4; when , n = 5; and when , n = 4. Thus, N first increases and then decreases as μ11 increases. Proposition 5.2(iv) shows that the optimal (utilized) buffer capacity decreases as the service rate μ12 of the first server at the second station increases (provided that it is less than or equal to ). Similarly, Proposition 5.2(v) indicates that if the service rate μ21 of the second server at the first station increases (provided that it is less than or equal to ), then the optimal policy uses a smaller buffer space. These results are reasonable because as the service rates μ12, μ21 increase, the servers become less specialized, and hence, the optimal policy shifts emphasis from utilizing server specialty toward reducing abandonments.
Appendix
In component notation, the equations in (9) can be written as
The equality in (10) follows immediately from (A.1). We use induction to prove the equality in (11). Note that for s = 2, the result follows immediately from (A.2) and (10). Now, assume that (11) holds for , and we will show that it holds for . We have from (A.3) that
In order to prove (13), we plug in Expression (2) for gN in (11). Note that with some simple algebra, we have
Then,
Note that . It then follows from (6), (7), and (25) that for all because when , we have . Furthermore, it also follows from (6), (7), and (25) that (because the largest power has a positive coefficient). We will show that for each has a single root greater than .
We start by showing that gn is nondecreasing in μ22 for all . Assume that systems I and II are both operating under but that system I has a higher service rate than system II. Consider sample paths of both systems starting in the same state and generated using (the same) independent Poisson processes with rates , and thinning. As time evolves, the number of customers in system II will be always greater than or equal to the number of customers in system I because of the higher service rate of system I. In particular, when both systems are in the same state, every service completion at station 2 in system II is associated with a service completion at station 2 in system I. Moreover, every service completion at station 1 in system II is experienced in system I, and every abandonment from system I is associated with an abandonment from system II. Because system I has more service completions at station 1 and fewer abandonments than system II, it must have more service completions at station 2, and gn is nondecreasing in μ22.
Now, suppose that has two roots . Then, , which implies that and . Again, consider two systems but now, with arbitrary service rate μ22.: system I operating under and system II operating under . Assume that both systems start at an arbitrary state , and consider their sample paths generated using (the same) independent Poisson processes with rates , and thinning. As in part (i), the number of customers in system I will always be greater than or equal to the number of customers in system II. Thus, every service completion by server 2 at station 2 in system II is associated with a service completion by server 2 at station 2 in system I. This implies that as μ22 increases, increases at least as rapidly as . So, if , then . Furthermore, the inequality must be strict because otherwise, we would have for all , which would imply that for all . This is not possible because can have at most n roots as it is a polynomial of order n in μ22. Hence, there is a single such that .
We then have from (6) and (7) that
Note that it follows from (6) and (25) that for all ,
First, assume ; then, we have , and it follows from (6), (7), and (25) that for . Then, we can conclude that has a single root because we know from (6), (7), and (25) that is a quadratic function of μ12 for . We then have from (6) that
On the other hand, if , then , but we have from (6), (7), and (25) that for (as is a linear function of μ12, is a quadratic function of μ12 for , and all powers of μ12 have negative coefficients in both cases); the result follows similarly.
It is easy to see from (5) that for all is a linear function of μ21 with coefficient
References
- (2021) A production and distribution planning of perishable products with a fixed lifetime under vertical competition in the seller-buyer systems: A real-world application. J. Manufacturing Systems 58(Part A):223–247.Google Scholar
- (2002) Optimal control of a two-stage tandem queuing system with flexible servers. Probab. Engrg. Inform. Sci. 16:453–469.Google Scholar
- (1963) Some queuing problems with balking and reneging. Oper. Res. 11(1):88–100.Google Scholar
- (2001) Server assignment policies for maximizing the steady-state throughput of finite queueing systems. Management Sci. 47(10):1421–1439.Google Scholar
- (2017) Critical care capacity management: Understanding the role of a step down unit. Production Oper. Management 27:859–883.Google Scholar
- (2019) Optimal policy in single-server multi-class queuing systems with abandonment. Preprint, submitted June 1, http://dx.doi.org/10.2139/ssrn.3453227.Google Scholar
- (2010) The rule for many-server queues with abandonment. Oper. Res. 58(5):1427–1439.Google Scholar
- (2022) Server assignment policies in queues with customer abandonments. Queueing Systems 100:393–395.Google Scholar
- (1984) Single server queues with impatient customers. Adv. Appl. Probab. 16:887–905.Google Scholar
- (1957) Queuing with impatient customers and ordered service. Oper. Res. 5(5):650–656.Link, Google Scholar
- (2015) Waiting patiently: An empirical study of queue abandonment in an emergency department. Management Sci. 61(1):39–59.Google Scholar
- (2011) The M/G/1+G queue revisited. Queueing Systems 67:207–220.Google Scholar
- (2013) Workload and busy period for M/GI/1 with a general impatience mechanism. Queueing Systems 75:189–209.Google Scholar
- (2013) Many-server queues with customer abandonment: Numerical analysis of their diffusion models. Stochastic Systems 3(1):96–146.Google Scholar
- (2008) Optimal control of parallel server systems with many servers in heavy traffic. Queueing Systems 59:95–134.Google Scholar
- (2013) Analysis of an M/M/1+G queue operated under the FCFS policy with exact admission control. Queueing Systems 75:169–178.Google Scholar
- (2011) Dynamic control of a single-server system with abandonments. Queueing Systems 67:63–90.Google Scholar
- (2013) On the time-dependent moments of Markovian queues with reneging. Queueing Systems 75:149–168.Google Scholar
- (1997) A two-stage tandem queue attended by a moving server with holding and switching costs. Queueing Systems 26:203–228.Google Scholar
- (2016) Optimal control of queueing systems with non-collaborating servers. Queueing Systems 84:79–110.Google Scholar
- (2007) Monotonicity properties for multiserver queues with reneging and finite waiting times. Probab. Engrg. Inform. Sci. 21:335–360.Google Scholar
- (2013) Tandem queueing system with impatient customers as a model of call center with interactive voice response. Performance Evaluation 70:440–453.Google Scholar
- (2011) Technical Note—A note on profit maximization and monotonicity for inbound call centers. Oper. Res. 59(5):1304–1308.Link, Google Scholar
- (2020) Dynamic scheduling of multiclass many-server queues with abandonment: The generalized rule. Oper. Res. 68(4):1218–1230.Link, Google Scholar
- (2013) On queues with impatience: Stability, and the optimality of earliest deadline first. Queueing Systems 75:211–242.Google Scholar
- (2019)
Scheduling an overloaded multiclass many-server queue with impatient customers . Netessine S, ed. INFORMS TutORials in Operations Research: Operations Research & Management Science in the Age of Analytics (INFORMS, Catonsville, MD), 189–217.Google Scholar - (1994) Markov Decision Processes (John Wiley & Sons, New York).Google Scholar
- (2013) Queues in tandem with customer deadlines and retrials. Queueing Systems 73:1–34.Google Scholar
- (2017) Optimal dispatching in a tandem queue. Queueing Systems 87:269–281.Google Scholar
- (2019) Tandem queues with impatient customers. Performance Evaluation 135(C):102011.Google Scholar
- (2006) Fluid models for multiserver queues with abandonments. Oper. Res. 54(1):37–54.Link, Google Scholar
- (2023) Optimal control of supervisors balancing individual and joint responsibilities. Probab. Engrg. Inform. Sci., ePub ahead of print January 20, https://doi.org/10.1017/S0269964823000013.Google Scholar
- (2016) Dynamic control of a tandem system with abandonments. Queueing Systems 84:279–293.Google Scholar
- (2019) Policies for physician allocation to triage and treatment in emergency departments. IISE Trans. Healthcare Systems Engrg. 9:342–356.Google Scholar

