Understanding the Efficiency of Multi-Server Service Systems

Ward Whitt
Ward Whitt
AT&T Bell Laboratories, Murray Hill, New Jersey 07974
Search for more papers by this author

AT&T Bell Laboratories, Murray Hill, New Jersey 07974

Published Online:1 May 1992https://doi.org/10.1287/mnsc.38.5.708

Abstract

In the design and operation of service systems, it is important to determine an appropriate level of server utilization (the proportion of time each server should be working). In a multi-server queue with unlimited waiting space, the appropriate server utilization typically increases as the number of servers (and the arrival rate) increases. We explain this economy of scale and give a rough quantitative characterization. We also show how increased variability in the arrival and service processes tends to reduce server utilization with a given grade of service. As part of this analysis, we develop simple approximations for the mean steady-state waiting time and the full steady-state waiting-time distribution. These approximations exploit an infinite-server approximation for the probability of delay and a single-server approximation for the conditional waiting-time distribution given that waiting occurs. The emphasis is on simple formulas that directly convey understanding.

Cited by
- The Impact of Market Growth on Delivery Speed: Evidence from JD.com
  Tarek Abdallah,
  Robert L. Bray,
  Hojun Choi,
  Vadim Glinskiy
  22 May 2026 | Service Science, Vol. 0, No. 0
- How to Staff When Customers Arrive in Batches
  Andrew Daw,
  Robert C. Hampshire,
  Jamol J. Pender
  12 November 2024 | Management Science, Vol. 71, No. 8
- My CXL Pool Obviates Your PCIe Switch
  6 June 2025
- Optimal Routing to Parallel Servers in Heavy Traffic
  Heng-Qing Ye
  16 August 2023 | Operations Research, Vol. 73, No. 1
- Zero-Order Distributed Non-Stationary Optimization for Hölder-Smooth Problems With Unknown-But-Bounded Noise
  IEEE Access, Vol. 13
- Diffusion-Based Staffing for Multitasking Service Systems with Many Servers
  Jaap Storm,
  Wouter Berkelmans,
  René Bekker
  28 December 2023 | Mathematics of Operations Research, Vol. 49, No. 4
- Tutorials in Operations Research: Smarter Decisions for a Better World
  Vlad Babich
  Pooja Dewan
  Jamol Pender
  David Alderson
  Harvey J. Greenberg
  17 October 2024
- Humanitarian Operations and Earmarked Funding
  Alfonso J. Pedraza-Martinez
  17 October 2024
- Stochastic modeling of integrated order fulfillment processes with delivery time promise: Order picking, batching, and last-mile delivery
  European Journal of Operational Research, Vol. 316, No. 3
- Real-Time Estimations for the Waiting-Time Distribution in Time-Varying Queues
- Team Size and Composition in Home Healthcare: Quantitative Insights and Six Model-Based Principles
  9 November 2023 | Healthcare, Vol. 11, No. 22
- Overflow in systems with two servers: the negative consequences
  13 June 2022 | Flexible Services and Manufacturing Journal, Vol. 35, No. 3
- Optimizing Task Waiting Times in Dynamic Vehicle Routing
  IEEE Robotics and Automation Letters, Vol. 8, No. 9
- Many-Server Heavy-Traffic Limits for Queueing Systems with Perfectly Correlated Service and Patience Times
  Lun Yu,
  Ohad Perry
  8 September 2022 | Mathematics of Operations Research, Vol. 48, No. 2
- Service staffing with delay probabilities
  Operations Research Letters, Vol. 51, No. 3
- On System-Wide Safety Staffing of Large-Scale Parallel Server Networks
  Hassan Hmedi,
  Ari Arapostathis,
  Guodong Pang
  14 February 2022 | Operations Research, Vol. 71, No. 2
- Applications of fluid models in service operations management
  5 November 2022 | Queueing Systems, Vol. 103, No. 1-2
- Non-asymptotic Staffing of Service Systems with a Finite Population
  1 January 2023 | SSRN Electronic Journal, Vol. 68
- Operating Three-Sided Marketplace: Pricing and Spatial Staffing in Food Delivery Systems
  1 January 2023 | SSRN Electronic Journal, Vol. 25
- Uniform stability of some large-scale parallel server networks
  23 July 2022 | Queueing Systems, Vol. 102, No. 3-4
- Pseudo Steady-State Period in Non-Stationary Infinite-Server Queue with State Dependent Arrival Intensity
  28 July 2022 | Mathematics, Vol. 10, No. 15
- A Fluid-Diffusion-Hybrid Limiting Approximation for Priority Systems with Fast and Slow Customers
  Lun Yu,
  Seyed Iravani,
  Ohad Perry
  15 November 2021 | Operations Research, Vol. 70, No. 4
- Staffing for many-server systems facing non-standard arrival processes
  European Journal of Operational Research, Vol. 296, No. 3
- The impact of prolonged service time under off-service placement on flexibility configurations
  SSRN Electronic Journal, Vol. 35
- The hidden cost of the edge
  13 November 2021
- Leveraging Slack Capacity in IaaS Contract Cloud Services
  1 April 2021 | Production and Operations Management, Vol. 30, No. 4
- Assessing uncertainty and risk in an expeditionary military logistics network
  10 July 2019 | The Journal of Defense Modeling and Simulation: Applications, Methodology, Technology, Vol. 18, No. 2
- Joint planning for battery swap and supercharging networks with priority service queues
  International Journal of Production Economics, Vol. 233
- Open problems in queueing theory inspired by datacenter computing
  27 January 2021 | Queueing Systems, Vol. 97, No. 1-2
- Optimal Routing to Parallel Servers in Heavy Traffic
  SSRN Electronic Journal, Vol. 14
- Fleet Coordination in Decentralized Humanitarian Operations Funded by Earmarked Donations
  Alfonso J. Pedraza-Martinez,
  Sameer Hasija,
  Luk N. Van Wassenhove
  25 June 2020 | Operations Research, Vol. 68, No. 4
- Statistical Theory Powering Data Science
  Statistical Science, Vol. 34, No. 4
- Police staffing and workload assignment in law enforcement using multi-server queueing models
  European Journal of Operational Research, Vol. 276, No. 2
- The Impact of Delay Announcements on Hospital Network Coordination and Waiting Times
  Jing Dong,
  Elad Yom-Tov,
  Galit B. Yom-Tov
  30 July 2018 | Management Science, Vol. 65, No. 5
- Performance Degradation in Parallel-Server Systems
  IEEE/ACM Transactions on Networking, Vol. 27, No. 2
- Stochastic Location Models with Congestion
  17 March 2020
- Flexible bed allocations for hospital wards
  8 April 2016 | Health Care Management Science, Vol. 20, No. 4
- Closed-Form Approximations for Optimal (r, q) and (S, T) Policies in a Parallel Processing Environment
  Marcus Ang,
  Karl Sigman,
  Jing-Sheng Song,
  Hanqin Zhang
  19 July 2017 | Operations Research, Vol. 65, No. 5
- A multi-station system for reducing congestion in high-variability queues
  European Journal of Operational Research, Vol. 262, No. 2
- Managing capacity and inventory jointly for multi-server make-to-stock queues
  11 March 2017 | Queueing Systems, Vol. 86, No. 1-2
- Predicting the performance of queues–A data analytic approach
  Computers & Operations Research, Vol. 76
- STAFFING A SERVICE SYSTEM WITH NON-POISSON NON-STATIONARY ARRIVALS
  13 June 2016 | Probability in the Engineering and Informational Sciences, Vol. 30, No. 4
- Stochastic call center staffing with uncertain arrival, service and abandonment rates: A Bayesian perspective
  10 November 2016 | Naval Research Logistics (NRL), Vol. 63, No. 6
- Care on demand in nursing homes: a queueing theoretic approach
  27 December 2014 | Health Care Management Science, Vol. 19, No. 3
- The Vehicle Mix Decision in Emergency Medical Service Systems
  Kenneth C. Chong,
  Shane G. Henderson,
  Mark E. Lewis
  12 October 2015 | Manufacturing & Service Operations Management, Vol. 18, No. 3
- Staffing and scheduling under nonstationary demand for service: A literature review
  Omega, Vol. 58
- Towards High-Value(d) Nursing Home Care: Providing Client-Centred care in a More Efficient Manner
  SSRN Electronic Journal, Vol. 43
- Managing nurse lines – practical challenges and the developing theory
  3 February 2015 | International Journal of Production Research, Vol. 53, No. 24
- A Heuristic for the Multisource Weber Problem with Service Level Constraints
  Prahalad Venkateshan,
  Kamlesh Mathur
  4 September 2014 | Transportation Science, Vol. 49, No. 3
- M/M/c Queue with Two Priority Classes
  Jianfu Wang,
  Opher Baron,
  Alan Scheller-Wolf
  13 May 2015 | Operations Research, Vol. 63, No. 3
- Stochastic grey-box modeling of queueing systems: fitting birth-and-death processes to data
  2 December 2014 | Queueing Systems, Vol. 79, No. 3-4
- Approximations for the waiting-time distribution in an $$M/PH/c$$ M / P H / c priority queue
  10 February 2015 | OR Spectrum, Vol. 37, No. 2
- Stochastic Location Models with Congestion
  21 January 2015
- Leveraging Slack Capacity in IaaS Contract Cloud Services
  SSRN Electronic Journal, Vol. 20
- On the Fairness-Efficiency Tradeoff for Packet Processing with Multiple Resources
  2 December 2014
- Optimizing service times for a public health emergency using a genetic algorithm: Locating dispensing sites and allocating medical staff
  4 November 2014 | IIE Transactions on Healthcare Systems Engineering, Vol. 4, No. 4
- Peakedness‐Based Staffing for Call Center Outsourcing
  1 March 2014 | Production and Operations Management, Vol. 23, No. 3
- Closed-Form Approximations for Optimal (r,q) and (S,T) Policies in a Parallel Processing Environment
  SSRN Electronic Journal, Vol. 229
- On a bicriterion server allocation problem in a multidimensional Erlang loss system
  Journal of Computational and Applied Mathematics, Vol. 252
- APPLICATION OF ADAPTIVE NEURO FUZZY INFERENCE SYSTEM IN THE PROCESS OF TRANSPORTATION SUPPORT
  4 April 2013 | Asia-Pacific Journal of Operational Research, Vol. 30, No. 02
- Controlling excessive waiting times in small service systems with time-varying demand: An extension of the ISA algorithm
  Decision Support Systems, Vol. 54, No. 4
- Bayesian analysis of queues with impatient customers: Applications to call centers
  24 July 2012 | Naval Research Logistics (NRL), Vol. 59, No. 6
- Setting staffing requirements for time dependent queueing networks: The case of accident and emergency departments
  European Journal of Operational Research, Vol. 219, No. 3
- A Diffusion Regime with Nondegenerate Slowdown
  Rami Atar,
  1 April 2012 | Operations Research, Vol. 60, No. 2
- Many-server queues with customer abandonment: A survey of diffusion and fluid approximations
  13 March 2012 | Journal of Systems Science and Systems Engineering, Vol. 21, No. 1
- Refining Square-Root Safety Staffing by Expanding Erlang C
  A. J. E. M. Janssen,
  J. S. H. van Leeuwaarden,
  Bert Zwart,
  1 December 2011 | Operations Research, Vol. 59, No. 6
- Heavy-traffic limits for nearly deterministic queues: stationary distributions
  30 July 2011 | Queueing Systems, Vol. 69, No. 2
- Scheduling admissions and reducing variability in bed demand
  11 June 2011 | Health Care Management Science, Vol. 14, No. 3
- Setting Staffing Levels in Systems with Time-Varying Demand: The Context of an Emergency Department
  SSRN Electronic Journal, Vol. 15
- Controlling Excessive Waiting Times in Emergency Departments: An Extension of the ISA Algorithm
  SSRN Electronic Journal, Vol. 41
- Skill capacity planning and transformation scheduling of IT workforce under stochastic learning and turnover
- Optimal server allocation in general, finite, multi‐server queueing networks
  16 December 2010 | Applied Stochastic Models in Business and Industry, Vol. 26, No. 6
- Designing a call center with an IVR (Interactive Voice Response)
  14 October 2010 | Queueing Systems, Vol. 66, No. 3
- Heavy-traffic limits for nearly deterministic queues
  15 October 2010 | ACM SIGMETRICS Performance Evaluation Review, Vol. 38, No. 2
- Business Process Integration of Multiple Customer Order Review Systems
  IEEE Transactions on Engineering Management, Vol. 57, No. 3
- To pool or not to pool in hospitals: a theoretical and practical comparison for a radiotherapy outpatient department
  20 May 2009 | Annals of Operations Research, Vol. 178, No. 1
- Time-dependent analysis for refused admissions in clinical wards
  18 June 2009 | Annals of Operations Research, Vol. 178, No. 1
- Simulation based analysis of patient arrival to health care systems and evaluation of an operations improvement scheme
  18 June 2009 | Annals of Operations Research, Vol. 178, No. 1
- Dimensioning hospital wards using the Erlang loss model
  15 October 2009 | Annals of Operations Research, Vol. 178, No. 1
- Outsourcing decisions: the effect of scale economies and market structure
  4 May 2010 | Strategic Organization, Vol. 8, No. 2
- Locating and staffing service centers under service level constraints
  European Journal of Operational Research, Vol. 201, No. 1
- An Operational Mechanism Design for Fleet Management Coordination in Humanitarian Operations
  SSRN Electronic Journal, Vol. 12
- Staffing Many-Server Queues with Impatient Customers: Constraint Satisfaction in Call Centers
  Avishai Mandelbaum,
  Sergey Zeltyn,
  25 June 2009 | Operations Research, Vol. 57, No. 5
- Service Interruptions in Large-Scale Service Systems
  Guodong Pang,
  Ward Whitt,
  10 July 2009 | Management Science, Vol. 55, No. 9
- Partitioning of Servers in Queueing Systems During Rush Hour
  Bin Hu,
  Saif Benjaafar,
  24 September 2008 | Manufacturing & Service Operations Management, Vol. 11, No. 3
- Staffing to Maximize Profit for Call Centers with Alternate Service-Level Agreements
  Opher Baron,
  Joseph Milner,
  16 March 2009 | Operations Research, Vol. 57, No. 3
- Multiple criteria optimization method for the vehicle assignment problem in a bus transportation company
  8 January 2010 | Journal of Advanced Transportation, Vol. 43, No. 2
- On the staffing policy and technology investment in a specialty hospital offering telemedicine
  Decision Support Systems, Vol. 46, No. 2
- Performance Evaluation of Multi-Skill Call Center Considering Call Abandonment
- Service Competition with General Queueing Facilities
  Gad Allon,
  Awi Federgruen,
  1 August 2008 | Operations Research, Vol. 56, No. 4
- Assessing an ambulance service with queuing theory
  Computers & Operations Research, Vol. 35, No. 8
- Staffing of Time-Varying Queues to Achieve Time-Stable Performance
  Zohar Feldman,
  Avishai Mandelbaum,
  William A. Massey,
  Ward Whitt,
  1 February 2008 | Management Science, Vol. 54, No. 2
- Dimensioning Large-Scale Membership Services
  Francis de Véricourt,
  Otis B. Jennings,
  1 February 2008 | Operations Research, Vol. 56, No. 1
- What's in a wait?
  Health Policy, Vol. 85, No. 2
- Coping with Time‐Varying Demand When Setting Staffing Requirements for a Service System
  1 January 2007 | Production and Operations Management, Vol. 16, No. 1
- Evaluating Arrival Rate Uncertainty in Call Centers
- Design and Control of a Large Call Center: Asymptotic Analysis of an LP-Based Method
  Achal Bassamboo,
  J. Michael Harrison,
  Assaf Zeevi,
  1 June 2006 | Operations Research, Vol. 54, No. 3
- Allocation of Service Time in a Multiserver System
  Muhammad El-Taha,
  Bacel Maddah,
  1 April 2006 | Management Science, Vol. 52, No. 4
- A multi-class fluid model for a contact center with skill-based routing
  AEU - International Journal of Electronics and Communications, Vol. 60, No. 2
- Service Competition with General Queueing Facilities
  SSRN Electronic Journal, Vol. 43
- Modelling Variability in Hospital Bed Occupancy
  Health Care Management Science, Vol. 8, No. 4
- A Staffing Algorithm for Call Centers with Skill-Based Routing
  Rodney B. Wallace,
  Ward Whitt,
  1 October 2005 | Manufacturing & Service Operations Management, Vol. 7, No. 4
- Pricing and Design of Differentiated Services: Approximate Analysis and Structural Insights
  Constantinos Maglaras,
  Assaf Zeevi,
  1 April 2005 | Operations Research, Vol. 53, No. 2
- Capacity Planning and Management in Hospitals
- Stochastic Models of Customer Portfolio Management in Call Centers
- Staffing Software Maintenance and Support Projects
- Should We Model Dependence and Nonstationarity, and If So How?
- Staffing and Routing in a Two-Tier Call Center
  SSRN Electronic Journal, Vol. 52
- A Diffusion Approximation for the G/GI/n/m Queue
  Ward Whitt,
  1 December 2004 | Operations Research, Vol. 52, No. 6
- Diffusion Approximations for a Multiclass Markovian Service System with “Guaranteed” and “Best-Effort” Service Levels
  Constantinos Maglaras,
  Assaf Zeevi,
  1 November 2004 | Mathematics of Operations Research, Vol. 29, No. 4
- Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Heavy Traffic Regime
  J. Michael Harrison,
  Assaf Zeevi,
  1 April 2004 | Operations Research, Vol. 52, No. 2
- Dimensioning Large Call Centers
  Sem Borst,
  Avi Mandelbaum,
  Martin I. Reiman,
  1 February 2004 | Operations Research, Vol. 52, No. 1
- Fuzzy linear assignment problem: an approach to vehicle fleet deployment
- Pricing and Capacity Sizing for Systems with Shared Resources: Approximate Solutions and Scaling Relations
  Constantinos Maglaras,
  Assaf Zeevi,
  1 August 2003 | Management Science, Vol. 49, No. 8
- How Multiserver Queues Scale with Growing Congestion-Dependent Demand
  Ward Whitt,
  1 August 2003 | Operations Research, Vol. 51, No. 4
- Telephone Call Centers: Tutorial, Review, and Research Prospects
  Noah Gans,
  Ger Koole,
  Avishai Mandelbaum,
  1 April 2003 | Manufacturing & Service Operations Management, Vol. 5, No. 2
- Non-Markovian Queueing Systems
- How Many Hospital Beds?
  1 November 2002 | INQUIRY: The Journal of Health Care Organization, Provision, and Financing, Vol. 39, No. 4
- A cost model of industrial maintenance for profitability analysis and benchmarking
  International Journal of Production Economics, Vol. 79, No. 1
- Designing a Call Center with Impatient Customers
  O. Garnett,
  A. Mandelbaum,
  M. Reiman,
  1 July 2002 | Manufacturing & Service Operations Management, Vol. 4, No. 3
- The Efficiency-Quality Trade-Off of Cross-Trained Workers
  Edieal J. Pinker,
  Robert A. Shumsky,
  1 January 2000 | Manufacturing & Service Operations Management, Vol. 2, No. 1
- Partitioning Customers into Service Groups
  Ward Whitt,
  1 November 1999 | Management Science, Vol. 45, No. 11
- Periodic load balancing
  Queueing Systems, Vol. 30, No. 1-2
- INSIGHTS ON SERVICE SYSTEM DESIGN FROM A NORMAL APPROXIMATION TO ERLANG'S DELAY FORMULA
  1 September 1998 | Production and Operations Management, Vol. 7, No. 3
- Operations management and reengineering
  European Management Journal, Vol. 16, No. 3
- State-dependent stochastic networks. Part I. Approximations and applications with continuous diffusion limits
  The Annals of Applied Probability, Vol. 8, No. 2
- Allocation of queuing facilities using a minimax criterion
  Location Science, Vol. 5, No. 2
- Lightweight call setup — Supporting connection and connectionless services
- STATE‐OF‐THE‐ART SURVEY: OPEN QUEUEING NETWORKS: OPTIMIZATION AND PERFORMANCE EVALUATION MODELS FOR DISCRETE MANUFACTURING SYSTEMS
  1 June 1996 | Production and Operations Management, Vol. 5, No. 2
- Performance bounds for the effectiveness of pooling in multi-processing systems
  European Journal of Operational Research, Vol. 87, No. 2
- Um exame dos modelos de redes de filas abertas aplicados a sistemas de manufatura discretos: parte II
  Gestão & Produção, Vol. 2, No. 3
- APPROXIMATIONS FOR THE GI/G/m QUEUE
  1 June 1993 | Production and Operations Management, Vol. 2, No. 2

Volume 38, Issue 5

May 1992

Pages 609-755

Article Information

Metrics

Information

Published Online:May 01, 1992

Cite as

Ward Whitt, (1992) Understanding the Efficiency of Multi-Server Service Systems. Management Science 38(5):708-723.

https://doi.org/10.1287/mnsc.38.5.708

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Understanding the Efficiency of Multi-Server Service Systems

Abstract

Volume 38, Issue 5

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News