Queues with Many Servers: The Virtual Waiting-Time Process in the QED Regime
Abstract
We consider a first-come first-served multiserver queue in the Quality- and Efficiency-Driven (QED) regime. In this regime, which was first formalized by Halfin and Whitt, the number of servers N is not small, servers' utilization is (Efficiency-Driven) while waiting time is (Quality-Driven). This is equivalent to having the number of servers N being approximately equal to , where R is the offered load and β is a positive constant.
For the G/DK/N queue in the QED regime, we analyze the virtual waiting time VN(t), as N increases indefinitely. Assuming that the service-time distribution has a finite support (hence the DK in G/DK/N), it is shown that, in the limit, the scaled virtual waiting time is representable as a supremum over a random weighted tree (S denotes a service time). Informally, it is then argued that, for large N,


