Stability, Memory, and Messaging Trade-Offs in Heterogeneous Service Systems

Published Online: https://doi.org/10.1287/moor.2021.1191

References

  • [1] Anselmi J (2019) Combining size-based load balancing with round-robin for scalable low latency. IEEE Trans. Parallel Distributed Systems 31(4):886–896.
  • [2] Anselmi J, Dufour F (2020) Power-of-d-choices with memory: Fluid limit and optimality. Math. Oper. Res. 45(3):862–888.
  • [3] Atar R, Keslassy I, Mendelson G, Orda A, Vargaftik S (2020) Persistent-Idle load-distribution. Stochastic Systems 10(2):152–169.
  • [4] Bramson M (2011) Stability of join the shortest queue network. Ann. Appl. Probab. 21(4):1568–1625.
  • [5] Foster F (1953) On the stochastic matrices associated with certain queueing processes. Ann. Math. Statist. 24(3):355–360.
  • [6] Gamarnik D, Tsitsiklis JN, Zubeldia M (2018) Delay, memory, and messaging tradeoffs in distributed service systems. Stochastic Systems 8(1):45–74.
  • [7] Gamarnik D, Tsitsiklis JN, Zubeldia M (2020) A lower bound on the queueing delay in resource constrained load balancing. Ann. Appl. Probab. 30(2):870–901.
  • [8] Lu Y, Xie Q, Kliot G, Geller A, Larus J, Greenberg A (2011) Join-Idle-Queue: A novel load balancing algorithm for dynamically scalable web services. Performance Evaluation 68(11):1056–1071.
  • [9] Mitzenmacher M (1996) The power of two choices in randomized load balancing. Unpublished PhD thesis, University of California, Berkeley.
  • [10] Mukherjee D, Borst S, van Leeuwaarden J, Whiting P (2016) Universality of power-of-d load balancing schemes. Workshop Math. Perform. Model. Anal. ACM SIGMETRICS Performance Evaluation Rev. 44(2):36–38.
  • [11] Shah D, Prabhakar B (2002) The use of memory in randomized load balancing. Proc. IEEE Internat. Sympos. Inform. Theory (Institute of Electrical and Electronics Engineers, Piscataway, NJ).
  • [12] Stolyar A (2015) Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems 80(4):341–361.
  • [13] van der Boor M, Zubeldia M, Borst S (2020) Zero-wait load balancing with sparse messaging. Oper. Res. Lett. 48(3):368–375.
  • [14] Vargaftik S, Keslassy I, Orda A (2020) LSQ: Load balancing in large-scale heterogeneous systems with multiple dispatchers. IEEE/ACM Trans. Networking 28(3):1186–1198.
  • [15] Vvedenskaya ND, Dobrushin RL, Karpelevich FI (1996) Queueing system with selection of the shortest of two queues: An asymptotic approach. Problems Inform. Transmission 32(1):15–27.
  • [16] Zhou X, Shroff N, Wierman A (2021) Asymptotically optimal load balancing in large-scale heterogeneous systems with multiple dispatchers. ACM SIGMETRICS Performance Evaluation Rev. 48(3):57–58.