Learning and Balancing Unknown Loads in Large-Scale Systems

Published Online:https://doi.org/10.1287/moor.2021.0212

References

  • [1] Aghajani R, Ramanan K (2019) The hydrodynamic limit of a randomized load balancing network. Ann. Appl. Probab. 29(4):2114–2174.CrossrefGoogle Scholar
  • [2] Aghajani R, Li X, Ramanan K (2017) The PDE method for the analysis of randomized load balancing networks. Proc. ACM Measurement Anal. Comput. Systems 1(2):1–28.Google Scholar
  • [3] Applegate D, Archer A, Gopalakrishnan V, Lee S, Ramakrishnan KK (2010) Optimal content placement for a large-scale VoD system. Proc. 6th Internat. Conf. Co-NEXT’10 (ACM, New York), 1–12.Google Scholar
  • [4] Badonnel R, Burgess M (2008) Dynamic pull-based load balancing for autonomic servers. NOMS 2008-2008 IEEE Network Oper. Management Sympos. (IEEE, Piscataway, NJ), 751–754.Google Scholar
  • [5] Benameur N, Fredj SB, Oueslati-Boulahia S, Roberts JW (2002) Quality of service and flow level admission control in the internet. Comput. Networks 40(1):57–71.CrossrefGoogle Scholar
  • [6] Boor Mvd, Borst SC, van Leeuwaarden JSH, Mukherjee D (2022) Scalable load balancing in networked systems: A survey of recent advances. SIAM Rev. 64(3):554–622.CrossrefGoogle Scholar
  • [7] Bramson M (1998) State space collapse with application to heavy traffic limits for multiclass queueing networks. Queueing Systems 30:89–140.CrossrefGoogle Scholar
  • [8] Bramson M, Lu Y, Prabhakar B (2010) Randomized load balancing with general service time distributions. Performance Evaluation Rev. 38(1):275–286.CrossrefGoogle Scholar
  • [9] Bramson M, Lu Y, Prabhakar B (2012) Asymptotic independence of queues under randomized load balancing. Queueing Systems 71:247–292.CrossrefGoogle Scholar
  • [10] Cheng X, Dale C, Liu J (2008) Statistics and social network of YouTube videos. 2008 16th Internat. Workshop Quality Service (IEEE, Piscataway, NJ), 229–238.Google Scholar
  • [11] Ephremides A, Varaiya P, Walrand J (1980) A simple dynamic routing problem. IEEE Trans. Automatic Control 25(4):690–693.CrossrefGoogle Scholar
  • [12] Foss SG, Stolyar AL (2017) Large-scale join-idle-queue system with general service times. J. Appl. Probab. 54(4):995–1007.CrossrefGoogle Scholar
  • [13] Gamarnik D, Tsitsiklis JN, Zubeldia M (2018) Delay, memory, and messaging tradeoffs in distributed service systems. Stochastic Systems 8(1):45–74.LinkGoogle Scholar
  • [14] Gamarnik D, Tsitsiklis JN, Zubeldia M (2020) A lower bound on the queueing delay in resource constrained load balancing. Ann. Appl. Probab. 30(2):870–901.CrossrefGoogle Scholar
  • [15] Goldsztajn D, Ferragut A, Paganini F (2021) Automatic cloud instance provisioning with quality and efficiency. Performance Evaluation 149–150:102209.CrossrefGoogle Scholar
  • [16] Goldsztajn D, Ferragut A, Paganini F, Jonckheere M (2017) Controlling the number of active instances in a cloud environment. Performance Evaluation Rev. 45(3):15–20.CrossrefGoogle Scholar
  • [17] Goldsztajn D, Borst SC, van Leeuwaarden JSH, Mukherjee D, Whiting PA (2022) Self-learning threshold-based load balancing. INFORMS J. Comput. 34(1):39–54.LinkGoogle Scholar
  • [18] Hu H, Zhu X, Wang Y, Pan R, Zhu J, Bonomi F (2012) QoE-based multi-stream scalable video adaptation over wireless networks with proxy. 2012 IEEE Internat. Conf. Comm. (ICC) (IEEE, Piscataway, NJ), 7088–7092.Google Scholar
  • [19] Joseph V, Borst SC, Reiman MI (2016) Optimal rate allocation for video streaming in wireless networks with user dynamics. IEEE/ACM Trans. Networking 24(2):820–835.CrossrefGoogle Scholar
  • [20] Kelly FP (2011) Reversibility and Stochastic Networks (Cambridge University Press, Cambridge, UK).Google Scholar
  • [21] Kendall MG (1945) The Advanced Theory of Statistics, vol. I (Charles Griffin and Co., London).Google Scholar
  • [22] Key P, Massoulié L, Bain A, Kelly F (2004) Fair internet traffic integration: Network flow models and analysis. Ann. Telecomm. 59(11):1338–1352.CrossrefGoogle Scholar
  • [23] Lu Y, Xie Q, Kliot G, Geller A, Larus JR, Greenberg A (2011) Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services. Performance Evaluation 68(11):1056–1071.CrossrefGoogle Scholar
  • [24] Marshall AW, Olkin I, Arnold BC (1979) Inequalities: Theory of Majorization and Its Applications (Academic Press, Cambridge, MA).Google Scholar
  • [25] Menich R, Serfozo RF (1991) Optimality of routing and servicing in dependent parallel processing systems. Queueing Systems 9:403–418.CrossrefGoogle Scholar
  • [26] Mitzenmacher M (2001) The power of two choices in randomized load balancing. IEEE Trans. Parallel Distributed Systems 12(10):1094–1104.CrossrefGoogle Scholar
  • [27] Mo J, Walrand J (2000) Fair end-to-end window-based congestion control. IEEE/ACM Trans. Networking 8(5):556–567.CrossrefGoogle Scholar
  • [28] Mukherjee D, Stolyar AL (2019) Join idle queue with service elasticity: Large-scale asymptotics of a nonmonotone system. Stochastic Systems 9(4):338–358.LinkGoogle Scholar
  • [29] Mukherjee D, Borst SC, van Leeuwaarden JSH, Whiting PA (2016) Universality of load balancing schemes on the diffusion scale. J. Appl. Probab. 53(4):1111–1124.CrossrefGoogle Scholar
  • [30] Mukherjee D, Borst SC, van Leeuwaarden JSH, Whiting PA (2020) Asymptotic optimality of power-of-d load balancing in large-scale systems. Math. Oper. Res. 45(4):1535–1571.LinkGoogle Scholar
  • [31] Mukherjee D, Dhara S, Borst SC, van Leeuwaarden JSH (2017) Optimal service elasticity in large-scale distributed systems. Proc. ACM Measurement Anal. Comput. Systems 1(1):1–28.Google Scholar
  • [32] Niu D, Li B, Zhao S (2011) Understanding demand volatility in large VoD systems. Proc. 21st Internat. Workshop Network Oper. Systems Support Digital Audio Video (NOSSDAV’11) (ACM, New York), 39–44.Google Scholar
  • [33] Niu D, Liu Z, Li B, Zhao S (2011) Demand forecast and performance prediction in peer-assisted on-demand streaming systems. 2011 Proc. IEEE INFOCOM (IEEE, Piscataway, NJ), 421–425.Google Scholar
  • [34] Schassberger R (1973) Warteschlangen (Springer, Berlin, Heidelberg).CrossrefGoogle Scholar
  • [35] Sparaggis PD, Towsley D, Cassandras C (1993) Extremal properties of the shortest/longest non-full queue policies in finite-capacity systems with state-dependent service rates. J. Appl. Probab. 30(1):223–236.CrossrefGoogle Scholar
  • [36] Srikant R, Ying L (2013) Communication Networks: An Optimization, Control, and Stochastic Networks Perspective (Cambridge University Press, Cambridge, UK).CrossrefGoogle Scholar
  • [37] Stolyar AL (2015) Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems 80:341–361.CrossrefGoogle Scholar
  • [38] Stolyar AL (2017) Pull-based load distribution among heterogeneous parallel servers: The case of multiple routers. Queueing Systems 85:31–65.CrossrefGoogle Scholar
  • [39] Vasantam T, Mukhopadhyay A, Mazumdar RR (2017) Mean-field analysis of loss models with mixed-Erlang distributions under power-of-d routing. 2017 29th Internat. Teletraffic Congress (IEEE, Piscataway, NJ), 250–258.Google Scholar
  • [40] Vvedenskaya ND, Dobrushin RL, Karpelevich FI (1996) Queueing system with selection of the shortest of two queues: An asymptotic approach. Problemy Peredachi Informatsii 32(1):20–34.Google Scholar
  • [41] Winston W (1977) Optimality of the shortest line discipline. J. Appl. Probab. 14(1):181–189.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.