Learning and Balancing Unknown Loads in Large-Scale Systems
Published Online:3 May 2024https://doi.org/10.1287/moor.2021.0212
References
- [1] (2019) The hydrodynamic limit of a randomized load balancing network. Ann. Appl. Probab. 29(4):2114–2174.Crossref, Google Scholar
- [2] (2017) The PDE method for the analysis of randomized load balancing networks. Proc. ACM Measurement Anal. Comput. Systems 1(2):1–28.Google Scholar
- [3] (2010) Optimal content placement for a large-scale VoD system. Proc. 6th Internat. Conf. Co-NEXT’10 (ACM, New York), 1–12.Google Scholar
- [4] (2008) Dynamic pull-based load balancing for autonomic servers. NOMS 2008-2008 IEEE Network Oper. Management Sympos. (IEEE, Piscataway, NJ), 751–754.Google Scholar
- [5] (2002) Quality of service and flow level admission control in the internet. Comput. Networks 40(1):57–71.Crossref, Google Scholar
- [6] (2022) Scalable load balancing in networked systems: A survey of recent advances. SIAM Rev. 64(3):554–622.Crossref, Google Scholar
- [7] (1998) State space collapse with application to heavy traffic limits for multiclass queueing networks. Queueing Systems 30:89–140.Crossref, Google Scholar
- [8] (2010) Randomized load balancing with general service time distributions. Performance Evaluation Rev. 38(1):275–286.Crossref, Google Scholar
- [9] (2012) Asymptotic independence of queues under randomized load balancing. Queueing Systems 71:247–292.Crossref, Google Scholar
- [10] (2008) Statistics and social network of YouTube videos. 2008 16th Internat. Workshop Quality Service (IEEE, Piscataway, NJ), 229–238.Google Scholar
- [11] (1980) A simple dynamic routing problem. IEEE Trans. Automatic Control 25(4):690–693.Crossref, Google Scholar
- [12] (2017) Large-scale join-idle-queue system with general service times. J. Appl. Probab. 54(4):995–1007.Crossref, Google Scholar
- [13] (2018) Delay, memory, and messaging tradeoffs in distributed service systems. Stochastic Systems 8(1):45–74.Link, Google Scholar
- [14] (2020) A lower bound on the queueing delay in resource constrained load balancing. Ann. Appl. Probab. 30(2):870–901.Crossref, Google Scholar
- [15] (2021) Automatic cloud instance provisioning with quality and efficiency. Performance Evaluation 149–150:102209.Crossref, Google Scholar
- [16] (2017) Controlling the number of active instances in a cloud environment. Performance Evaluation Rev. 45(3):15–20.Crossref, Google Scholar
- [17] (2022) Self-learning threshold-based load balancing. INFORMS J. Comput. 34(1):39–54.Link, Google Scholar
- [18] (2012) QoE-based multi-stream scalable video adaptation over wireless networks with proxy. 2012 IEEE Internat. Conf. Comm. (ICC) (IEEE, Piscataway, NJ), 7088–7092.Google Scholar
- [19] (2016) Optimal rate allocation for video streaming in wireless networks with user dynamics. IEEE/ACM Trans. Networking 24(2):820–835.Crossref, Google Scholar
- [20] (2011) Reversibility and Stochastic Networks (Cambridge University Press, Cambridge, UK).Google Scholar
- [21] (1945) The Advanced Theory of Statistics, vol. I (Charles Griffin and Co., London).Google Scholar
- [22] (2004) Fair internet traffic integration: Network flow models and analysis. Ann. Telecomm. 59(11):1338–1352.Crossref, Google Scholar
- [23] (2011) Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services. Performance Evaluation 68(11):1056–1071.Crossref, Google Scholar
- [24] (1979) Inequalities: Theory of Majorization and Its Applications (Academic Press, Cambridge, MA).Google Scholar
- [25] (1991) Optimality of routing and servicing in dependent parallel processing systems. Queueing Systems 9:403–418.Crossref, Google Scholar
- [26] (2001) The power of two choices in randomized load balancing. IEEE Trans. Parallel Distributed Systems 12(10):1094–1104.Crossref, Google Scholar
- [27] (2000) Fair end-to-end window-based congestion control. IEEE/ACM Trans. Networking 8(5):556–567.Crossref, Google Scholar
- [28] (2019) Join idle queue with service elasticity: Large-scale asymptotics of a nonmonotone system. Stochastic Systems 9(4):338–358.Link, Google Scholar
- [29] (2016) Universality of load balancing schemes on the diffusion scale. J. Appl. Probab. 53(4):1111–1124.Crossref, Google Scholar
- [30] (2020) Asymptotic optimality of power-of-d load balancing in large-scale systems. Math. Oper. Res. 45(4):1535–1571.Link, Google Scholar
- [31] (2017) Optimal service elasticity in large-scale distributed systems. Proc. ACM Measurement Anal. Comput. Systems 1(1):1–28.Google Scholar
- [32] (2011) Understanding demand volatility in large VoD systems. Proc. 21st Internat. Workshop Network Oper. Systems Support Digital Audio Video (NOSSDAV’11) (ACM, New York), 39–44.Google Scholar
- [33] (2011) Demand forecast and performance prediction in peer-assisted on-demand streaming systems. 2011 Proc. IEEE INFOCOM (IEEE, Piscataway, NJ), 421–425.Google Scholar
- [34] (1973) Warteschlangen (Springer, Berlin, Heidelberg).Crossref, Google Scholar
- [35] (1993) Extremal properties of the shortest/longest non-full queue policies in finite-capacity systems with state-dependent service rates. J. Appl. Probab. 30(1):223–236.Crossref, Google Scholar
- [36] (2013) Communication Networks: An Optimization, Control, and Stochastic Networks Perspective (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- [37] (2015) Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems 80:341–361.Crossref, Google Scholar
- [38] (2017) Pull-based load distribution among heterogeneous parallel servers: The case of multiple routers. Queueing Systems 85:31–65.Crossref, Google Scholar
- [39] (2017) Mean-field analysis of loss models with mixed-Erlang distributions under power-of-d routing. 2017 29th Internat. Teletraffic Congress (IEEE, Piscataway, NJ), 250–258.Google Scholar
- [40] (1996) Queueing system with selection of the shortest of two queues: An asymptotic approach. Problemy Peredachi Informatsii 32(1):20–34.Google Scholar
- [41] (1977) Optimality of the shortest line discipline. J. Appl. Probab. 14(1):181–189.Crossref, Google Scholar

