Distributed Speed Scaling in Large-Scale Service Systems

Published Online:https://doi.org/10.1287/opre.2024.1012

References

  • Albers S, Müller F, Schmelzer S (2014) Speed scaling on parallel processors. Algorithmica 68(2):404–425.CrossrefGoogle Scholar
  • Ata B, Shneorson S (2006) Dynamic control of an M/M/1 service system with adjustable arrival and service rates. Management Sci. 52(11):1778–1791.LinkGoogle Scholar
  • Bansal N, Chan HL, Pruhs K (2009) Speed scaling with an arbitrary power function. SODA ’09: Proc. Twentieth Annual ACM-SIAM Sympos. Discrete Algorithms (SIAM, Philadelphia), 693–701.Google Scholar
  • Bonald T, Jonckheere M, Proutière A (2004) Insensitive load balancing. ACM SIGMETRICS Performance Evaluation Rev. 32(1):367–377.CrossrefGoogle Scholar
  • Ethier SN, Kurtz TG (1986) Markov Processes: Characterization and Convergence (John Wiley & Sons, Hoboken, NJ).CrossrefGoogle Scholar
  • Gandhi A, Doroudi S, Harchol-Balter M, Scheller-Wolf A (2013) Exact analysis of the M/M/k/setup class of Markov chains via recursive renewal reward. SIGMETRICS ‘13 Proc. ACM SIGMETRICS/Internat. Conf. Measurement Model. Comput. Systems (Association for Computing Machinery, New York), 153–166.Google Scholar
  • Gandhi A, Gupta V, Harchol-Balter M, Kozuch MA (2010) Optimality analysis of energy-performance trade-off for server farm management. Performance Evaluation 67(11):1155–1171.CrossrefGoogle Scholar
  • Gardner WA (1984) Learning characteristics of stochastic-gradient-descent algorithms: A general study, analysis, and critique. Signal Processing 6(2):113–133.CrossrefGoogle Scholar
  • Hou S, Ni W, Chen S, Zhao S, Cheng B, Chen J (2020) Real-time optimization of dynamic speed scaling for distributed data centers. IEEE Trans. Network Sci. Engrg. 7(3):2090–2103.CrossrefGoogle Scholar
  • Huang J, Liu Y, Li R, Li K, An J, Bai Y, Yang F, Xie G (2019) Optimal power allocation and load balancing for non-dedicated heterogeneous distributed embedded computing systems. J. Parallel Distributed Comput. 130:24–36.CrossrefGoogle Scholar
  • Jonckheere M, Prabhu BJ (2018) Asymptotics of insensitive load balancing and blocking phases. Queueing Systems Theory Appl. 88(3–4):243–278.CrossrefGoogle Scholar
  • Jones N (2018) How to stop data centres from gobbling up the world’s electricity. Nature 561(7722):163–167.CrossrefGoogle Scholar
  • Kurtz TG (1992) Averaging for martingale problems and stochastic approximation. Karatzas I, Ocone D, eds. Applied Stochastic Analysis, Lecture Notes in Control and Information Sciences, vol. 177 (Springer, Berlin), 186–209.CrossrefGoogle Scholar
  • Kushner HJ, Yin GG (1997) Stochastic Approximation and Recursive Algorithm and Applications (Springer, New York).Google Scholar
  • Lin M, Wierman A, Andrew LLH, Thereska E (2013) Dynamic right-sizing for power-proportional data centers. IEEE/ACM Trans. Networking 21(5):1378–1391.CrossrefGoogle Scholar
  • Maccio VJ, Down DG (2015) On optimal policies for energy-aware servers. Performance Evaluation 90:36–52.CrossrefGoogle Scholar
  • Mukherjee D, Stolyar A (2019) Join-idle-queue with service elasticity: Large-scale asymptotics of a nonmonotone system. Stochastic Systems 9(4):338–358.LinkGoogle Scholar
  • Mukherjee D, Dhara S, Borst SC, Van Leeuwaarden JSH (2017) Optimal service elasticity in large-scale distributed systems. Proc. ACM Measurement Anal. Comput. Systems 1(1):25.Google Scholar
  • Nedić A, Liu J (2018) Distributed optimization for control. Annual Rev. Control Robotics Autonomous Systems 1(1):77–103.CrossrefGoogle Scholar
  • Nemirovski A, Juditsky A, Lan G, Shapiro A (2009) Robust stochastic approximation approach to stochastic programming. SIAM J. Optim. 19(4):1574–1609.CrossrefGoogle Scholar
  • Polyak BT, Juditsky AB (1992) Acceleration of stochastic approximation by averaging. SIAM J. Control Optim. 30(4):838–855.CrossrefGoogle Scholar
  • Robbins H, Monro S (1951) A stochastic approximation method. Ann. Math. Statist. 22(3):400–407.CrossrefGoogle Scholar
  • Rutten D, Mukherjee D (2024) Mean-field analysis for load balancing on spatial graphs. Ann. Appl. Probab. 34(6):5228–5257.Google Scholar
  • Shehabi A, Smith S, Sartor D, Brown R, Herrlin M, Koomey J, Masanet E, Horner N, Azevedo I, Lintner W (2016) United States Data Center energy usage report. Technical report, Lawrence Berkeley National Lab, Berkeley, CA.Google Scholar
  • Stanojevic R, Shorten R (2010) Distributed dynamic speed scaling. Proc. 29th Conf. Inform. Comm. (IEEE Press, Piscataway, NJ), 426–430.Google Scholar
  • Stolyar AL (2015) Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems 80(4):341–361.CrossrefGoogle Scholar
  • Stolyar AL (2017) Pull-based load distribution among heterogeneous parallel servers: The case of multiple routers. Queueing Systems 85(1):31–65.CrossrefGoogle Scholar
  • Su Y, Yu J, Anand V, Wierman A (2022) Learning-augmented energy-aware scheduling of precedence-constrained tasks. ACM SIGMETRICS Performance Evaluation Rev. 49(2):3–5.CrossrefGoogle Scholar
  • Tsitsiklis JN (1984) Problems in decentralized decision making and computation. PhD thesis, Massachusetts Institute of Technology, Cambridge.Google Scholar
  • Tsitsiklis J, Bertsekas D, Athans M (1986) Distributed asynchronous deterministic and stochastic gradient optimization algorithms. IEEE Trans. Automatic Control 31(9):803–812.CrossrefGoogle Scholar
  • van der Boor M, Comte C (2021) Load balancing in heterogeneous server clusters: Insights from a product-form queueing model. IEEE/ACM 29th Internat. Sympos. Quality Service (IWQOS), (IEEE, Piscataway, NJ), 1–10.Google Scholar
  • Vaze R, Nair J (2020) Multiple server SRPT with speed scaling is competitive. IEEE/ACM Trans. Networking 28(4):1739–1751.CrossrefGoogle Scholar
  • Wierman A, Andrew LLH, Tang A (2009) Power-aware speed scaling in processor sharing systems. IEEE Infocom 2009 (IEEE, Piscataway, NJ), 2007–2015.Google Scholar
  • Yang T, Yi X, Wu J, Yuan Y, Wu D, Meng Z, Hong Y, Wang H, Lin Z, Johansson KH (2019) A survey of distributed optimization. Annual Rev. Control 47:278–305.CrossrefGoogle Scholar
  • Yuan K, Ling Q, Yin W (2016) On the convergence of decentralized gradient descent. SIAM J. Optim. 26(3):1835–1854.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.