A Theory of Auto-Scaling for Resource Reservation in Cloud Services
Published Online:1 Feb 2022https://doi.org/10.1287/stsy.2021.0091
References
- (2013) Cloud monitoring: A survey. Comput. Networking 57(9):2093–2115.Google Scholar
- Amazon (2021) Amazon EC2 auto scaling with EC2 spot instances. Accessed January, 17, 2022, https://aws.amazon.com/getting-started/hands-on/ec2-auto-scaling-spot-instances/.Google Scholar
- Amazon On-Demand Instances (2021) Amazon on-demand instances. Accessed January, 17, 2022, https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-on-demand-instances.html.Google Scholar
- Amazon Spot Instances (2021) Amazon spot instances. Accessed January, 17, 2022, https://aws.amazon.com/ec2/spot/.Google Scholar
- Amazon Web Services (2020a) Amazon AWS containers. Accessed January, 17, 2022, https://aws.amazon.com/containers/.Google Scholar
- Amazon Web Services (2020b) Amazon EC2 auto-scaler. Accessed January, 17, 2022, https://docs.aws.amazon.com/autoscaling/.Google Scholar
- Amazon Web Services (2020c) Amazon web Services (AWS). Accessed January, 17, 2022, https://aws.amazon.com/.Google Scholar
- Amazon Web Services (2020d) AWS service level agreements (SLAs). Accessed January, 17, 2022, https://aws.amazon.com/legal/service-level-agreements/.Google Scholar
- (2000) Unbounded knapsack problem: Dynamic programming revisited. Eur. J. Oper. Res. 123(2):394–407.Google Scholar
- Apache Software Foundation (2019) Apache hadoop yarn. Accessed January, 17, 2022, http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html.Google Scholar
- (1995) Asymptotic analysis of single resource loss systems in heavy traffic, with applications to integrated networks. Adv. Appl. Probability 27(1):273–292.Google Scholar
- (2008) Probability and Measure (John Wiley & Sons, Hoboken, NJ).Google Scholar
- (2006) Queueing Networks and Markov Chains: Modeling and Performance Evaluation with Computer Science Applications, 2nd ed., vol. 95 (John Wiley & Sons, Hoboken, NJ).Google Scholar
- (2004) Convex Optimization (Cambridge University Press, New York).Google Scholar
- (2017) A formal proof in Coq of LaSalle’s invariance principle. Ayala-Rincón M, Muñoz CA, eds. Interactive Theorem Proving (Springer International Publishing, Cham, Switzerland), 148–163.Google Scholar
- (2014) VM consolidation: A real case based on openstack cloud. Future Generation Comput. Systems 32:118–127.Google Scholar
- (2016) Online knapsack revisited. Theory Comput. Systems 58(1):153–190.Google Scholar
- (2014) Asymptotic optimality of bestfit for stochastic bin packing. Performance Evaluation Rev. 42(2):64–66.Google Scholar
- (2018) An autonomic resource provisioning approach for service-based cloud applications: A hybrid approach. Future Generation Comput. Systems 78:191–210.Google Scholar
- Google Cloud (2020a) Google Cloud. Accessed January, 17, 2022, https://cloud.google.com/.Google Scholar
- Google Cloud (2020b) Google compute engine pricing. Accessed January, 17, 2022, https://cloud.google.com/compute/all-pricing.Google Scholar
- Google Cloud (2020c) Google kubernetes. Accessed January, 17, 2022, https://cloud.google.com/kubernetes/.Google Scholar
- (2018) Online VM auto-scaling algorithms for application hosting in a cloud. IEEE Trans. Cloud Comput. 8(3):889–898.Google Scholar
- (2012) Online stochastic bin packing. Preprint, submitted November 12, 2012. Accessed January, 17, 2022, https://arxiv.org/abs/1211.2687.Google Scholar
- (2012) Lightweight resource scaling for cloud applications. Proc. EEE/ACM Internat. Sympos. on Cluster, Cloud and Grid Comput. (IEEE Computer Society, Los Alamitos, CA), 644–651.Google Scholar
- (1994) Large loss networks. Stochastic Processing Appl. 53(2):363–378.Google Scholar
- (1997) Optimization via trunk reservation in single resource loss systems under heavy traffic. Ann. Appl. Probability 7(4):1058–1079.Google Scholar
- (1975) Fast approximation algorithms for the knapsack and sum of subset problems. J. ACM 22(4):463–468.Google Scholar
- (2002) Removable online knapsack problems. Proc. Internat. Colloquium on Automata, Languages, and Programming (Springer, Berlin), 293–305.Google Scholar
- (2013) Optimal cloud resource auto-scaling for web applications. Proc. IEEE/ACM Internat. Sympos. on Cluster, Cloud, and Grid Comput. (IEEE Press, Delft, Netherlands), 58–65.Google Scholar
- (2020) Instant virtual machine live migration. Proc. Internat. Conf. on the Econom. of Grids, Clouds, Systems, and Services (Springer, Berlin), 155–170.Google Scholar
- (2017) Choosing among heterogeneous server clouds. Queueing Systems 85(1-2):1–29.Google Scholar
- (2004) Multidimensional knapsack problem. Knapsack Problems (Springer, Berlin), 235–283.Google Scholar
- (1991) Loss networks. Ann. Appl. Probability 1(3):319–378.Google Scholar
- (1990) Optimal control and trunk reservation in loss networks. Probability Engrg. Inform. Sci. 4(2):203–242.Google Scholar
- (1960) Some extensions of Liapunov’s second method. IRE Trans. Circuit Theory 7(4):520–527.Google Scholar
- (2020) A survey of live virtual machine migration techniques. Comput. Sci. Rev. 38:100304.Google Scholar
- (2012) Stochastic models of load balancing and scheduling in cloud computing clusters. Proc. IEEE INFOCOM (IEEE, New York), 702–710.Google Scholar
- (2014) Heavy traffic optimal resource allocation algorithms for cloud computing clusters. Performance Evaluation 81:20–39.Google Scholar
- (2010) Cloud auto-scaling with deadline and budget constraints. Proc. IEEE/ACM Internat. Conf. on Grid Comput. (IEEE Computer Society, Los Alamitos, CA), 41–48.Google Scholar
- (1995) Stochastic on-line knapsack problems. Math. Programming 68(1-3):73–104.Google Scholar
- (1990) An exact algorithm for large unbounded knapsack problems. Oper. Res. Lett. 9(1):15–20.Google Scholar
- Microsoft Azure (2020) Microsoft Azure. Accessed January, 17, 2022, https://azure.microsoft.com/.Google Scholar
- (2015) Mean field and propagation of chaos in multi-class heterogeneous loss models. Performance Evaluation 91:117–131.Google Scholar
- (2017) On non-preemptive VM scheduling in the cloud. Proc. ACM on Measurement and Analysis of Comput. Systems, vol. 1 (ACM, New York), 1–29.Google Scholar
- (2018) Randomized algorithms for scheduling multi-resource jobs in the cloud. IEEE/ACM Trans. Networks 26(5):2202–2215.Google Scholar
- (2018) Auto-scaling web applications in clouds: A taxonomy and survey. ACM Comput. Survey 51(4):1–33.Google Scholar
- (2014) A sharing-aware greedy algorithm for virtual machine maximization. Proc. IEEE 13th Internat. Sympos. on Network Comput. and Applications (IEEE Computer Society, Los Alamitos, CA), 113–120.Google Scholar
- RedHat (2021) Cloud native applications: Stateful vs stateless. Accessed January, 17, 2022, https://www.redhat.com/en/topics/cloud-native-apps/stateful-vs-stateless.Google Scholar
- (2011) Efficient autoscaling in the cloud using predictive models for workload forecasting. Proc. IEEE 4th Internat. Conf. on Cloud Comput., (IEEE Computer Society, Los Alamitos, CA), 500–507.Google Scholar
- (2018) Fast multi-resource allocation with patterns in large scale cloud data center. J. Comput. Sci. 26:389–401.Google Scholar
- (2013) Adaptive resource provisioning for the cloud using online bin packing. IEEE Trans. Comput. 63(11):2647–2660.Google Scholar
- (2012) Virtual machine resource allocation for service hosting on heterogeneous distributed platforms. Proc. IEEE Internat. Parallel Distributed Processing Sympos. (IPDPS) 2012, Shanghai, China, 786–797.Google Scholar
- (2013) An infinite server system with general packing constraints. Oper. Res. 61(5):1200–1217.Link, Google Scholar
- (2017) Large-scale heterogeneous service systems with general packing constraints. Adv. Appl. Probability 49(1):61–83.Google Scholar
- (2013) A large-scale service system with packing constraints: Minimizing the number of occupied servers. ACM SIGMETRICS Performance Evaluation Rev. 41(1):41–52.Google Scholar
- (2015) Asymptotic optimality of a greedy randomized algorithm in a large-scale service system with general packing constraints. Queueing Systems 79(2):117–143.Google Scholar
- (2015) Large-scale cluster management at Google with Borg. Proc. 10th Eur. Conf. on Comput. Systems (Association for Computing Machinery, New York), 1–17.Google Scholar
- (1985) Blocking when service is required from several facilities simultaneously. ATT Tech. J. 64(8):1807–1856.Google Scholar
- (2011) Google cluster data. Accessed January 17, 2022, https://github.com/google/cluster-data.Google Scholar
- (2015) Power of d choices for large-scale bin packing: A loss model. Performance Evaluation Rev. 43(1):321–334.Google Scholar
- (2015) Joint VM placement and topology optimization for traffic scalability in dynamic datacenter networks. Comput. Networks 80:109–123.Google Scholar

