Online Learning and Optimization for Queues with Unknown Arrival Rate and Service Distribution

