Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies
Published Online: 4 Mar 2024. https://doi.org/10.1287/mnsc.2022.02476
References
- (2022) Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management. Oper. Res. 70(3):1646–1664.
- (2014) Managing uncertainty through supply chain flexibility: Reactive vs. proactive approaches. Production Manufacturing Res. 2(1):50–70.
- (1993) Diversification under supply uncertainty. Management Sci. 39(8):944–963.
- (2008) Applied Probability and Queues, vol. 51 (Springer Science & Business Media, New York).
- (2007) Competition and diversification effects in supply chains with supplier default risk. Manufacturing Service Oper. Management 9(2):123–146.
- (2020) Asymptotic optimality of semi-open-loop policies in Markov decision processes with large lead times. Preprint, submitted October 20, https://dx.doi.org/10.2139/ssrn.3685551.
- (2016) A simple heuristic for joint inventory and pricing models with lead time and backorders. Management Sci. 62(8):2358–2373.
- (2011) Lost-sales inventory theory: A review. Eur. J. Oper. Res. 215(1):1–13.
- (1999) Myopic heuristics for the random yield problem. Oper. Res. 47(5):713–722.
- (2020) Constant-order policies for lost-sales inventory models with random supply functions: Asymptotics and heuristic. Oper. Res. 68(4):1063–1073.
- (2003) Supply chain coordination with contracts. Graves SC, de Kok AG, eds. Supply Chain Management: Design, Coordination and Operation, Handbooks in Operations Research and Management Science, vol. 11 (Elsevier, Amsterdam), 227–339.
- (2008) Joint replenishment and pricing decisions in inventory systems with stochastically dependent supply capacity. Eur. J. Oper. Res. 191(1):142–155.
- (2019) Tailored base-surge policies in dual-sourcing inventory systems with demand learning. Preprint, submitted September 27, https://dx.doi.org/10.2139/ssrn.3456834.
- (2019) Coordinating pricing and inventory replenishment with nonparametric demand learning. Oper. Res. 67(4):1035–1052.
- (2021) Nonparametric learning algorithms for joint pricing and inventory control with lost sales and censored demand. Math. Oper. Res. 46(2):726–756.
- (2023) Asymptotic optimality of constant-order policies in joint pricing and inventory control models. Math. Oper. Res., ePub ahead of print April 17, https://doi.org/10.1287/moor.2023.1367.
- (2023) Optimal policies for dynamic pricing and inventory control with nonparametric censored demands. Management Sci., ePub ahead of print August 31, https://doi.org/10.1287/mnsc.2023.4859.
- (1994) A periodic review, production planning model with uncertain capacity and uncertain demand-optimality of extended myopic policies. Management Sci. 40(3):320–332.
- (2007) A newsvendor’s procurement problem when suppliers are unreliable. Manufacturing Service Oper. Management 9(1):9–32.
- (2020) A primal-dual approach to analyzing ATO systems. Management Sci. 66(11):5389–5407.
- (2019) Provably efficient reinforcement learning with aggregated states. Preprint, submitted December 13, https://arxiv.org/abs/1912.06366.
- (2006) Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. J. Machine Learn. Res. 7(6):1079–1105.
- (2008) Selecting a portfolio of suppliers under demand and supply risks. Oper. Res. 56(4):916–936.
- (2009) Optimal supply diversification under general supply risks. Oper. Res. 57(6):1451–1468.
- (2010) Integrating dynamic pricing and replenishment decisions under supply capacity uncertainty. Management Sci. 56(12):2154–2172.
- (2018) Supply and demand functions in inventory models. Oper. Res. 66(1):77–91.
- (2016) Asymptotic optimality of constant-order policies for lost sales inventory models with large lead times. Math. Oper. Res. 41(3):898–913.
- (2021) Bandits atop reinforcement learning: Tackling online inventory models with cyclic demands. Preprint, submitted July 21, https://dx.doi.org/10.2139/ssrn.3637705.
- (2012) Supply-side story: Risks, guarantees, competition, and information asymmetry. Management Sci. 58(9):1694–1714.
- (1990) The structure of periodic review policies in the presence of random yield. Oper. Res. 38(4):634–643.
- (2010) Linear inflation rules for the random yield problem: Analysis and computations. Oper. Res. 58(1):244–251.
- (2009) A nonparametric asymptotic analysis of inventory planning with censored demand. Math. Oper. Res. 34(1):103–123.
- (2009a) An adaptive algorithm for finding the optimal base-stock policy in lost sales inventory systems with censored demand. Math. Oper. Res. 34(2):397–416.
- (2009b) Asymptotic optimality of order-up-to policies in lost sales inventory systems. Management Sci. 55(3):404–420.
- (2015) Exact and heuristic linear-inflation policies for an inventory model with random yield and arbitrary lead times. Eur. J. Oper. Res. 245(1):109–120.
- (2004) Lost-sales problems with stochastic lead times: Convexity results for base-stock policies. Oper. Res. 52(5):795–803.
- (2018) Is Q-learning provably efficient? Adv. Neural Inform. Processing Systems, vol. 31 (MIT Press, Cambridge, MA).
- (2004) Production planning under yield and demand uncertainty with yield-dependent cost and price. Manufacturing Service Oper. Management 6(3):209–224.
- Lei M, Liu S, Jasin S, Vakhutinsky A (2022) Joint inventory and pricing for a one-warehouse multistore problem: Spiraling phenomena, near optimal policies, and the value of dynamic pricing. Oper. Res., ePub ahead of print October 17, https://doi.org/10.1287/opre.2022.2389.
- (2008) A 2-approximation algorithm for stochastic inventory control models with lost sales. Math. Oper. Res. 33(2):351–374.
- (2013) Supply diversification with responsive pricing. Production Oper. Management 22(2):447–458.
- (2006) Model uncertainty, robust optimization, and learning. INFORMS TutORials Oper. Res. 66–94.
- (2022) Technical note—Data-driven newsvendor problem: Performance of the sample average approximation. Oper. Res. 70(4):1996–2012.
- (2021) UCB-type learning algorithms for lost-sales inventory models with lead times. Preprint, submitted October 18, https://dx.doi.org/10.2139/ssrn.3944354.
- (2022) Supply chain management during and post-Covid-19 pandemic: Mitigation strategies and practical lessons learned. J. Bus. Res. 142:1125–1139.
- (2004) A new and simple policy for the continuous review lost sales inventory model. Working paper, Bell Labs, Lucent Technologies, Murray Hill, NJ.
- (2014) Pay-back-revenue-sharing contract in coordinating supply chains with random yield. Production Oper. Management 23(12):2089–2102.
- (2006) On the value of mitigation and contingency strategies for managing supply chain disruption risks. Management Sci. 52(5):639–657.
- (1996) Periodic review production models with variable capacity, random yield, and uncertain demand. Management Sci. 42(1):130–137.
- (2021) Understanding the performance of capped base-stock policies in lost-sales inventory models. Oper. Res. 69(1):61–70.
- (2016) Optimality gap of constant-order policies decays exponentially in the lead time for lost sales models. Oper. Res. 64(6):1556–1565.
- (2009) Supply disruptions, asymmetric information, and a backup production option. Management Sci. 55(2):192–209.
- (1995) Lot sizing with random yields: A review. Oper. Res. 43(2):311–334.
- (2021) Marrying stochastic gradient descent with bandits: Learning algorithms for inventory systems with fixed costs. Management Sci. 67(10):6089–6115.
- (2018) Perishable inventory systems: Convexity results for base-stock policies and learning algorithms under censored demand. Oper. Res. 66(5):1276–1286.
- (2020) Closing the gap: A learning algorithm for lost-sales inventory systems with lead times. Management Sci. 66(5):1962–1980.
- (2008) Old and new methods for lost-sales inventory systems. Oper. Res. 56(5):1256–1263.

