Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies

Published Online:https://doi.org/10.1287/mnsc.2022.02476

References

  • Agrawal S, Jia R (2022) Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management. Oper. Res. 70(3):1646–1664.LinkGoogle Scholar
  • Angkiriwang R, Pujawan IN, Santosa B (2014) Managing uncertainty through supply chain flexibility: Reactive vs. proactive approaches. Production Manufacturing Res. 2(1):50–70.CrossrefGoogle Scholar
  • Anupindi R, Akella R (1993) Diversification under supply uncertainty. Management Sci. 39(8):944–963.LinkGoogle Scholar
  • Asmussen S (2008) Applied Probability and Queues, vol. 51 (Springer Science & Business Media, New York).Google Scholar
  • Babich V, Burnetas AN, Ritchken PH (2007) Competition and diversification effects in supply chains with supplier default risk. Manufacturing Service Oper. Management 9(2):123–146.LinkGoogle Scholar
  • Bai X, Chen X, Li M, Stolyar A (2020) Asymptotic optimality of semi-open-loop policies in Markov decision processes with large lead times. Preprint, submitted October 20, https://dx.doi.org/10.2139/ssrn.3685551.Google Scholar
  • Bernstein F, Li Y, Shang K (2016) A simple heuristic for joint inventory and pricing models with lead time and backorders. Management Sci. 62(8):2358–2373.LinkGoogle Scholar
  • Bijvank M, Vis IF (2011) Lost-sales inventory theory: A review. Eur. J. Oper. Res. 215(1):1–13.CrossrefGoogle Scholar
  • Bollapragada S, Morton TE (1999) Myopic heuristics for the random yield problem. Oper. Res. 47(5):713–722.LinkGoogle Scholar
  • Bu J, Gong X, Yao D (2020) Constant-order policies for lost-sales inventory models with random supply functions: Asymptotics and heuristic. Oper. Res. 68(4):1063–1073.LinkGoogle Scholar
  • Cachon GP (2003) Supply chain coordination with contracts. Graves SC, de Kok AG, eds. Supply Chain Management: Design, Coordination and Operation, Handbooks in Operations Research and Management Science, vol. 11 (Elsevier, Amsterdam), 227–339.CrossrefGoogle Scholar
  • Chao X, Chen H, Zheng S (2008) Joint replenishment and pricing decisions in inventory systems with stochastically dependent supply capacity. Eur. J. Oper. Res. 191(1):142–155.CrossrefGoogle Scholar
  • Chen B, Shi C (2019) Tailored base-surge policies in dual-sourcing inventory systems with demand learning. Preprint, submitted September 27, https://dx.doi.org/10.2139/ssrn.3456834.Google Scholar
  • Chen B, Chao X, Ahn H-S (2019) Coordinating pricing and inventory replenishment with nonparametric demand learning. Oper. Res. 67(4):1035–1052.AbstractGoogle Scholar
  • Chen B, Chao X, Shi C (2021) Nonparametric learning algorithms for joint pricing and inventory control with lost sales and censored demand. Math. Oper. Res. 46(2):726–756.LinkGoogle Scholar
  • Chen X, Stolyar A, Xin L (2023) Asymptotic optimality of constant-order policies in joint pricing and inventory control models. Math. Oper. Res., ePub ahead of print April 17, https://doi.org/10.1287/moor.2023.1367.Google Scholar
  • Chen B, Wang Y, Zhou Y (2023) Optimal policies for dynamic pricing and inventory control with nonparametric censored demands. Management Sci., ePub ahead of print August 31, https://doi.org/10.1287/mnsc.2023.4859.Google Scholar
  • Ciarallo FW, Akella R, Morton TE (1994) A periodic review, production planning model with uncertain capacity and uncertain demand-optimality of extended myopic policies. Management Sci. 40(3):320–332.LinkGoogle Scholar
  • Dada M, Petruzzi NC, Schwarz LB (2007) A newsvendor’s procurement problem when suppliers are unreliable. Manufacturing Service Oper. Management 9(1):9–32.LinkGoogle Scholar
  • DeValve L, Pekeč S, Wei Y (2020) A primal-dual approach to analyzing ato systems. Management Sci. 66(11):5389–5407.LinkGoogle Scholar
  • Dong S, Van Roy B, Zhou Z (2019) Provably efficient reinforcement learning with aggregated states. Preprint, submitted December 13, https://arxiv.org/abs/1912.06366.Google Scholar
  • Even-Dar E, Mannor S, Mansour Y, Mahadevan S (2006) Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. J. Machine Learn. Res. 7(6):1079–1105.Google Scholar
  • Federgruen A, Yang N (2008) Selecting a portfolio of suppliers under demand and supply risks. Oper. Res. 56(4):916–936.LinkGoogle Scholar
  • Federgruen A, Yang N (2009) Optimal supply diversification under general supply risks. Oper. Res. 57(6):1451–1468.LinkGoogle Scholar
  • Feng Q (2010) Integrating dynamic pricing and replenishment decisions under supply capacity uncertainty. Management Sci. 56(12):2154–2172.LinkGoogle Scholar
  • Feng Q, Shanthikumar JG (2018) Supply and demand functions in inventory models. Oper. Res. 66(1):77–91.LinkGoogle Scholar
  • Goldberg DA, Katz-Rogozhnikov DA, Lu Y, Sharma M, Squillante MS (2016) Asymptotic optimality of constant-order policies for lost sales inventory models with large lead times. Math. Oper. Res. 41(3):898–913.LinkGoogle Scholar
  • Gong X-Y, Simchi-Levi D (2021) Bandits atop reinforcement learning: Tackling online inventory models with cyclic demands. Preprint, submitted July 21, https://dx.doi.org/10.2139/ssrn.3637705.Google Scholar
  • Gümüş M, Ray S, Gurnani H (2012) Supply-side story: Risks, guarantees, competition, and information asymmetry. Management Sci. 58(9):1694–1714.LinkGoogle Scholar
  • Henig M, Gerchak Y (1990) The structure of periodic review policies in the presence of random yield. Oper. Res. 38(4):634–643.LinkGoogle Scholar
  • Huh WT, Nagarajan M (2010) Linear inflation rules for the random yield problem: Analysis and computations. Oper. Res. 58(1):244–251.LinkGoogle Scholar
  • Huh WT, Rusmevichientong P (2009) A nonparametric asymptotic analysis of inventory planning with censored demand. Math. Oper. Res. 34(1):103–123.LinkGoogle Scholar
  • Huh WT, Janakiraman G, Muckstadt JA, Rusmevichientong P (2009a) An adaptive algorithm for finding the optimal base-stock policy in lost sales inventory systems with censored demand. Math. Oper. Res. 34(2):397–416.LinkGoogle Scholar
  • Huh WT, Janakiraman G, Muckstadt JA, Rusmevichientong P (2009b) Asymptotic optimality of order-up-to policies in lost sales inventory systems. Management Sci. 55(3):404–420.LinkGoogle Scholar
  • Inderfurth K, Kiesmüller GP (2015) Exact and heuristic linear-inflation policies for an inventory model with random yield and arbitrary lead times. Eur. J. Oper. Res. 245(1):109–120.CrossrefGoogle Scholar
  • Janakiraman G, Roundy RO (2004) Lost-sales problems with stochastic lead times: Convexity results for base-stock policies. Oper. Res. 52(5):795–803.LinkGoogle Scholar
  • Jin C, Allen-Zhu Z, Bubeck S, Jordan MI (2018) Is q-learning provably efficient? Adv. Neural Inform. Processing Systems, vol. 31 (MIT press, Cambridge, MA).Google Scholar
  • Kazaz B (2004) Production planning under yield and demand uncertainty with yield-dependent cost and price. Manufacturing Service Oper. Management 6(3):209–224.LinkGoogle Scholar
  • Lei M, Liu S, Jasin S, Vakhutinsky A (2022) Joint inventory and pricing for a one-warehouse multistore problem: Spiraling phenomena, near optimal policies, and the value of dynamic pricing. Oper. Res., ePub ahead of print October 17, https://doi.org/10.1287/opre.2022.2389.Google Scholar
  • Levi R, Janakiraman G, Nagarajan M (2008) A 2-approximation algorithm for stochastic inventory control models with lost sales. Math. Oper. Res. 33(2):351–374.LinkGoogle Scholar
  • Li T, Sethi SP, Zhang J (2013) Supply diversification with responsive pricing. Production Oper. Management 22(2):447–458.CrossrefGoogle Scholar
  • Lim AE, Shanthikumar JG, Shen ZM (2006) Model uncertainty, robust optimization, and learning. INFORMS TutORials Oper. Res. 66–94.LinkGoogle Scholar
  • Lin M, Huh WT, Krishnan H, Uichanco J (2022) Technical note—Data-driven newsvendor problem: Performance of the sample average approximation. Oper. Res. 70(4):1996–2012.LinkGoogle Scholar
  • Lyu C, Zhang H, Xin L (2021) UCB-type learning algorithms for lost-sales inventory models with lead times. Preprint, submitted October 18, https://dx.doi.org/10.2139/ssrn.3944354.Google Scholar
  • Raj A, Mukherjee AA, de Sousa Jabbour ABL, Srivastava SK (2022) Supply chain management during and post-Covid-19 pandemic: Mitigation strategies and practical lessons learned. J. Bus. Res. 142:1125–1139.CrossrefGoogle Scholar
  • Reiman MI (2004) A new and simple policy for the continuous review lost sales inventory model. Working paper, Bell Labs, Lucent Technologies, Murray Hill, NJ.Google Scholar
  • Tang SY, Kouvelis P (2014) Pay-back-revenue-sharing contract in coordinating supply chains with random yield. Production Oper. Management 23(12):2089–2102.CrossrefGoogle Scholar
  • Tomlin B (2006) On the value of mitigation and contingency strategies for managing supply chain disruption risks. Management Sci. 52(5):639–657.LinkGoogle Scholar
  • Wang Y, Gerchak Y (1996) Periodic review production models with variable capacity, random yield, and uncertain demand. Management Sci. 42(1):130–137.LinkGoogle Scholar
  • Xin L (2021) Understanding the performance of capped base-stock policies in lost-sales inventory models. Oper. Res. 69(1):61–70.LinkGoogle Scholar
  • Xin L, Goldberg DA (2016) Optimality gap of constant-order policies decays exponentially in the lead time for lost sales models. Oper. Res. 64(6):1556–1565.LinkGoogle Scholar
  • Yang Z(B), Aydin G, Babich V, Beil DR (2009) Supply disruptions, asymmetric information, and a backup production option. Management Sci. 55(2):192–209.LinkGoogle Scholar
  • Yano CA, Lee HL (1995) Lot sizing with random yields: A review. Oper. Res. 43(2):311–334.LinkGoogle Scholar
  • Yuan H, Luo Q, Shi C (2021) Marrying stochastic gradient descent with bandits: Learning algorithms for inventory systems with fixed costs. Management Sci. 67(10):6089–6115.LinkGoogle Scholar
  • Zhang H, Chao X, Shi C (2018) Perishable inventory systems: Convexity results for base-stock policies and learning algorithms under censored demand. Oper. Res. 66(5):1276–1286.LinkGoogle Scholar
  • Zhang H, Chao X, Shi C (2020) Closing the gap: A learning algorithm for lost-sales inventory systems with lead times. Management Sci. 66(5):1962–1980.LinkGoogle Scholar
  • Zipkin P (2008) Old and new methods for lost-sales inventory systems. Oper. Res. 56(5):1256–1263.LinkGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.