Tailored Base-Surge Policies in Dual-Sourcing Inventory Systems with Demand Learning

Boxiao Chen
Corresponding Author
Boxiao Chen
[email protected]
https://orcid.org/0000-0002-5967-4822
Information and Decision Sciences, College of Business Administration, University of Illinois, Chicago, Illinois 60607
Search for more papers by this author
,
Cong Shi
Cong Shi
[email protected]
https://orcid.org/0000-0003-3564-3391
Management, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146
Search for more papers by this author

Boxiao Chen

Corresponding Author

Boxiao Chen

[email protected]

https://orcid.org/0000-0002-5967-4822

Information and Decision Sciences, College of Business Administration, University of Illinois, Chicago, Illinois 60607

Search for more papers by this author

Cong Shi

[email protected]

https://orcid.org/0000-0003-3564-3391

Management, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146

Search for more papers by this author

Published Online:20 Dec 2024https://doi.org/10.1287/opre.2022.0624

Abstract

We consider a periodic-review dual-sourcing inventory system in which the expedited supplier is faster and more costly, whereas the regular supplier is slower and cheaper. Under full demand distributional information, it is well known that the optimal policy is extremely complex but the celebrated Tailored Base-Surge (TBS) policy performs near optimally. Under such a policy, a constant order is placed at the regular source in each period, whereas the order placed at the expedited source follows a simple order-up-to rule. In this paper, we assume that the firm does not know the demand distribution a priori and makes adaptive inventory ordering decisions in each period based only on the past sales (a.k.a. censored demand) data. The standard performance measure is regret, which is the cost difference between a feasible learning algorithm and the clairvoyant (full-information) benchmark. When the benchmark is chosen to be the (full-information) best Tailored Base-Surge policy, we develop the first nonparametric learning algorithm that admits a regret bound of $O (\sqrt{T} {(\log T)}^{3} \log \log T)$ , which is provably tight up to a logarithmic factor. Leveraging the structure of this problem, our approach combines the power of bisection search and stochastic gradient descent and also involves a delicate high-probability coupling argument between our and the clairvoyant optimal system dynamics.

Funding: The research of C. Shi is partially supported by an Amazon research award.

Supplemental Material: The online appendix is available at https://doi.org/10.1287/opre.2022.0624.

Volume 73, Issue 4

July-August 2025

Pages iii-viii, 1723-2295, C2-C3

Article Information

Supplemental Material

Metrics

Information

Received:November 29, 2022
Accepted:September 11, 2024
Published Online:December 20, 2024

Cite as

Boxiao Chen, Cong Shi (2024) Tailored Base-Surge Policies in Dual-Sourcing Inventory Systems with Demand Learning. Operations Research 73(4):1723-1743.

https://doi.org/10.1287/opre.2022.0624

Keywords

Acknowledgments

The authors thank the area editor Professor Tava Olsen, the associate editor, and the three anonymous referees for their very detailed and constructive comments, which have helped significantly improve the content and exposition of this paper.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Tailored Base-Surge Policies in Dual-Sourcing Inventory Systems with Demand Learning

Abstract

Volume 73, Issue 4

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News