An Online Mirror Descent Learning Algorithm for Multiproduct Inventory Systems

Sichen Guo
Sichen Guo
[email protected]
https://orcid.org/0009-0002-7637-4829
Department of Management, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146; and Research Institute for Interdisciplinary Sciences, School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai 200433, China
Search for more papers by this author
,
Cong Shi
Cong Shi
[email protected]
https://orcid.org/0000-0003-3564-3391
Department of Management, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146
Search for more papers by this author
,
Chaolin Yang
Chaolin Yang
[email protected]
https://orcid.org/0000-0001-8857-5877
Research Institute for Interdisciplinary Sciences, School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai 200433, China
Search for more papers by this author
,
Christos Zacharias
Corresponding Author
Christos Zacharias
[email protected]
https://orcid.org/0000-0002-9911-7860
Department of Management Science, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146
Search for more papers by this author

Department of Management, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146; and Research Institute for Interdisciplinary Sciences, School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai 200433, China

Search for more papers by this author

Cong Shi

[email protected]

https://orcid.org/0000-0003-3564-3391

Department of Management, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146

Search for more papers by this author

Chaolin Yang

[email protected]

https://orcid.org/0000-0001-8857-5877

Research Institute for Interdisciplinary Sciences, School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai 200433, China

Search for more papers by this author

Christos Zacharias

Corresponding Author

Christos Zacharias

[email protected]

https://orcid.org/0000-0002-9911-7860

Department of Management Science, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146

Search for more papers by this author

Published Online:29 Apr 2026https://doi.org/10.1287/opre.2024.0982

Abstract

We study a canonical inventory control problem: a multiproduct, periodic-review, lost-sales inventory system with a warehouse-capacity constraint. We study this well-researched problem under the lens of demand learning from censored data. Unlike the traditional literature, we do not assume that demand distributions are known a priori. Instead, the decision maker only has access to observed sales data, whereas the lost-sales quantity remains unobserved. Existing online learning algorithms bear limitations in providing good-quality solutions to inventory systems offering a large variety of products. We employ and innovate mirror descent with cyclic update techniques to address the challenge of high dimensionality in product menus. We prove theoretically that our algorithm’s regret bound exhibits a logarithmic dependence on the number of products. This constitutes a significant improvement compared with the square-root regret bound established in the existing literature. Using empirical data, we implemented our methods to assess their practical merit and expose additional managerial insights. Our numerical study confirms that our methodology indeed produces inventory policies superior to existing state-of-the-art solutions, especially when managing a large menu of products. Drawing from our numerical observations and theory-informed insights, we provide clear guidelines for practical implementation along with fine-tuning recommendations.

Funding: C. Shi acknowledges support from Amazon [Amazon Research Award] and the University of Miami [Provost Research Award]. C. Yang acknowledges support from the National Natural Science Foundation of China [Grants 72531005, 72122012, and 72071126] and the Program for Innovative Research Team at Shanghai University of Finance and Economics.

Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2024.0982.

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:April 24, 2024
Accepted:February 22, 2026
Published Online:April 29, 2026

Cite as

Sichen Guo, Cong Shi, Chaolin Yang, Christos Zacharias (2026) An Online Mirror Descent Learning Algorithm for Multiproduct Inventory Systems. Operations Research 0(0).

https://doi.org/10.1287/opre.2024.0982

Keywords

Acknowledgments

The authors thank area editor Rouba Ibrahim, the associate editor, and two anonymous referees for their careful reading and constructive comments, which led to several substantial improvements to the paper. Part of this research was conducted while Sichen Guo was at the Miami Herbert Business School at the University of Miami as a visiting PhD student.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

An Online Mirror Descent Learning Algorithm for Multiproduct Inventory Systems

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News