A Minibatch Stochastic Gradient Descent-Based Learning Metapolicy for Inventory Systems with Myopic Optimal Policy

Jiameng Lyu
Corresponding Author
Jiameng Lyu
[email protected]
https://orcid.org/0000-0002-4688-5276
Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, China; and Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China
Search for more papers by this author
,
Jinxing Xie
Corresponding Author
Jinxing Xie
[email protected]
https://orcid.org/0000-0002-9269-6468
Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China
Search for more papers by this author
,
Shilin Yuan
Corresponding Author
Shilin Yuan
[email protected]
https://orcid.org/0009-0002-7892-0344
Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China
Search for more papers by this author
,
Yuan Zhou
Corresponding Author
Yuan Zhou
[email protected]
https://orcid.org/0009-0008-1706-6539
Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, China; and Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China; and Beijing Institute of Mathematical Sciences and Application, Beijing 100084, China
Search for more papers by this author

Corresponding Author

Jiameng Lyu

Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, China; and Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China

Search for more papers by this author

Jinxing Xie

Corresponding Author

Jinxing Xie

[email protected]

https://orcid.org/0000-0002-9269-6468

Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China

Search for more papers by this author

Shilin Yuan

Corresponding Author

Shilin Yuan

[email protected]

https://orcid.org/0009-0002-7892-0344

Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China

Search for more papers by this author

Yuan Zhou

Corresponding Author

Yuan Zhou

[email protected]

https://orcid.org/0009-0008-1706-6539

Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, China; and Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China; and Beijing Institute of Mathematical Sciences and Application, Beijing 100084, China

Search for more papers by this author

Published Online:9 Oct 2024https://doi.org/10.1287/mnsc.2023.00920

Supplemental Material

mnsc.2023.00920.sm1.pdf

Volume 71, Issue 7

July 2025

Pages iv-vi, 5419-6318

Article Information

Supplemental Material

Metrics

Information

Received:March 23, 2023
Accepted:August 18, 2024
Published Online:October 09, 2024

Cite as

Jiameng Lyu, Jinxing Xie, Shilin Yuan, Yuan Zhou (2024) A Minibatch Stochastic Gradient Descent-Based Learning Metapolicy for Inventory Systems with Myopic Optimal Policy. Management Science 71(7):5572-5588.

https://doi.org/10.1287/mnsc.2023.00920

Keywords

Acknowledgments

The authors thank the department editor, associate editor, and three anonymous referees for detailed and constructive comments that considerably improved the quality of this paper. The authors Jiameng Lyu, Jinxing Xie, Shilin Yuan, and Yuan Zhou are listed in alphabetical order.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

A Minibatch Stochastic Gradient Descent-Based Learning Metapolicy for Inventory Systems with Myopic Optimal Policy

Supplemental Material

Volume 71, Issue 7

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News