Online Learning for Constrained Assortment Optimization Under Markov Chain Choice Model

Shukai Li
[email protected]
https://orcid.org/0009-0005-1406-5803
Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, Illinois 60208
Qi Luo
[email protected]
https://orcid.org/0000-0002-4103-7112
Department of Business Analytics, University of Iowa, Iowa City, Iowa 52242
Zhiyuan Huang
[email protected]
https://orcid.org/0000-0003-1284-2128
Department of Management Science and Engineering, Tongji University, Shanghai 200092, China
Cong Shi
Corresponding Author
[email protected]
https://orcid.org/0000-0003-3564-3391
Management Science, Miami Herbert Business School, University of Miami, Coral Gables, Florida 33146

Published Online: 15 May 2024
https://doi.org/10.1287/opre.2022.0693

Abstract

We study a dynamic assortment selection problem in which arriving customers choose among the products offered from a larger universe of products according to a Markov chain choice (MCC) model. The retailer observes only the assortment and the customer’s single choice in each period. Given limited display capacity, resource constraints, and no a priori knowledge of problem parameters, the retailer’s objective is to sequentially learn the choice model and optimize cumulative revenues over a finite selling horizon. We develop a fast linear-system-based explore-then-commit (FastLinETC for short) learning algorithm that balances the tradeoff between exploration and exploitation. The algorithm simultaneously estimates the arrival and transition probabilities in the MCC model by solving a linear system of equations and then determines a near-optimal assortment based on these estimates. Furthermore, our consistent estimators require less computation time than existing heuristic estimation methods, which often suffer from inconsistency or a significant computational burden.
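
For readers new to the model, the following is a minimal sketch of how choice probabilities can be computed under an MCC model by solving a linear system. It assumes the standard formulation in which state 0 is the no-purchase option and every offered product is an absorbing state of the chain; this formulation is not spelled out in the abstract. The function names and the example numbers are illustrative, and the sketch shows only the forward model, not the FastLinETC estimator.

```python
from fractions import Fraction


def solve_linear_system(A, b):
    """Solve A x = b by Gauss-Jordan elimination using exact Fractions."""
    n = len(A)
    M = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        # Find a row with a nonzero pivot and move it into place.
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        pv = M[col][col]
        M[col] = [v / pv for v in M[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col and M[r][col] != 0:
                f = M[r][col]
                M[r] = [a - f * c for a, c in zip(M[r], M[col])]
    return [M[r][n] for r in range(n)]


def mcc_choice_probabilities(arrival, transition, assortment):
    """Choice probabilities for an offered assortment (assumed formulation).

    arrival[i]       -- probability that state i is considered first
    transition[i][j] -- probability of moving from an unavailable i to j
    State 0 is the no-purchase option and is always absorbing.
    """
    n = len(arrival)
    absorbing = set(assortment) | {0}
    transient = [i for i in range(n) if i not in absorbing]
    # Expected visits v_j to each transient state satisfy
    #   v_j = arrival[j] + sum_k v_k * transition[k][j].
    A = [[(1 if j == k else 0) - Fraction(transition[k][j]) for k in transient]
         for j in transient]
    b = [arrival[j] for j in transient]
    visits = dict(zip(transient, solve_linear_system(A, b)))
    # An absorbing state is chosen on arrival or after a transient detour.
    return {
        i: Fraction(arrival[i])
        + sum(visits[k] * Fraction(transition[k][i]) for k in transient)
        for i in sorted(absorbing)
    }


# Illustrative instance: three products (states 1-3) plus no-purchase (state 0).
q, h = Fraction(1, 4), Fraction(1, 2)
arrival = [0, h, q, q]
transition = [
    [1, 0, 0, 0],  # no-purchase row is never used (state 0 is absorbing)
    [h, 0, q, q],
    [h, q, 0, q],
    [h, q, q, 0],
]
probs = mcc_choice_probabilities(arrival, transition, {1})
print(probs)
```

With only product 1 offered, customers who first consider product 2 or 3 either transition to product 1 or leave without purchasing, so the returned probabilities sum to one. Exact fractions are used here only to keep the small example free of rounding; a practical implementation would more likely use a numerical linear algebra library.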

Funding: The research of Q. Luo is partially supported by the National Science Foundation [Grant CMMI-2308750]. The research of Z. Huang is partially supported by the Shanghai Sailing Program [Grant 22YF1451100] and the Fundamental Research Funds for the Central Universities. The research of C. Shi is partially supported by Amazon [Research Award].

Volume 73, Issue 1

January-February 2025

Pages iii-vii, 1-582, C2-C3

Information

Received: December 31, 2022
Accepted: March 07, 2024
Published Online: May 15, 2024

Cite as

Shukai Li, Qi Luo, Zhiyuan Huang, Cong Shi (2024) Online Learning for Constrained Assortment Optimization Under Markov Chain Choice Model. Operations Research 73(1):109-138.

https://doi.org/10.1287/opre.2022.0693

Acknowledgments

The authors thank the area editor Professor Ilan Lobel, associate editor, and anonymous referees for detailed and constructive comments that significantly improved the content and exposition of this paper.
