Online Learning for Constrained Assortment Optimization Under Markov Chain Choice Model

