Rate-Optimal Online Learning for Dynamic Assortment Selection with Positioning
References
- (2011) Improved algorithms for linear stochastic bandits. Shawe-Taylor J, Zemel RS, Bartlett PL, Pereira F, Weinberger KQ, eds. Adv. Neural Inform. Processing Systems, vol. 24 (Curran Associates, Red Hook, NY), 2312–2320.Google Scholar
- (2016) Assortment optimization under a multinomial logit model with position bias and social influence. 4OR 14(1):57–75.Crossref, Google Scholar
- (2017) Thompson sampling for the MNL-bandit. Kale S, Shamir O, eds. Proc. 30th Conf. Learn. Theory (PMLR, New York), 76–78.Google Scholar
- (2019) Mnl-bandit: A dynamic learning approach to assortment selection. Oper. Res. 67(5):1453–1485.Link, Google Scholar
- (2021) Display optimization for vertically differentiated locations under multinomial logit preferences. Management Sci. 67(6):3519–3550.Link, Google Scholar
- (2021) Assortment optimization under consider-then-choose choice models. Management Sci. 67(6):3368–3386.Link, Google Scholar
- (2021) MNL-bandit with knapsacks: A near optimal algorithm. Preprint, submitted June 2, https://arxiv.org/abs/2106.01135.Google Scholar
- (1963) Topological Spaces (Oliver and Boyd, Edinburgh, UK).Google Scholar
- (2004) Convex Optimization (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (2013) Bandits with heavy tail. IEEE Trans. Inform. Theory 59(11):7711–7717.Crossref, Google Scholar
- (2007) Dynamic assortment with demand learning for seasonal consumer goods. Management Sci. 53(2):276–292.Link, Google Scholar
- (2018) A note on a tight lower bound for capacitated mnl-bandit assortment selection models. Oper. Res. Lett. 46(5):534–537.Crossref, Google Scholar
- (2021a) Optimal policy for dynamic assortment planning under multinomial logit models. Math. Oper. Res. 46(4):1639–1657.Link, Google Scholar
- (2021b) Dynamic assortment planning under nested logit models. Production Oper. Management 30(1):85–102.Crossref, Google Scholar
- (2023) Bias and debias in recommender system: A survey and future directions. ACM Trans. Inform. Systems 41(3):1–39.Crossref, Google Scholar
- (2019) A Thompson sampling algorithm for cascading bandits. Chaudhuri K, Sugiyama M, eds. Proc. 22nd Internat. Conf. Artificial Intelligence Statist., vol. 89 (PMLR, New York), 438–447.Google Scholar
- (2008) An experimental comparison of click position-bias models. Najork M, Broder AZ, Chakrabarti S, eds. Proc. 2008 Internat. Conf. Web Search Data Mining (Palo Alto, California), 87–94.Google Scholar
- (2022) The multinomial logit model with sequential offerings: Algorithmic frameworks for product recommendation displays. Oper. Res. 70(4):2162–2184.Link, Google Scholar
- (2023) MNL-bandit in non-stationary environments. Preprint, submitted March 4, https://arxiv.org/abs/2303.02504.Google Scholar
- (2020) Approximation algorithms for product framing and pricing. Oper. Res. 68(1):134–160.Link, Google Scholar
- (2023) Discrete choice models with piecewise linear utility: Modeling, estimation and pricing. Preprint, submitted March 20, http://dx.doi.org/10.2139/ssrn.4394213.Google Scholar
- (2015) Cascading bandits: Learning to rank in the cascade model. Bach F, Blei D, eds. Proc. 32nd Internat. Conf. Machine Learn., vol. 36 (PMLR, New York), 767–776.Google Scholar
- (2020) Bandit Algorithms (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (2025) Online learning for constrained assortment optimization under Markov chain choice model. Oper. Res. 73(1):109–138.Google Scholar
- (2016) Contextual combinatorial cascading bandits. Balcan MF, Weinberger KQ, eds. Proc. 33rd Internat. Conf. Machine Learn., vol. 48 (PMLR, New York), 1245–1253.Google Scholar
- (2016) No-regret algorithms for heavy-tailed linear bandits. Balcan MF, Weinberger KQ, eds. Proc. 33rd Internat. Conf. Machine Learn., vol. 48 (PMLR, New York), 1642–1650.Google Scholar
- (2021) Dynamic joint assortment and pricing optimization with demand learning. Manufacturing Service Oper. Management 23(2):525–545.Google Scholar
- (2010) Dynamic assortment optimization with a multinomial logit choice model and capacity constraint. Oper. Res. 58(6):1666–1680.Link, Google Scholar
- (2013) Optimal dynamic assortment planning with demand learning. Manufacturing Service Oper. Management 15(3):387–404.Link, Google Scholar
- (2023) Combinatorial inference on the optimal assortment in multinomial logit models. Preprint, submitted January 28, https://arxiv.org/abs/2301.12254.Google Scholar
- (2022) Modeling consumer choice and optimizing assortment under the threshold multinomial logit model. Preprint, submitted August 8, https://doi.org/10.2139/ssrn.4184044.Google Scholar
- (2022) Online resource allocation with personalized learning. Oper. Res. 70(4):2138–2161.Link, Google Scholar

