Effective Adaptive Exploration of Prices and Promotions in Choice-Based Demand Models

Published Online:https://doi.org/10.1287/mksc.2023.0322

References

  • Abeille M, Lazaric A (2017) Linear Thompson sampling revisited. Singh A, Zhu J, eds. Proc. 20th Internat. Conf. Artificial Intelligence Statist., Proceedings of Machine Learning Research, vol. 54 (PMLR, New York), 176–184.Google Scholar
  • Agrawal S, Avadhanula V, Goyal V, Zeevi A (2019) MNL-bandit: A dynamic learning approach to assortment selection. Oper. Res. 67(5):1453–1485.LinkGoogle Scholar
  • Allenby GM, Rossi PE (1998) Marketing models of consumer heterogeneity. J. Econometrics 89(1–2):57–78.CrossrefGoogle Scholar
  • Aramayo N, Schiappacasse M, Goic M (2023) A multi-armed bandit approach for house ads recommendations. Marketing Sci. 42(2):271–292.Google Scholar
  • Aydin G, Ryan JK (2000) Product line selection and pricing. Proc. 2021 ACM Conf. Fairness, Accountability, Transparency (Association for Computing Machinery, New York), 296–314.Google Scholar
  • Bakry D, Gentil I, Ledoux M (2014) Analysis and Geometry of Markov Diffusion Operators, vol. 103 (Springer, Berlin).CrossrefGoogle Scholar
  • Ban G-Y, Keskin NB (2021) Personalized dynamic pricing with machine learning: High-dimensional features and heterogeneous elasticity. Management Sci. 67(9):5549–5568.LinkGoogle Scholar
  • Bastani H, Simchi-Levi D, Zhu R (2022) Meta dynamic pricing: Transfer learning across experiments. Management Sci. 68(3):1865–1881.LinkGoogle Scholar
  • Berry ST (1994) Estimating discrete-choice models of product differentiation. RAND J. Econom. 25(2):242–262.CrossrefGoogle Scholar
  • Berry ST, Haile PA (2021) Foundations of demand estimation. Ho K, Hortaçsu A, Lizzeri A, eds. Handbook of Industrial Organization (Elsevier, Amsterdam), 1–62.Google Scholar
  • Berry S, Levinsohn J, Pakes A (1995) Automobile prices in market equilibrium. Econometrica 63(4):841–890.CrossrefGoogle Scholar
  • Besbes O, Zeevi A (2009) Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Oper. Res. 57(6):1407–1420.LinkGoogle Scholar
  • Besbes O, Zeevi A (2015) On the (surprising) sufficiency of linear models for dynamic pricing with demand learning. Management Sci. 61(4):723–739.LinkGoogle Scholar
  • Bradlow ET, Gangwar M, Kopalle P, Voleti S (2017) The role of big data and predictive analytics in retailing. J. Retailing 93(1):79–95.CrossrefGoogle Scholar
  • Broder J, Rusmevichientong P (2012) Dynamic pricing under a general parametric choice model. Oper. Res. 60(4):965–980.LinkGoogle Scholar
  • Casella G, Berger RL (2021) Statistical Inference (Chapman and Hall/CRC, New York).Google Scholar
  • Chapelle O, Li L (2011) An empirical evaluation of Thompson sampling. Shawe-Taylor J, Zemel R, Bartlett P, Pereira F, Weinberger KQ, eds. Advances in Neural Information Processing Systems, vol. 24 (Curran Associates Inc., Red Hook, NY).Google Scholar
  • Cheung WC, Simchi-Levi D, Wang H (2017) Dynamic pricing and demand learning with limited price experimentation. Oper. Res. 65(6):1722–1731.LinkGoogle Scholar
  • DellaVigna S, Gentzkow M (2019) Uniform pricing in us retail chains. Quart. J. Econom. 134(4):2011–2084.CrossrefGoogle Scholar
  • den Boer AV, Zwart B (2014) Simultaneously learning and optimizing using controlled variance pricing. Management Sci. 60(3):770–783.LinkGoogle Scholar
  • Dong L, Kouvelis P, Tian Z (2009) Dynamic pricing and inventory control of substitute products. Manufacturing Service Oper. Management 11(2):317–339.LinkGoogle Scholar
  • Dubé J-P, Misra S (2023) Personalized pricing and consumer welfare. J. Political Econom. 131(1):131–189.CrossrefGoogle Scholar
  • Faury L, Abeille M, Calauzènes C, Fercoq O (2020) Improved optimistic algorithms for logistic bandits. Internat. Conf. Machine Learn. (PMLR, New York), 3052–3060.Google Scholar
  • Feldman J, Zhang DJ, Liu X, Zhang N (2022) Customer choice models vs. machine learning: Finding optimal product displays on Alibaba. Oper. Res. 70(1):309–328.LinkGoogle Scholar
  • Ferreira KJ, Simchi-Levi D, Wang H (2018) Online network revenue management using Thompson sampling. Oper. Res. 66(6):1586–1602.LinkGoogle Scholar
  • Gabel S, Timoshenko A (2022) Product choice with large assortments: A scalable deep-learning model. Management Sci. 68(3):1808–1827.LinkGoogle Scholar
  • Ganti R, Sustik M, Tran Q, Seaman B (2018) Thompson Sampling for dynamic pricing. Preprint, submitted February 8, https://arxiv.org/abs/1802.03050.Google Scholar
  • Goeree MS (2008) Limited information and advertising in the us personal computer industry. Econometrica 76(5):1017–1074.CrossrefGoogle Scholar
  • Goli A, Reiley D, Zhang H (2021) Personalized versioning: Product strategies constructed from experiments on Pandora. Preprint, submitted June 25, http://dx.doi.org/10.2139/ssrn.3874243.Google Scholar
  • Guadagni PM, Little JD (1983) A logit model of brand choice calibrated on scanner data. Marketing Sci. 2(3):203–238.LinkGoogle Scholar
  • Hamidi N, Bayati M (2020) On worst-case regret of linear Thompson sampling. Preprint, submitted June 11, https://arxiv.org/abs/2006.06790.Google Scholar
  • Hanson W, Martin K (1996) Optimizing multinomial logit profit functions. Management Sci. 42(7):992–1003.LinkGoogle Scholar
  • Hauser JR, Urban GL, Liberali G, Braun M (2009) Website morphing. Marketing Sci. 28(2):202–223.LinkGoogle Scholar
  • Hitsch GJ, Hortaçsu A, Lin X (2021) Prices and promotions in US retail markets. Quant. Marketing Econom. 19(3):289–368.CrossrefGoogle Scholar
  • Hoch SJ, Kim B-D, Montgomery AL, Rossi PE (1995) Determinants of store-level price elasticity. J. Marketing Res. 32(1):17–29.CrossrefGoogle Scholar
  • Horsky D, Misra S, Nelson P (2006) Observed and unobserved preference heterogeneity in brand-choice models. Marketing Sci. 25(4):322–335.LinkGoogle Scholar
  • Howell JR, Lee S, Allenby GM (2016) Price promotions in choice models. Marketing Sci. 35(2):319–334.LinkGoogle Scholar
  • Huang Y, Ellickson PB, Lovett MJ (2022) Learning to set prices. J. Marketing Res. 59(2):411–434.CrossrefGoogle Scholar
  • Javanmard A (2017) Perishability of data: Dynamic pricing under varying-coefficient models. J. Machine Learn. Res. 18(1):1714–1744.Google Scholar
  • Javanmard A, Nazerzadeh H (2019) Dynamic pricing in high-dimensions. J. Machine Learn. Res. 20(9):1–49.Google Scholar
  • Javanmard A, Nazerzadeh H, Shao S (2020) Multi-product dynamic pricing in high-dimensions with heterogeneous price sensitivity. 2020 IEEE Internat. Sympos. Inform. Theory (ISIT) (IEEE, Piscataway, NJ), 2652–2657.Google Scholar
  • Johnson J, Tellis GJ, Ip EH (2013) To whom, when, and how much to discount? A constrained optimization of customized temporal discounts. J. Retailing 89(4):361–373.CrossrefGoogle Scholar
  • Kadiyali V, Sudhir K, Rao VR (2001) Structural analysis of competitive behavior: New empirical industrial organization methods in marketing. Internat. J. Res. Marketing 18(1–2):161–186.CrossrefGoogle Scholar
  • Kallus N, Zhou A (2021) Fairness, welfare, and equity in personalized pricing. Proc. 2021 ACM Conf. Fairness Accountability Transparency (ACM, New York), 296–314.Google Scholar
  • Keskin NB, Zeevi A (2014) Dynamic pricing with an unknown demand model: Asymptotically optimal semi-myopic policies. Oper. Res. 62(5):1142–1167.LinkGoogle Scholar
  • Kleinberg R, Leighton T (2003) The value of knowing a demand curve: Bounds on regret for online posted-price auctions. 44th Annual IEEE Sympos. Foundations Comput. Sci. 2003 Proc. (IEEE, Piscataway, NJ), 594–605.Google Scholar
  • Kveton B, Zaheer M, Szepesvari C, Li L, Ghavamzadeh M, Boutilier C (2020) Randomized exploration in generalized linear bandits. Internat. Conf. Artificial Intelligence Statist. (PMLR, New York), 2066–2076.Google Scholar
  • Lattimore T, Szepesvári C (2020) Bandit Algorithms (Cambridge University Press, Cambridge, UK).CrossrefGoogle Scholar
  • Li H, Huh WT (2011) Pricing multiple products with the multinomial logit and nested logit models: Concavity and implications. Manufacturing Service Oper. Management 13(4):549–563.LinkGoogle Scholar
  • Liberali G, Ferecatu A (2022) Morphing for consumer dynamics: Bandits meet hidden Markov models. Marketing Sci. 41(4):769–794.LinkGoogle Scholar
  • Luo L, Kannan P, Ratchford BT (2007) New product development under channel acceptance. Marketing Sci. 26(2):149–163.LinkGoogle Scholar
  • Mazumdar E, Pacchiano A, Ma Y-a, Bartlett PL, Jordan MI (2020) On Thompson sampling with Langevin algorithms. Preprint, submitted February 23, https://arxiv.org/abs/2002.10002.Google Scholar
  • McFadden D (1980) Econometric models for probabilistic choice among products. J. Bus. 53(3):S13–S29.CrossrefGoogle Scholar
  • McFadden D, Talvitie A, Cosslett S, Hasan I, Johnson M, Reid F, Train K (1977) Demand model estimation and validation. Urban Travel Demand Forecasting Project, Phase 1. Working paper, Institute of Transportation Studies, University of California, Berkeley, CA.Google Scholar
  • Mehta N, Detroja P, Agashe A (2018) Amazon changes prices on its products about every 10 minutes—Here’s how and why they do it. Business Insider (August 10), https://www.businessinsider.com/amazon-price-changes-2018-8?international=true&r=US&IR=T.Google Scholar
  • Miao S, Chao X (2021) Dynamic joint assortment and pricing optimization with demand learning. Manufacturing Service Oper. Management 23(2):525–545.Google Scholar
  • Misra K, Schwartz EM, Abernethy J (2019) Dynamic online pricing with incomplete information using multiarmed bandit experiments. Marketing Sci. 38(2):226–252.LinkGoogle Scholar
  • Mueller J, Syrgkanis V, Taddy M (2018) Low-rank bandit methods for high-dimensional dynamic pricing. Wallach H, Larochelle H, Beygelzimer A, d’Alché-Buc F, Fox E, Garnett R, eds. Advances in Neural Information Processing Systems, vol. 32 (Curran Associates Inc., Red Hook, NY).Google Scholar
  • Pancras J, Sudhir K (2007) Optimal marketing strategies for a customer data intermediary. J. Marketing Res. 44(4):560–578.CrossrefGoogle Scholar
  • Priester A, Robbert T, Roth S (2020) A special price just for you: Effects of personalized dynamic pricing on consumer fairness perceptions. J. Revenue Pricing Management 19(2020):99–112.CrossrefGoogle Scholar
  • Qiang S, Bayati M (2016) Dynamic pricing with demand covariates. Preprint, submitted April 25, https://arxiv.org/abs/1604.07463.Google Scholar
  • Rafieian O, Yoganarasimhan H (2021) Targeting and privacy in mobile advertising. Marketing Sci. 40(2):193–218.LinkGoogle Scholar
  • Roberts GO, Tweedie RL (1996) Exponential convergence of Langevin distributions and their discrete approximations. Bernoulli 2(4):341–363.CrossrefGoogle Scholar
  • Rossi PE, McCulloch RE, Allenby GM (1996) The value of purchase history data in target marketing. Marketing Sci. 15(4):321–340.LinkGoogle Scholar
  • Russo DJ, Van Roy B, Kazerouni A, Osband I, Wen Z (2018) A tutorial on Thompson sampling. Foundations Trends Machine Learn. 11(1):1–96.CrossrefGoogle Scholar
  • Schwartz EM, Bradlow ET, Fader PS (2017) Customer acquisition via display advertising using multi-armed bandit experiments. Marketing Sci. 36(4):500–522.LinkGoogle Scholar
  • Smith AN, Seiler S, Aggarwal I (2023) Optimal price targeting. Marketing Sci. 42(3):476–499.LinkGoogle Scholar
  • Sudhir K (2001) Structural analysis of manufacturer pricing in the presence of a strategic retailer. Marketing Sci. 20(3):244–264.LinkGoogle Scholar
  • Train KE (2009) Discrete Choice Methods with Simulation (Cambridge University Press, Cambridge, UK).CrossrefGoogle Scholar
  • Trovo F, Paladino S, Restelli M, Gatti N (2018) Improving multi-armed bandit algorithms for pricing. 16th Eur. Conf. Multi-Agent Systems (Milano, Italy), 1–15.Google Scholar
  • Wang Y, Chen B, Simchi-Levi D (2021) Multimodal dynamic pricing. Management Sci. 67(10):6136–6152.LinkGoogle Scholar
  • Weaver I, Kumar V (2022) Nonparametric bandits leveraging informational externalities to learn the demand curve. Working paper, Yale School of Management, New Haven, CT.Google Scholar
  • Wei Y, Jiang Z (2022) Estimating parameters of structural models using neural networks. Working paper, University of Southern California – Marshall School of Business, Los Angeles.Google Scholar
  • Welling M, Teh YW (2011) Bayesian learning via stochastic gradient Langevin dynamics. Proc. 28th Internat. Conf. Machine Learn. (ICML-11) (Omnipress, Madison, WI), 681–688.Google Scholar
  • Xu J, Wang Y-x (2021) Logarithmic regret in feature-based dynamic pricing. Preprint, submitted February 20, https://arxiv.org/abs/2102.10221.Google Scholar
  • Xu P, Zheng H, Mazumdar EV, Azizzadenesheli K, Anandkumar A (2022) Langevin Monte Carlo for contextual bandits. Internat. Conf. Machine Learning (PMLR, New York), 24830–24850.Google Scholar
  • Yoganarasimhan H (2020) Search personalization using machine learning. Management Sci. 66(3):1045–1070.LinkGoogle Scholar
  • Zhang J, Krishnamurthi L (2004) Customizing promotions in online stores. Marketing Sci. 23(4):561–578.LinkGoogle Scholar
  • Zhang J, Wedel M (2009) The effectiveness of customized promotions in online and offline stores. J. Marketing Res. 46(2):190–206.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.