Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments

Published Online:https://doi.org/10.1287/mksc.2018.1129

References

  • Acquisti A, Varian HR (2005) Conditioning prices on purchase history. Marketing Sci. 24(3):367–381.LinkGoogle Scholar
  • Aghion P, Bolton P, Harris C, Jullien B (1991) Optimal learning by experimentation. Rev. Econom. Stud. 58(4):621–654.CrossrefGoogle Scholar
  • Agrawal R (1955) Sample mean based index policies with O(log n) regret for the multi-armed bandit problem. Adv. Appl. Probab. 27(4):1054–1078.CrossrefGoogle Scholar
  • Akçay Y, Natarajan HP, Xu SH (2010) Joint dynamic pricing of multiple perishable products under consumer choice. Management Sci. 56(8):1345–1361.LinkGoogle Scholar
  • Anderson E, Jaimovich N, Simester D (2015) Price stickiness: Empirical evidence of the menu cost channel. Rev. Econom. Statist. 97(4):813–826.CrossrefGoogle Scholar
  • Audibert JY, Munos R, Szepesvári C (2009) Exploration-exploitation trade-off using variance estimates in multi-armed bandits. Theoret. Comput. Sci. 410(19):1876–1902.CrossrefGoogle Scholar
  • Auer P (2002) Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learn. Res. 3(November):397–422.Google Scholar
  • Auer P, Cesa-Bianchi N, Fischer P (2002) Finite-time analysis of the multiarmed bandit problem. Machine Learn. 47(2/3):235–256.CrossrefGoogle Scholar
  • Aviv Y, Pazcal A (2002) Pricing of short life-cycle products through active learning. Working paper, Washington University of St. Louis, St. Louis.Google Scholar
  • Baker W, Kiewell D, Winkler G (2014) Using big data to make better pricing decisions. McKinsey & Company (June), http://www.mckinsey.com/business-functions/marketing-and-sales/our-insights/using-big-data-to-make-better-pricing-decisions.Google Scholar
  • Bayus BL (1992) The dynamic pricing of next generation consumer durables. Marketing Sci. 11(3):251–265.LinkGoogle Scholar
  • Bergemann D, Schlag K (2008) Pricing without priors. J. Eur. Econom. Assoc. 6(2-3):560–569.CrossrefGoogle Scholar
  • Bergemann D, Schlag K (2011) Robust monopoly pricing. J. Econom. Theory 146(6):2527–2543.CrossrefGoogle Scholar
  • Bergemann D, Valimaki J (2000) Experimentation in markets. Rev. Econom. Stud. 67:213–234.CrossrefGoogle Scholar
  • Berger JO (1985) Statistical Decision Theory and Bayesian Analysis, 2nd ed. (Springer, New York).CrossrefGoogle Scholar
  • Besbes O, Zeevi A (2009) Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Oper. Res. 57(6):1407–1420.LinkGoogle Scholar
  • Biyalogorsky E, Gerstner E (2004) Contingent pricing to reduce price risks. Marketing Sci. 23(1):146–155.LinkGoogle Scholar
  • Biyalogorsky E, Koenigsberg O (2014) The design and introduction of product lines when consumer valuations are uncertain. Production Oper. Management 23(9):1539–1548.CrossrefGoogle Scholar
  • Bonatti A (2011) Menu pricing and learning. Amer. Econom. J. Microeconom. 3(3):124–163.CrossrefGoogle Scholar
  • Braden DJ, Oren SS (1994) Nonlinear pricing to produce information. Marketing Sci. 13(3):310–326.LinkGoogle Scholar
  • Brezzi M, Lai TL.(2002) Optimal learning and experimentation in bandit problems. J. Econom. Dynam. Control 27(1):87–108.CrossrefGoogle Scholar
  • Cesa-Bianchi N, Lugosi G (2006) Prediction, Learning, and Games (Cambridge University Press, New York).CrossrefGoogle Scholar
  • Chintagunta P, Hanssens DM, Hauser JR (2016) Marketing science and big data. Marketing Sci. 35(3):341–342.LinkGoogle Scholar
  • den Boer AV (2015)Dynamic pricing and learning: Historical origins, current research, and new directions. Surveys Oper. Res. Management Sci. 20(1):1–18.CrossrefGoogle Scholar
  • Desai PS, Koenigsberg O, Purohit D (2010) Forward buying by retailers. J. Marketing Res. 47(1):90–102.CrossrefGoogle Scholar
  • Dubé J-P, Misra S (2017) Scalable price targeting. NBER Working Paper 23775, National Bureau of Economic Research, Cambridge, MA. http://www.nber.org/papers/w23775.CrossrefGoogle Scholar
  • Elmaghraby W, Keskinocak P (2003) Dynamic pricing in the presence of inventory considerations: Research overview, current practices, and future directions. Management Sci. 49(10):1287–1309.LinkGoogle Scholar
  • Erdem T, Keane MP (1996) Decision-making under uncertainty: Capturing dynamic brand choice processes in turbulent consumer goods markets. Marketing Sci. 15(1):1–20.LinkGoogle Scholar
  • Furman J, Simcoe T (2015) The economics of big data and differential pricing. The White House: President Barack Obama (blog) (February 6), https://obamawhitehouse.archives.gov/blog/2015/02/06/economics-big-data-and-differential-pricing.Google Scholar
  • Gittins JC (1989) Multi-Armed Bandit Allocation Indices, 1st ed. (John Wiley & Sons, Chichester, UK).Google Scholar
  • Gittins JC, Glazebrook K, Weber R (2011) Multi-Armed Bandit Allocation Indices, 2nd ed. (John Wiley & Sons, New York).CrossrefGoogle Scholar
  • Hall P, Park BU (2002) New methods for bias correction at endpoints and boundaries. Ann. Statist. 30(5):1460–1479.CrossrefGoogle Scholar
  • Handel B, Misra K (2015) Robust new product pricing. Marketing Sci. 34(6):864–881.LinkGoogle Scholar
  • Handel B, Misra K, Roberts K (2013) Robust firm pricing with panel data. J. Econometrics 174(2):165–185.CrossrefGoogle Scholar
  • Hauser JR, Urban GL, Liberali G, Braun M (2009) Website morphing. Marketing Sci. 28(2):202–223.LinkGoogle Scholar
  • Hendel I, Nevo A (2006) Measuring the implications of sales and consumer inventory behavior. Econometrica 74(6):1637–1673.CrossrefGoogle Scholar
  • Hitsch G (2006) Optimal dynamic product launch and exit under demand uncertainty. Marketing Sci. 25(1):25–30.LinkGoogle Scholar
  • Hviid M, Shaffer G (1999) Hassle costs: The achilles’ heel of price-matching guarantees. J. Econom. Management Strategy 8(4):489–521.CrossrefGoogle Scholar
  • Jiang Y, Shang J, Kemerer CF, Liu Y (2011) Optimizing e-tailer profits and customer savings: Pricing multistage customized online bundles. Marketing Sci. 30(4):737–752.LinkGoogle Scholar
  • Kalyanam K (1996) Pricing decisions under demand uncertainty: A bayesian mixture model approach. Marketing Sci. 15(3):207–221.LinkGoogle Scholar
  • Kalyanaram G, Winer RS (1995) Empirical generalizations from reference price research. Marketing Sci. 14(3, Part 2):G161–G169.LinkGoogle Scholar
  • Karunamuni RJ, Alberts T (2005) On boundary correction in kernel density estimation. Statist. Methodol. 2(3):191–212.CrossrefGoogle Scholar
  • Kleinberg R, Leighton FT (2014) The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions, Working paper, Akamai Technologies, Cambridge, MA.Google Scholar
  • Kuleshov V, Precup D (2014) Algorithms for the multi-armed bandit problem Working paper, McGill University, Montréal. https://arxiv.org/abs/1402.6028.Google Scholar
  • Lai TL (1987) Adaptive treatment allocation and the multi-armed bandit problem. Ann. Statist. 15(3):1091–1114.CrossrefGoogle Scholar
  • Lai TL, Robbins H (1985) Asymptotically efficient adaptive allocation rules. Adv. Appl. Math. 6(1):4–22.CrossrefGoogle Scholar
  • Lei Y, Jasin S, Sinha A (2014) Near-optimal bisection search for nonparametric dynamic pricing with inventory constraint. Ross School of Business Working Paper 1252, University of Michigan, Ann Arbor. https://ssrn.com/abstract=2509425.Google Scholar
  • Lodish LM (1980) Applied dynamic pricing and production models with specific application to broadcast spot pricing. J. Marketing Res. 17(2):203–211.CrossrefGoogle Scholar
  • Manski C (2005) Social Choice with Partial Knowledge of Treatment Response (Princeton University Press, Princeton, NJ).Google Scholar
  • Mas-Colell A, Whinston M, Green J (1995) Microeconomic Theory (Oxford University Press, New York).Google Scholar
  • Milnor J (1954) Games against nature. Thrall RM, Coombs CH, Davis RL, eds. Decision Processes (John Wiley & Sons, New York), 49–59.Google Scholar
  • Nair H (2007) Intertemporal price discrimination with forward-looking consumers: Application to the US market for console video-games. Quant. Marketing Econom. 5(3): 239–292.CrossrefGoogle Scholar
  • Nair H, Chintagunta P, Dube J-P (2004) Empirical analysis of indirect network effects in the market for personal digital assistants. Quant. Marketing Econom. 2(1):23–58.CrossrefGoogle Scholar
  • Oren SS, Smith SA, Wilson RB (1982) Nonlinear pricing in markets with interdependent demand. Marketing Sci. 1(3):287–313.LinkGoogle Scholar
  • Rajan A, Steinberg R, Steinberg R (1992) Dynamic pricing and ordering decisions by a monopolist. Management Sci. 38(2):240–262.LinkGoogle Scholar
  • Rao RC, Bass FM (1985) Competition, strategy, and price dynamics: A theoretical and empirical investigation. J. Marketing Res. 22(3):283–296.CrossrefGoogle Scholar
  • Rothschild M (1974) A two-armed bandit theory of market pricing. J. Econom. Theory 9(2):185–202.CrossrefGoogle Scholar
  • Schwartz EM, Bradlow ET, Fader PS (2017) Customer acquisition via display advertisements using multi-armed bandit experiments. Marketing Sci. 36(4):500–522.LinkGoogle Scholar
  • Smith SA (1986) New product pricing in quality sensitive markets. Marketing Sci. 5(1):70–87.LinkGoogle Scholar
  • Stoye J (2011) Axioms for minimax regret choice correspondences. J. Econom. Theory 146(11):2226–2251.CrossrefGoogle Scholar
  • Sutton RS, Barto AG (1998) Reinforcement Learning: An Introduction (MIT Press, Cambridge, MA).Google Scholar
  • Thompson WR (1933) On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25(3):285–294.CrossrefGoogle Scholar
  • Urban GL, Liberali G, MacDonald E, Bordley R, Hauser JR (2014) Morphing banner advertising. Marketing Sci. 33(1):27–46.LinkGoogle Scholar
  • Wald A (1950) Statistical Decision Functions (John Wiley & Sons, New York).Google Scholar
  • Wang Z, Hu M (2014) Committed versus contingent pricing under competition. Production Oper. Management 23(11):1919–1936.CrossrefGoogle Scholar
  • Wernerfelt B (1986) A special case of dynamic pricing policy. Management Sci. 32(12):1562–1566.LinkGoogle Scholar
  • Whittle P (1980) Multi-armed bandits and the Gittins index. J. Roy. Statist. Soc. Ser. B 42(2):143–149.CrossrefGoogle Scholar
  • Winer RS (1986) A reference price model of brand choice for frequently purchased products. J. Consumer Res. 13(2):250–256.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.