Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments

Kanishka Misra
Corresponding Author
Kanishka Misra
http://orcid.org/0000-0002-0106-1230
Rady School of Management, University of California, San Diego, La Jolla, California 92093;
Search for more papers by this author
,
Eric M. Schwartz
Eric M. Schwartz
Ross School of Business, University of Michigan, Ann Arbor, Michigan 48109;
Search for more papers by this author
,
Jacob Abernethy
Jacob Abernethy
http://orcid.org/0000-0002-3115-6804
School of Computer Science, College of Computing, Georgia Institute of Technologyy, Atlanta, Georgia 30332
Search for more papers by this author

Kanishka Misra

Corresponding Author

Kanishka Misra

http://orcid.org/0000-0002-0106-1230

Rady School of Management, University of California, San Diego, La Jolla, California 92093;

Search for more papers by this author

Eric M. Schwartz

Ross School of Business, University of Michigan, Ann Arbor, Michigan 48109;

Search for more papers by this author

Jacob Abernethy

http://orcid.org/0000-0002-3115-6804

School of Computer Science, College of Computing, Georgia Institute of Technologyy, Atlanta, Georgia 30332

Search for more papers by this author

Published Online:29 Mar 2019https://doi.org/10.1287/mksc.2018.1129

References

Acquisti A, Varian HR (2005) Conditioning prices on purchase history. Marketing Sci. 24(3):367–381.Link, Google Scholar
Aghion P, Bolton P, Harris C, Jullien B (1991) Optimal learning by experimentation. Rev. Econom. Stud. 58(4):621–654.Crossref, Google Scholar
Agrawal R (1955) Sample mean based index policies with O(log n) regret for the multi-armed bandit problem. Adv. Appl. Probab. 27(4):1054–1078.Crossref, Google Scholar
Akçay Y, Natarajan HP, Xu SH (2010) Joint dynamic pricing of multiple perishable products under consumer choice. Management Sci. 56(8):1345–1361.Link, Google Scholar
Anderson E, Jaimovich N, Simester D (2015) Price stickiness: Empirical evidence of the menu cost channel. Rev. Econom. Statist. 97(4):813–826.Crossref, Google Scholar
Audibert JY, Munos R, Szepesvári C (2009) Exploration-exploitation trade-off using variance estimates in multi-armed bandits. Theoret. Comput. Sci. 410(19):1876–1902.Crossref, Google Scholar
Auer P (2002) Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learn. Res. 3(November):397–422.Google Scholar
Auer P, Cesa-Bianchi N, Fischer P (2002) Finite-time analysis of the multiarmed bandit problem. Machine Learn. 47(2/3):235–256.Crossref, Google Scholar
Aviv Y, Pazcal A (2002) Pricing of short life-cycle products through active learning. Working paper, Washington University of St. Louis, St. Louis.Google Scholar
Baker W, Kiewell D, Winkler G (2014) Using big data to make better pricing decisions. McKinsey & Company (June), http://www.mckinsey.com/business-functions/marketing-and-sales/our-insights/using-big-data-to-make-better-pricing-decisions.Google Scholar
Bayus BL (1992) The dynamic pricing of next generation consumer durables. Marketing Sci. 11(3):251–265.Link, Google Scholar
Bergemann D, Schlag K (2008) Pricing without priors. J. Eur. Econom. Assoc. 6(2-3):560–569.Crossref, Google Scholar
Bergemann D, Schlag K (2011) Robust monopoly pricing. J. Econom. Theory 146(6):2527–2543.Crossref, Google Scholar
Bergemann D, Valimaki J (2000) Experimentation in markets. Rev. Econom. Stud. 67:213–234.Crossref, Google Scholar
Berger JO (1985) Statistical Decision Theory and Bayesian Analysis, 2nd ed. (Springer, New York).Crossref, Google Scholar
Besbes O, Zeevi A (2009) Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Oper. Res. 57(6):1407–1420.Link, Google Scholar
Biyalogorsky E, Gerstner E (2004) Contingent pricing to reduce price risks. Marketing Sci. 23(1):146–155.Link, Google Scholar
Biyalogorsky E, Koenigsberg O (2014) The design and introduction of product lines when consumer valuations are uncertain. Production Oper. Management 23(9):1539–1548.Crossref, Google Scholar
Bonatti A (2011) Menu pricing and learning. Amer. Econom. J. Microeconom. 3(3):124–163.Crossref, Google Scholar
Braden DJ, Oren SS (1994) Nonlinear pricing to produce information. Marketing Sci. 13(3):310–326.Link, Google Scholar
Brezzi M, Lai TL.(2002) Optimal learning and experimentation in bandit problems. J. Econom. Dynam. Control 27(1):87–108.Crossref, Google Scholar
Cesa-Bianchi N, Lugosi G (2006) Prediction, Learning, and Games (Cambridge University Press, New York).Crossref, Google Scholar
Chintagunta P, Hanssens DM, Hauser JR (2016) Marketing science and big data. Marketing Sci. 35(3):341–342.Link, Google Scholar
den Boer AV (2015)Dynamic pricing and learning: Historical origins, current research, and new directions. Surveys Oper. Res. Management Sci. 20(1):1–18.Crossref, Google Scholar
Desai PS, Koenigsberg O, Purohit D (2010) Forward buying by retailers. J. Marketing Res. 47(1):90–102.Crossref, Google Scholar
Dubé J-P, Misra S (2017) Scalable price targeting. NBER Working Paper 23775, National Bureau of Economic Research, Cambridge, MA. http://www.nber.org/papers/w23775.Crossref, Google Scholar
Elmaghraby W, Keskinocak P (2003) Dynamic pricing in the presence of inventory considerations: Research overview, current practices, and future directions. Management Sci. 49(10):1287–1309.Link, Google Scholar
Erdem T, Keane MP (1996) Decision-making under uncertainty: Capturing dynamic brand choice processes in turbulent consumer goods markets. Marketing Sci. 15(1):1–20.Link, Google Scholar
Furman J, Simcoe T (2015) The economics of big data and differential pricing. The White House: President Barack Obama (blog) (February 6), https://obamawhitehouse.archives.gov/blog/2015/02/06/economics-big-data-and-differential-pricing.Google Scholar
Gittins JC (1989) Multi-Armed Bandit Allocation Indices, 1st ed. (John Wiley & Sons, Chichester, UK).Google Scholar
Gittins JC, Glazebrook K, Weber R (2011) Multi-Armed Bandit Allocation Indices, 2nd ed. (John Wiley & Sons, New York).Crossref, Google Scholar
Hall P, Park BU (2002) New methods for bias correction at endpoints and boundaries. Ann. Statist. 30(5):1460–1479.Crossref, Google Scholar
Handel B, Misra K (2015) Robust new product pricing. Marketing Sci. 34(6):864–881.Link, Google Scholar
Handel B, Misra K, Roberts K (2013) Robust firm pricing with panel data. J. Econometrics 174(2):165–185.Crossref, Google Scholar
Hauser JR, Urban GL, Liberali G, Braun M (2009) Website morphing. Marketing Sci. 28(2):202–223.Link, Google Scholar
Hendel I, Nevo A (2006) Measuring the implications of sales and consumer inventory behavior. Econometrica 74(6):1637–1673.Crossref, Google Scholar
Hitsch G (2006) Optimal dynamic product launch and exit under demand uncertainty. Marketing Sci. 25(1):25–30.Link, Google Scholar
Hviid M, Shaffer G (1999) Hassle costs: The achilles’ heel of price-matching guarantees. J. Econom. Management Strategy 8(4):489–521.Crossref, Google Scholar
Jiang Y, Shang J, Kemerer CF, Liu Y (2011) Optimizing e-tailer profits and customer savings: Pricing multistage customized online bundles. Marketing Sci. 30(4):737–752.Link, Google Scholar
Kalyanam K (1996) Pricing decisions under demand uncertainty: A bayesian mixture model approach. Marketing Sci. 15(3):207–221.Link, Google Scholar
Kalyanaram G, Winer RS (1995) Empirical generalizations from reference price research. Marketing Sci. 14(3, Part 2):G161–G169.Link, Google Scholar
Karunamuni RJ, Alberts T (2005) On boundary correction in kernel density estimation. Statist. Methodol. 2(3):191–212.Crossref, Google Scholar
Kleinberg R, Leighton FT (2014) The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions, Working paper, Akamai Technologies, Cambridge, MA.Google Scholar
Kuleshov V, Precup D (2014) Algorithms for the multi-armed bandit problem Working paper, McGill University, Montréal. https://arxiv.org/abs/1402.6028.Google Scholar
Lai TL (1987) Adaptive treatment allocation and the multi-armed bandit problem. Ann. Statist. 15(3):1091–1114.Crossref, Google Scholar
Lai TL, Robbins H (1985) Asymptotically efficient adaptive allocation rules. Adv. Appl. Math. 6(1):4–22.Crossref, Google Scholar
Lei Y, Jasin S, Sinha A (2014) Near-optimal bisection search for nonparametric dynamic pricing with inventory constraint. Ross School of Business Working Paper 1252, University of Michigan, Ann Arbor. https://ssrn.com/abstract=2509425.Google Scholar
Lodish LM (1980) Applied dynamic pricing and production models with specific application to broadcast spot pricing. J. Marketing Res. 17(2):203–211.Crossref, Google Scholar
Manski C (2005) Social Choice with Partial Knowledge of Treatment Response (Princeton University Press, Princeton, NJ).Google Scholar
Mas-Colell A, Whinston M, Green J (1995) Microeconomic Theory (Oxford University Press, New York).Google Scholar
Milnor J (1954) Games against nature. Thrall RM, Coombs CH, Davis RL, eds. Decision Processes (John Wiley & Sons, New York), 49–59.Google Scholar
Nair H (2007) Intertemporal price discrimination with forward-looking consumers: Application to the US market for console video-games. Quant. Marketing Econom. 5(3): 239–292.Crossref, Google Scholar
Nair H, Chintagunta P, Dube J-P (2004) Empirical analysis of indirect network effects in the market for personal digital assistants. Quant. Marketing Econom. 2(1):23–58.Crossref, Google Scholar
Oren SS, Smith SA, Wilson RB (1982) Nonlinear pricing in markets with interdependent demand. Marketing Sci. 1(3):287–313.Link, Google Scholar
Rajan A, Steinberg R, Steinberg R (1992) Dynamic pricing and ordering decisions by a monopolist. Management Sci. 38(2):240–262.Link, Google Scholar
Rao RC, Bass FM (1985) Competition, strategy, and price dynamics: A theoretical and empirical investigation. J. Marketing Res. 22(3):283–296.Crossref, Google Scholar
Rothschild M (1974) A two-armed bandit theory of market pricing. J. Econom. Theory 9(2):185–202.Crossref, Google Scholar
Schwartz EM, Bradlow ET, Fader PS (2017) Customer acquisition via display advertisements using multi-armed bandit experiments. Marketing Sci. 36(4):500–522.Link, Google Scholar
Smith SA (1986) New product pricing in quality sensitive markets. Marketing Sci. 5(1):70–87.Link, Google Scholar
Stoye J (2011) Axioms for minimax regret choice correspondences. J. Econom. Theory 146(11):2226–2251.Crossref, Google Scholar
Sutton RS, Barto AG (1998) Reinforcement Learning: An Introduction (MIT Press, Cambridge, MA).Google Scholar
Thompson WR (1933) On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25(3):285–294.Crossref, Google Scholar
Urban GL, Liberali G, MacDonald E, Bordley R, Hauser JR (2014) Morphing banner advertising. Marketing Sci. 33(1):27–46.Link, Google Scholar
Wald A (1950) Statistical Decision Functions (John Wiley & Sons, New York).Google Scholar
Wang Z, Hu M (2014) Committed versus contingent pricing under competition. Production Oper. Management 23(11):1919–1936.Crossref, Google Scholar
Wernerfelt B (1986) A special case of dynamic pricing policy. Management Sci. 32(12):1562–1566.Link, Google Scholar
Whittle P (1980) Multi-armed bandits and the Gittins index. J. Roy. Statist. Soc. Ser. B 42(2):143–149.Crossref, Google Scholar
Winer RS (1986) A reference price model of brand choice for frequently purchased products. J. Consumer Res. 13(2):250–256.Crossref, Google Scholar

Volume 38, Issue 2

March-April 2019

Pages 193-364, ii-ii

Article Information

Supplemental Material

Metrics

Information

Received:June 04, 2017
Accepted:May 25, 2018
Published Online:March 29, 2019

Cite as

Kanishka Misra, Eric M. Schwartz, Jacob Abernethy (2019) Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments. Marketing Science 38(2):226-252.

https://doi.org/10.1287/mksc.2018.1129

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments

References

Volume 38, Issue 2

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News