Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments
Published Online:29 Mar 2019https://doi.org/10.1287/mksc.2018.1129
References
- (2005) Conditioning prices on purchase history. Marketing Sci. 24(3):367–381.Link, Google Scholar
- (1991) Optimal learning by experimentation. Rev. Econom. Stud. 58(4):621–654.Crossref, Google Scholar
- (1955) Sample mean based index policies with O(log n) regret for the multi-armed bandit problem. Adv. Appl. Probab. 27(4):1054–1078.Crossref, Google Scholar
- (2010) Joint dynamic pricing of multiple perishable products under consumer choice. Management Sci. 56(8):1345–1361.Link, Google Scholar
- (2015) Price stickiness: Empirical evidence of the menu cost channel. Rev. Econom. Statist. 97(4):813–826.Crossref, Google Scholar
- (2009) Exploration-exploitation trade-off using variance estimates in multi-armed bandits. Theoret. Comput. Sci. 410(19):1876–1902.Crossref, Google Scholar
- (2002) Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learn. Res. 3(November):397–422.Google Scholar
- (2002) Finite-time analysis of the multiarmed bandit problem. Machine Learn. 47(2/3):235–256.Crossref, Google Scholar
- (2002) Pricing of short life-cycle products through active learning. Working paper, Washington University of St. Louis, St. Louis.Google Scholar
- (2014) Using big data to make better pricing decisions. McKinsey & Company (June), http://www.mckinsey.com/business-functions/marketing-and-sales/our-insights/using-big-data-to-make-better-pricing-decisions.Google Scholar
- (1992) The dynamic pricing of next generation consumer durables. Marketing Sci. 11(3):251–265.Link, Google Scholar
- (2008) Pricing without priors. J. Eur. Econom. Assoc. 6(2-3):560–569.Crossref, Google Scholar
- (2011) Robust monopoly pricing. J. Econom. Theory 146(6):2527–2543.Crossref, Google Scholar
- (2000) Experimentation in markets. Rev. Econom. Stud. 67:213–234.Crossref, Google Scholar
- (1985) Statistical Decision Theory and Bayesian Analysis, 2nd ed. (Springer, New York).Crossref, Google Scholar
- (2009) Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Oper. Res. 57(6):1407–1420.Link, Google Scholar
- (2004) Contingent pricing to reduce price risks. Marketing Sci. 23(1):146–155.Link, Google Scholar
- (2014) The design and introduction of product lines when consumer valuations are uncertain. Production Oper. Management 23(9):1539–1548.Crossref, Google Scholar
- (2011) Menu pricing and learning. Amer. Econom. J. Microeconom. 3(3):124–163.Crossref, Google Scholar
- (1994) Nonlinear pricing to produce information. Marketing Sci. 13(3):310–326.Link, Google Scholar
- .(2002) Optimal learning and experimentation in bandit problems. J. Econom. Dynam. Control 27(1):87–108.Crossref, Google Scholar
- (2006) Prediction, Learning, and Games (Cambridge University Press, New York).Crossref, Google Scholar
- (2016) Marketing science and big data. Marketing Sci. 35(3):341–342.Link, Google Scholar
- (2015)Dynamic pricing and learning: Historical origins, current research, and new directions. Surveys Oper. Res. Management Sci. 20(1):1–18.Crossref, Google Scholar
- (2010) Forward buying by retailers. J. Marketing Res. 47(1):90–102.Crossref, Google Scholar
- (2017) Scalable price targeting. NBER Working Paper 23775, National Bureau of Economic Research, Cambridge, MA. http://www.nber.org/papers/w23775.Crossref, Google Scholar
- (2003) Dynamic pricing in the presence of inventory considerations: Research overview, current practices, and future directions. Management Sci. 49(10):1287–1309.Link, Google Scholar
- (1996) Decision-making under uncertainty: Capturing dynamic brand choice processes in turbulent consumer goods markets. Marketing Sci. 15(1):1–20.Link, Google Scholar
- , (2015) The economics of big data and differential pricing. The White House: President Barack Obama (blog) (February 6), https://obamawhitehouse.archives.gov/blog/2015/02/06/economics-big-data-and-differential-pricing.Google Scholar
- (1989) Multi-Armed Bandit Allocation Indices, 1st ed. (John Wiley & Sons, Chichester, UK).Google Scholar
- (2011) Multi-Armed Bandit Allocation Indices, 2nd ed. (John Wiley & Sons, New York).Crossref, Google Scholar
- (2002) New methods for bias correction at endpoints and boundaries. Ann. Statist. 30(5):1460–1479.Crossref, Google Scholar
- (2015) Robust new product pricing. Marketing Sci. 34(6):864–881.Link, Google Scholar
- (2013) Robust firm pricing with panel data. J. Econometrics 174(2):165–185.Crossref, Google Scholar
- (2009) Website morphing. Marketing Sci. 28(2):202–223.Link, Google Scholar
- (2006) Measuring the implications of sales and consumer inventory behavior. Econometrica 74(6):1637–1673.Crossref, Google Scholar
- (2006) Optimal dynamic product launch and exit under demand uncertainty. Marketing Sci. 25(1):25–30.Link, Google Scholar
- (1999) Hassle costs: The achilles’ heel of price-matching guarantees. J. Econom. Management Strategy 8(4):489–521.Crossref, Google Scholar
- (2011) Optimizing e-tailer profits and customer savings: Pricing multistage customized online bundles. Marketing Sci. 30(4):737–752.Link, Google Scholar
- (1996) Pricing decisions under demand uncertainty: A bayesian mixture model approach. Marketing Sci. 15(3):207–221.Link, Google Scholar
- (1995) Empirical generalizations from reference price research. Marketing Sci. 14(3, Part 2):G161–G169.Link, Google Scholar
- (2005) On boundary correction in kernel density estimation. Statist. Methodol. 2(3):191–212.Crossref, Google Scholar
- (2014) The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions, Working paper, Akamai Technologies, Cambridge, MA.Google Scholar
- (2014) Algorithms for the multi-armed bandit problem Working paper, McGill University, Montréal. https://arxiv.org/abs/1402.6028.Google Scholar
- (1987) Adaptive treatment allocation and the multi-armed bandit problem. Ann. Statist. 15(3):1091–1114.Crossref, Google Scholar
- (1985) Asymptotically efficient adaptive allocation rules. Adv. Appl. Math. 6(1):4–22.Crossref, Google Scholar
- (2014) Near-optimal bisection search for nonparametric dynamic pricing with inventory constraint. Ross School of Business Working Paper 1252, University of Michigan, Ann Arbor. https://ssrn.com/abstract=2509425.Google Scholar
- (1980) Applied dynamic pricing and production models with specific application to broadcast spot pricing. J. Marketing Res. 17(2):203–211.Crossref, Google Scholar
- (2005) Social Choice with Partial Knowledge of Treatment Response (Princeton University Press, Princeton, NJ).Google Scholar
- (1995) Microeconomic Theory (Oxford University Press, New York).Google Scholar
- (1954) Games against nature. Thrall RM, Coombs CH, Davis RL, eds. Decision Processes (John Wiley & Sons, New York), 49–59.Google Scholar
- (2007) Intertemporal price discrimination with forward-looking consumers: Application to the US market for console video-games. Quant. Marketing Econom. 5(3): 239–292.Crossref, Google Scholar
- (2004) Empirical analysis of indirect network effects in the market for personal digital assistants. Quant. Marketing Econom. 2(1):23–58.Crossref, Google Scholar
- (1982) Nonlinear pricing in markets with interdependent demand. Marketing Sci. 1(3):287–313.Link, Google Scholar
- (1992) Dynamic pricing and ordering decisions by a monopolist. Management Sci. 38(2):240–262.Link, Google Scholar
- (1985) Competition, strategy, and price dynamics: A theoretical and empirical investigation. J. Marketing Res. 22(3):283–296.Crossref, Google Scholar
- (1974) A two-armed bandit theory of market pricing. J. Econom. Theory 9(2):185–202.Crossref, Google Scholar
- (2017) Customer acquisition via display advertisements using multi-armed bandit experiments. Marketing Sci. 36(4):500–522.Link, Google Scholar
- (1986) New product pricing in quality sensitive markets. Marketing Sci. 5(1):70–87.Link, Google Scholar
- (2011) Axioms for minimax regret choice correspondences. J. Econom. Theory 146(11):2226–2251.Crossref, Google Scholar
- (1998) Reinforcement Learning: An Introduction (MIT Press, Cambridge, MA).Google Scholar
- (1933) On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25(3):285–294.Crossref, Google Scholar
- (2014) Morphing banner advertising. Marketing Sci. 33(1):27–46.Link, Google Scholar
- (1950) Statistical Decision Functions (John Wiley & Sons, New York).Google Scholar
- (2014) Committed versus contingent pricing under competition. Production Oper. Management 23(11):1919–1936.Crossref, Google Scholar
- (1986) A special case of dynamic pricing policy. Management Sci. 32(12):1562–1566.Link, Google Scholar
- (1980) Multi-armed bandits and the Gittins index. J. Roy. Statist. Soc. Ser. B 42(2):143–149.Crossref, Google Scholar
- (1986) A reference price model of brand choice for frequently purchased products. J. Consumer Res. 13(2):250–256.Crossref, Google Scholar

