Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments

Kanishka Misra
Corresponding Author
Kanishka Misra
http://orcid.org/0000-0002-0106-1230
Rady School of Management, University of California, San Diego, La Jolla, California 92093;
Search for more papers by this author
,
Eric M. Schwartz
Eric M. Schwartz
Ross School of Business, University of Michigan, Ann Arbor, Michigan 48109;
Search for more papers by this author
,
Jacob Abernethy
Jacob Abernethy
http://orcid.org/0000-0002-3115-6804
School of Computer Science, College of Computing, Georgia Institute of Technologyy, Atlanta, Georgia 30332
Search for more papers by this author

Kanishka Misra

Corresponding Author

Kanishka Misra

http://orcid.org/0000-0002-0106-1230

Rady School of Management, University of California, San Diego, La Jolla, California 92093;

Search for more papers by this author

Eric M. Schwartz

Ross School of Business, University of Michigan, Ann Arbor, Michigan 48109;

Search for more papers by this author

Jacob Abernethy

http://orcid.org/0000-0002-3115-6804

School of Computer Science, College of Computing, Georgia Institute of Technologyy, Atlanta, Georgia 30332

Search for more papers by this author

Published Online:29 Mar 2019https://doi.org/10.1287/mksc.2018.1129

Abstract

Pricing managers at online retailers face a unique challenge. They must decide on real-time prices for a large number of products with incomplete demand information. The manager runs price experiments to learn about each product’s demand curve and the profit-maximizing price. In practice, balanced field price experiments can create high opportunity costs, because a large number of customers are presented with suboptimal prices. In this paper, we propose an alternative dynamic price experimentation policy. The proposed approach extends multiarmed bandit (MAB) algorithms from statistical machine learning to include microeconomic choice theory. Our automated pricing policy solves this MAB problem using a scalable distribution-free algorithm. We prove analytically that our method is asymptotically optimal for any weakly downward sloping demand curve. In a series of Monte Carlo simulations, we show that the proposed approach performs favorably compared with balanced field experiments and standard methods in dynamic pricing from computer science. In a calibrated simulation based on an existing pricing field experiment, we find that our algorithm can increase profits by 43% during the month of testing and 4% annually.

Data files and the online appendix are available at https://doi.org/10.1287/mksc.2018.1129.

Cited by
- The impact of competitive intelligence services on online marketplaces
  6 June 2026 | Production and Operations Management, Vol. 84
- Bid Shading in First-Price Auction: Nonstationary Bayesian Multiarmed Bandit Methods for Real-Time Bidding
  Mengzhuo Guo,
  Wuqi Zhang,
  Yiwen Shen,
  Qingpeng Zhang
  1 June 2026 | Information Systems Research, Vol. 0, No. 0
- State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards
  21 May 2026 | Algorithms, Vol. 19, No. 5
- Unleashing the predators: Autonomous predation and manipulation through algorithms
  Journal of Retailing, Vol. 24
- How Effective Is Suggested Pricing? Experimental Evidence from an E-Commerce Platform
  1 March 2026 | Journal of Marketing Research, Vol. 71
- Towards Human-AI Complementarity in Matching Tasks
  1 April 2026
- Dynamisches und personalisiertes Pricing – Anwendungsszenarien und Algorithmen
  1 June 2026
- The Impact of Competitive Intelligence Services on Online Marketplaces
  1 January 2026 | SSRN Electronic Journal, Vol. 3
- An Approach to the Conceptual and Strategic Dimensions of Marketing from Human and Artificial Intelligence Perspectives
  1 January 2026 | Interdisciplinary Description of Complex Systems, Vol. 24, No. 3
- A Tutorial on Distributed Multi-Armed Bandits
- Improving network dynamic pricing policies through offline reinforcement learning
  19 June 2025 | OR Spectrum, Vol. 47, No. 4
- Sustainable Development in Transition: Empirical Insights on Green Economy and Carbon Emissions from India
- Nonparametric Pricing Bandits Leveraging Informational Externalities to Learn the Demand Curve
  Ian N. Weaver,
  Vineet Kumar,
  Lalit Jain
  26 September 2025 | Marketing Science, Vol. 44, No. 6
- Data-Driven Dynamic Pricing: Sticky Fairness Concerns and the Exploitation-Exploration Trade-Off
  18 October 2025 | Journal of the Operational Research Society, Vol. 76
- LOLA: LLM-Assisted Online Learning Algorithm for Content Experiments
  Zikun Ye,
  Hema Yoganarasimhan,
  Yufeng Zheng
  24 March 2025 | Marketing Science, Vol. 44, No. 5
- Secure Best Arm Identification in the Presence of a Copycat
- Online Learning and Pricing for Multiple Products With Reference Price Effects
  14 January 2025 | Naval Research Logistics (NRL), Vol. 72, No. 5
- CluE: Cluster, Sample & Eliminate - Bayesian Block Elimination for Pure-Exploration with Non-binary Rewards and Limited Budget
- AI and Its Impact on the Growth of the Small Medium Enterprises
- Optimal Payment for Dynamic Treatment Regimes
  7 January 2025 | Production and Operations Management, Vol. 34, No. 7
- Decrease the price now, increase it later: a novel approach to demand learning and dynamic pricing of new experiential products through the lens of construal level theory
  27 December 2024 | Journal of Revenue and Pricing Management, Vol. 24, No. 3
- Cost-aware prediction service pricing with incomplete information
  6 March 2025 | The VLDB Journal, Vol. 34, No. 3
- Photon-Atom Hybrid Decision-Framework with Concurrent Exploration Acceleration
  12 February 2025 | ACS Photonics, Vol. 12, No. 3
- Selective Reviews of Bandit Problems in AI via a Statistical View
  18 February 2025 | Mathematics, Vol. 13, No. 4
- Online Causal Inference for Advertising in Real-Time Bidding Auctions
  Caio Waisman; ,
  Harikesh S. Nair; ,
  Carlos Carrion
  14 August 2024 | Marketing Science, Vol. 44, No. 1
- Diversify and Conquer: Bandits and Diversity for an Enhanced E-commerce Homepage Experience
  26 January 2025
- Rough Set Theoretic Approach for Solving the Multi-Armed Bandit Problems
  11 December 2024
- Dynamisches und personalisiertes Pricing – Anwendungsszenarien und Algorithmen
  25 October 2025
- Thompson sampling-based recursive block elimination for dynamic assignment under limited budget in pure-exploration
  18 December 2024 | Data Mining and Knowledge Discovery, Vol. 39, No. 1
- Health Consulting Services Recommendation Considering Patients’ Decision-Making Behaviors: A CNN and Multiarmed Bandit Approach
  IEEE Transactions on Engineering Management, Vol. 72
- Platform Pricing Algorithms: Examples, Fundamental Challenges, Potential Solutions
  1 January 2025 | SSRN Electronic Journal, Vol. 30
- Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making
- Selection of Spread by Multi-armed Bandits in Stationary Environment
  9 May 2025
- Sequential Block Elimination for Dynamic Pricing
- Pre-Registered Interim Analysis Designs (PRIADs): Increasing the Cost-Effectiveness of Hypothesis Testing
  22 April 2024 | Journal of Consumer Research, Vol. 51, No. 4
- Nonparametric multi-product dynamic pricing with demand learning via simultaneous price perturbation
  European Journal of Operational Research, Vol. 319, No. 1
- Artificial Intelligence for Web 3.0: A Comprehensive Survey
  14 May 2024 | ACM Computing Surveys, Vol. 56, No. 10
- Effective Adaptive Exploration of Prices and Promotions in Choice-Based Demand Models
  Lalit Jain,
  Zhaoqi Li,
  Erfan Loghmani,
  Blake Mason,
  Hema Yoganarasimhan
  30 May 2024 | Marketing Science, Vol. 43, No. 5
- Multi-Task Neural Linear Bandit for Exploration in Recommender Systems
  24 August 2024
- Exploratory Bandit Experiments with “Starter Packs” in a Free-to-Play Mobile Game
- AI-powered marketing: What, where, and how?
  International Journal of Information Management, Vol. 77
- Revolutionizing Marketing by Utilizing the Power of Artificial Intelligence
- МАРКЕТИНГОВІ МОЖЛИВОСТІ ПІДПРИЄМСТВ НА ОСНОВІ ШТУЧНОГО ІНТЕЛЕКТУ
  24 June 2024 | Трансформаційна економіка, No. 2 (07)
- A Bibliometric Analysis on Artificial Intelligence in Marketing: Implications for Scholars and Managers
  16 May 2024 | Journal of Internet Commerce, Vol. 23, No. 3
- A novel two-stage dynamic pricing model for logistics planning using an exploration–exploitation framework: A multi-armed bandit problem
  Expert Systems with Applications, Vol. 246
- Artificial Intelligence in Costumer Acquisition
- Optimizing Marketing Campaigns With AI-Driven Insights on Mobile User Behavior
- Dynamic Pricing Prediction with Machine Learning Algorithm
- Corruption Robust Dynamic Pricing in Liner Shipping under Capacity Constraint
- Finding the optimal exploration-exploitation trade-off online through Bayesian risk estimation and minimization
  Artificial Intelligence, Vol. 330
- Personalized Dynamic Pricing Based on Improved Thompson Sampling
  9 April 2024 | Mathematics, Vol. 12, No. 8
- Long-Term Value of Exploration: Measurements, Findings and Algorithms
  4 March 2024
- The pricing strategies of online grocery retailers
  28 November 2023 | Quantitative Marketing and Economics, Vol. 22, No. 1
- How Does Competition Affect Exploration vs. Exploitation? A Tale of Two Recommendation Algorithms
  H. Henry Cao,
  Liye Ma,
  Z. Eddie Ning,
  Baohong Sun
  30 March 2023 | Management Science, Vol. 70, No. 2
- Distribution-Free Contextual Dynamic Pricing
  Yiyun Luo,
  Will Wei Sun,
  Yufeng Liu
  11 May 2023 | Mathematics of Operations Research, Vol. 49, No. 1
- Data Management
  24 May 2024
- Forecasting e-learning Course Purchases Using Deep Learning Based on Customer Retention
  7 August 2024
- Main Martechs in Brief
  20 April 2024
- Predictive Analytics in Marketing Using Artificial Intelligence
  11 April 2024
- Meta-Learning Customer Preference Dynamics on Digital Platforms
  1 January 2024 | SSRN Electronic Journal, Vol. 112
- How Effective is Suggested Pricing?: Experimental Evidence from an E-Commerce Platform
  1 January 2024 | SSRN Electronic Journal, Vol. 41
- PAC-Bayes Bounds for Bandit Problems: A Survey and Experimental Comparison
  IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 45, No. 12
- Machine Learning-driven Dynamic Pricing Strategies in E-Commerce
- Unravelling the techno-functional building blocks of metaverse ecosystems – A review and research agenda
  International Journal of Information Management Data Insights, Vol. 3, No. 2
- Dynamic pricing of regulated field services using reinforcement learning
  19 January 2023 | IISE Transactions, Vol. 55, No. 10
- Examining the gamified effect of the blindbox design: The moderating role of price
  Journal of Retailing and Consumer Services, Vol. 74
- How to Maximize Clicks for Display Advertisement in Digital Marketing? A Reinforcement Learning Approach
  21 July 2022 | Information Systems Frontiers, Vol. 25, No. 4
- Rethinking the role of uncertainty and risk in Marketing
  6 July 2023 | Journal of Decision Systems, Vol. 6
- Dynamic Coupon Targeting Using Batch Deep Reinforcement Learning: An Application to Livestream Shopping
  Xiao Liu
  20 October 2022 | Marketing Science, Vol. 42, No. 4
- Demand Learning and Pricing for Varying Assortments
  Kris Johnson Ferreira,
  Emily Mower
  14 March 2022 | Manufacturing & Service Operations Management, Vol. 25, No. 4
- (Private) Kernelized Bandits with Distributed Biased Feedback
  27 June 2023 | ACM SIGMETRICS Performance Evaluation Review, Vol. 51, No. 1
- (Private) Kernelized Bandits with Distributed Biased Feedback
  19 June 2023
- Reinforcement Learning in Economics and Finance
  23 April 2021 | Computational Economics, Vol. 62, No. 1
- Personal antecedents of perceived deceptive pricing in online retailing: the moderating role of price inequality
  9 June 2021 | Electronic Commerce Research, Vol. 23, No. 2
- Dynamic Pricing with Parametric Demand Learning and Reference-Price Effects
  21 May 2023 | Mathematics, Vol. 11, No. 10
- Pricing Prediction Services for Profit Maximization with Incomplete Information
- The Economics of Artificial Intelligence: A Marketing Perspective
- Artificial Intelligence and Pricing
- (Private) Kernelized Bandits with Distributed Biased Feedback
  2 March 2023 | Proceedings of the ACM on Measurement and Analysis of Computing Systems, Vol. 7, No. 1
- Sequential Experimentation and Learning
  24 March 2023
- Robust Near-Optimal Arm Identification With Strongly-Adaptive Adversaries
  IEEE Transactions on Signal Processing, Vol. 71
- Online Learning and Pricing for Multiple Products with Reference Price Effects
  1 January 2023 | SSRN Electronic Journal, Vol. 64
- Buy When? Survival Machine Learning Model Comparison for Purchase Timing.
  1 January 2023 | SSRN Electronic Journal, Vol. 10
- Reducing Participant Costs Without Sacrificing Statistical Power in Consumer Research: An Introduction to Pre-registered Interim Analysis Designs (Priads)
  1 January 2023 | SSRN Electronic Journal, Vol. 74
- Unleashing the Predators: Autonomous Predation and Manipulation Through Algorithms
  1 January 2023 | SSRN Electronic Journal, Vol. 24
- Amazon's Artificial Intelligence in Retail Novelty - Case Study
  31 December 2022 | International Journal of Case Studies in Business, IT, and Education
- Risk-Averse Multi-Armed Bandits with Unobserved Confounders: A Case Study in Emotion Regulation in Mobile Health
- Dismemberment and Design for Controlling the Replication Variance of Regret for the Multi-Armed Bandit
  1 August 2022 | Journal of Statistical Theory and Practice, Vol. 16, No. 4
- SPECTS: Price Elasticity Computation Model Using Thompson Sampling
- Setting Reserve Prices in Second-Price Auctions with Unobserved Bids
  Jason Rhuggenaath,
  Alp Akcay,
  Yingqian Zhang,
  Uzay Kaymak
  29 July 2022 | INFORMS Journal on Computing, Vol. 34, No. 6
- Bandit-based inventory optimisation: Reinforcement learning in multi-echelon supply chains
  International Journal of Production Economics, Vol. 252
- How displaying price discounts can mitigate negative customer reactions to dynamic pricing
  Journal of Business Research, Vol. 148
- Pricing the Long Tail by Explainable Product Aggregation and Monotonic Bandits
  14 August 2022
- Dynamic Data Transaction in Crowdsensing Based on Multi-Armed Bandits and Shapley Value
  IEEE Transactions on Sustainable Computing, Vol. 7, No. 3
- A Framework for Collaborative Artificial Intelligence in Marketing
  Journal of Retailing, Vol. 98, No. 2
- Dynamic pricing of differentiated products with incomplete information based on reinforcement learning
  24 May 2022 | IET Collaborative Intelligent Manufacturing, Vol. 4, No. 2
- Learning to Set Prices
  23 February 2022 | Journal of Marketing Research, Vol. 59, No. 2
- Dynamic pricing of perishable food as a sustainable business model
  27 September 2021 | British Food Journal, Vol. 124, No. 5
- Online and offline retailing: What we know and directions for future research
  Journal of Retailing, Vol. 98, No. 1
- Show Me the Whole World
  15 February 2022
- Managing Crowding and Consumers' Perceived Store Density
- Understanding Managers’ Trade-Offs Between Exploration and Exploitation
  Alina Ferecatu,
  Arnaud De Bruyn
  21 October 2021 | Marketing Science, Vol. 41, No. 1
- Dynamic Pricing with Demand Learning: Emerging Topics and State of the Art
  12 April 2022
- Machine Learning and Marketing: A Systematic Literature Review
  IEEE Access, Vol. 10
- Learning in Restless Bandits Under Exogenous Global Markov Process
  IEEE Transactions on Signal Processing, Vol. 70
- Artificial Intelligence and Pricing
  SSRN Electronic Journal, Vol. 39
- The Effect of the Number of E-Stores Subscribers on Chinese Smartphone Brand Purchases: Evidence from a Machine Learning Model.
  SSRN Electronic Journal, Vol. 43
- A Scalable Recommendation Engine for New Users and Items
  SSRN Electronic Journal, Vol. 10
- Nonparametric Bandits Leveraging Informational Externalities to Learn the Demand Curve
  SSRN Electronic Journal, Vol. 58
- A Prescriptive Analytics Framework for Optimal Policy Deployment Using Heterogeneous Treatment Effects
  1 December 2021 | MIS Quarterly, Vol. 45, No. 4
- Shopping with(out) distancing: modelling the personal space to limit the spread of contagious disease among consumers in retail stores
  6 December 2021 | Journal of Marketing Management, Vol. 37, No. 17-18
- Enhancing store layout decision with agent-based simulations of consumers’ density
  Expert Systems with Applications, Vol. 182
- Theory development in servitization through the application of fsQCA and experiments
  29 April 2021 | International Journal of Operations & Production Management, Vol. 41, No. 5
- Reinforcement learning for content's customization: a first step of experimentation in Skyscanner
  15 January 2021 | Industrial Management & Data Systems, Vol. 121, No. 6
- Artificial intelligence in marketing: Systematic review and future research direction
  International Journal of Information Management Data Insights, Vol. 1, No. 1
- As the wheel turns toward the future of retailing
  22 January 2021 | Journal of Marketing Theory and Practice, Vol. 29, No. 1
- Frontiers: Algorithmic Collusion: Supra-competitive Prices via Independent Algorithms
  Karsten T. Hansen,
  Kanishka Misra,
  Mallesh M. Pai
  8 January 2021 | Marketing Science, Vol. 40, No. 1
- A strategic framework for artificial intelligence in marketing
  4 November 2020 | Journal of the Academy of Marketing Science, Vol. 49, No. 1
- Exploration in Action: The Role of Randomized Control Trials in Online Demand Generation
  SSRN Electronic Journal, Vol. 68
- The Pricing Strategies of Online Grocery Retailers
  SSRN Electronic Journal, Vol. 11
- The Effects of Verbal and Visual Marketing Content in Social Media Settings: A Deep Learning Approach
  SSRN Electronic Journal, Vol. 2
- Emergence of Disruptive Technologies & Their Impact on Marketing of Products and Services
  SSRN Electronic Journal, Vol. 72
- Soul and machine (learning)
  27 August 2020 | Marketing Letters, Vol. 31, No. 4
- Multi-Armed-Bandit-based Shilling Attack on Collaborative Filtering Recommender Systems
- Introduction to the Special Issue on Marketing Science and Field Experiments
  Leif Nelson,
  Duncan Simester,
  K. Sudhir
  4 November 2020 | Marketing Science, Vol. 39, No. 6
- Machine Learning in Marketing: Overview, Learning Strategies, Applications, and Future Developments
  31 August 2020 | Foundations and Trends® in Marketing, Vol. 14, No. 3
- How Does Competition Affect Exploration vs. Exploitation? A Tale of Two Recommendation Algorithms
  SSRN Electronic Journal, Vol. 54
- Man(ager Heuristics) vs. Machine (Learning): Automation for Prediction of Customer Value for Brick-and-Mortar Retailers
  SSRN Electronic Journal, Vol. 28
- Test & Roll: Profit-Maximizing A/B Tests
  Elea McDonnell Feit,
  Ron Berman
  14 November 2019 | Marketing Science, Vol. 38, No. 6
- How price promotions work: A review of practice and theory
- Soul and Machine (Learning)
  SSRN Electronic Journal, Vol. 47
- Learning to Set Prices in the Washington State Liquor Market
  SSRN Electronic Journal, Vol. 58
- Profit-Maximizing A/B Tests
  SSRN Electronic Journal, Vol. 28

Volume 38, Issue 2

March-April 2019

Pages 193-364, ii-ii

Article Information

Supplemental Material

Metrics

Information

Received:June 04, 2017
Accepted:May 25, 2018
Published Online:March 29, 2019

Cite as

Kanishka Misra, Eric M. Schwartz, Jacob Abernethy (2019) Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments. Marketing Science 38(2):226-252.

https://doi.org/10.1287/mksc.2018.1129

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments

Abstract

Volume 38, Issue 2

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News