Effective Adaptive Exploration of Prices and Promotions in Choice-Based Demand Models

Published Online:https://doi.org/10.1287/mksc.2023.0322

We consider the problem of setting the optimal prices and promotions for a multi product category when the firm lacks demand information. At each time, a customer arrives and chooses a product based on a discrete choice model where each product’s utility depends on product features, its price and promotion, and the customer’s features. Using a Thompson Sampling approach, we develop a regret-minimizing or alternatively, profit-maximizing algorithm for the retailer. We provide the first adaptive algorithm that simultaneously incorporates pricing and promotions into a discrete choice model. To make our algorithm computationally feasible over an infinite space of prices and promotions, we provide a novel method for learning the optimal price and promotion given a set of demand parameters. We also provide theoretical justification for our results and improve upon existing regret guarantees. Using simulations based on real-life grocery store data, we show that our method significantly outperforms existing approaches. In addition, we extend our methodology to a contextual setting, which allows for consumer heterogeneity and personalized pricing and promotion. Compared with existing works, our approach is agnostic to the parametric specification of the utility model and needs no assumptions on the underlying distribution of customer features.

History: Olivier Toubia served as the senior editor.

Supplemental Material: The online appendix and data files are available at https://doi.org/10.1287/mksc.2023.0322.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.