Context-Based Dynamic Pricing with Separable Demand Models
Abstract
Motivated by the empirical evidence observed from the real-world data set, this paper studies context-based dynamic pricing with separable demand models. Consider a seller selling a product over a finite horizon of T periods and facing an unknown expected demand function that admits a separable structure , where and denote the product’s price and features, respectively. The seller does not know the exact expression of or but can dynamically adjust prices in each period based on the observed features and demands to learn their forms. The seller’s objective is to maximize the T-period expected revenue. We systematically characterize the statistical complexity of the online learning problem under three configurations of demand models with different structures of and . For each model, we design an efficient online learning algorithm with a provable regret upper bound. We also show that the upper bound is generally unimprovable by proving a matching regret lower bound in certain parameter regimes. Our results reveal fundamental differences in the optimal regret rates when and are endowed with different structures. The numerical results demonstrate that our learning algorithms are more effective than benchmark algorithms for all the three models and also show the effects of the parameters associated with and on the algorithm’s empirical regret.
This paper was accepted by J. George Shanthikumar, data science.
Funding: The authors acknowledge support from the MIT Data Science Laboratory. J. Bu acknowledges support from the Research Grants Council of Hong Kong [Early Career Scheme Grant PolyU 25505322].
Supplemental Material: The online appendices and data files are available at https://doi.org/10.1287/mnsc.2022.02260.

