Data Aggregation and Demand Prediction

Maxime C. Cohen
Maxime C. Cohen
[email protected]
https://orcid.org/0000-0002-2474-3875
Desautels Faculty of Management, McGill University, Montreal, Quebec H3A 1G5, Canada;
Search for more papers by this author
,
Renyu Zhang
Corresponding Author
Renyu Zhang
[email protected]
https://orcid.org/0000-0003-0284-164X
Department of Decision Sciences and Managerial Economics, Business School, The Chinese University of Hong Kong, Hong Kong, China;
Search for more papers by this author
,
Kevin Jiao
Kevin Jiao
[email protected]
Stern School of Business, New York University, New York 10012
Search for more papers by this author

Desautels Faculty of Management, McGill University, Montreal, Quebec H3A 1G5, Canada;

Corresponding Author

Renyu Zhang

Department of Decision Sciences and Managerial Economics, Business School, The Chinese University of Hong Kong, Hong Kong, China;

Search for more papers by this author

Kevin Jiao

[email protected]

Stern School of Business, New York University, New York 10012

Search for more papers by this author

Published Online:7 Jul 2022https://doi.org/10.1287/opre.2022.2301

Abstract

We study how retailers can use data aggregation and clustering to improve demand prediction. High accuracy in demand prediction allows retailers to effectively manage their inventory as well as mitigate stock-outs and excess supply. A typical retail setting involves predicting demand for hundreds of items simultaneously. Although some items have a large amount of historical data, others were recently introduced and, thus, transaction data can be scarce. A common approach is to cluster several items and estimate a joint model for each cluster. In this vein, one can estimate some model parameters by aggregating the data from several items and other parameters at the individual-item level. We propose a practical method referred to as data aggregation with clustering ( $DAC$ ), which balances the tradeoff between data aggregation and model flexibility. $DAC$ allows us to predict demand while optimally identifying the features that should be estimated at the (i) item, (ii) cluster, and (iii) aggregate levels. We show that the $DAC$ algorithm yields a consistent and normal estimate, along with improved prediction errors relative to the decentralized benchmark, which estimates a different model for each item. Using both simulated and real data, we illustrate $DAC$ ’s improvement in prediction accuracy relative to a wide range of common benchmarks. Interestingly, the $DAC$ algorithm has theoretical and practical advantages and helps retailers uncover meaningful managerial insights.

Volume 70, Issue 5

September-October 2022

Pages iii-vi, 2597-3033, C2-C3

Article Information

Supplemental Material

Metrics

Information

Received:June 02, 2021
Accepted:March 11, 2022
Published Online:July 07, 2022

Cite as

Maxime C. Cohen, Renyu Zhang, Kevin Jiao (2022) Data Aggregation and Demand Prediction. Operations Research 70(5):2597-2618.

https://doi.org/10.1287/opre.2022.2301

Keywords

Acknowledgments

The authors thank Paul-Emile Gras and Arthur Pentecoste who helped us conduct the computations presented in Sections 5 and 6; the retail partner for sharing data; and Compute Canada for allowing us to use their computing resources.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Data Aggregation and Demand Prediction

Abstract

Volume 70, Issue 5

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News