Comparing Sequential Forecasters

Yo Joong Choe
Corresponding Author
Yo Joong Choe
[email protected]
https://orcid.org/0000-0002-0614-9477
Data Science Institute, University of Chicago, Chicago, Illinois 60637;
Search for more papers by this author
,
Aaditya Ramdas
Aaditya Ramdas
[email protected]
https://orcid.org/0000-0003-0497-311X
Department of Statistics and Data Science, Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
Search for more papers by this author

Yo Joong Choe

Corresponding Author

Yo Joong Choe

[email protected]

https://orcid.org/0000-0002-0614-9477

Data Science Institute, University of Chicago, Chicago, Illinois 60637;

Search for more papers by this author

Aaditya Ramdas

[email protected]

https://orcid.org/0000-0003-0497-311X

Department of Statistics and Data Science, Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213

Search for more papers by this author

Published Online:17 Oct 2023https://doi.org/10.1287/opre.2021.0792

Abstract

Consider two forecasters, each making a single prediction for a sequence of events over time. We ask a relatively basic question: how might we compare these forecasters, either online or post hoc, avoiding unverifiable assumptions on how the forecasts and outcomes were generated? In this paper, we present a rigorous answer to this question by designing novel sequential inference procedures for estimating the time-varying difference in forecast scores. To do this, we employ confidence sequences (CS), which are sequences of confidence intervals that can be continuously monitored and are valid at arbitrary data-dependent stopping times (“anytime-valid”). The widths of our CSs are adaptive to the underlying variance of the score differences. Underlying their construction is a game-theoretic statistical framework in which we further identify e-processes and p-processes for sequentially testing a weak null hypothesis—whether one forecaster outperforms another on average (rather than always). Our methods do not make distributional assumptions on the forecasts or outcomes; our main theorems apply to any bounded scores, and we later provide alternative methods for unbounded scores. We empirically validate our approaches by comparing real-world baseball and weather forecasters.

Funding: A. Ramdas acknowledges funding from the National Science Foundation Division of Mathematical Sciences [Grant 1916320]. Research reported in this paper was sponsored in part by the DEVCOM Army Research Laboratory under Cooperative Agreement W911NF-17-2-0196 (ARL IoBT CRA).

Supplemental Material: The e-companion is available at https://doi.org/10.1287/opre.2021.0792.

Volume 72, Issue 4

July-August 2024

Pages iii-vi, 1317-1750, C2-C3

Article Information

Supplemental Material

Metrics

Information

Received:December 20, 2021
Accepted:July 05, 2023
Published Online:October 17, 2023

Cite as

Yo Joong Choe, Aaditya Ramdas (2023) Comparing Sequential Forecasters. Operations Research 72(4):1368-1387.

https://doi.org/10.1287/opre.2021.0792

Keywords

Acknowledgments

The authors thank Alexander Henzi, Johanna F. Ziegel, Rafael M. Frongillo, and the anonymous reviewers for their valuable feedback on this work. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Army Research Laboratory or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation herein. The manuscript was submitted and revised when Y. J. Choe was at Carnegie Mellon University.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Comparing Sequential Forecasters

Abstract

Volume 72, Issue 4

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News