Estimation Errors as Regret Lower Bounds for Linear Contextual Bandits

Jiahao He
Jiahao He
[email protected]
https://orcid.org/0000-0002-1825-9649
Department of Industrial Engineering and Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong Special Administrative Region, China
Search for more papers by this author
,
Jiheng Zhang
Jiheng Zhang
[email protected]
https://orcid.org/0000-0003-3025-1495
Department of Industrial Engineering and Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong Special Administrative Region, China
Search for more papers by this author
,
Rachel Q. Zhang
Corresponding Author
Rachel Q. Zhang
[email protected]
https://orcid.org/0000-0002-0789-8488
Tsingshan Institute for Advanced Business Studies and School of Management, Zhejiang University, Hangzhou 310058, China
Search for more papers by this author

Department of Industrial Engineering and Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong Special Administrative Region, China

Search for more papers by this author

Jiheng Zhang

[email protected]

https://orcid.org/0000-0003-3025-1495

Department of Industrial Engineering and Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong Special Administrative Region, China

Search for more papers by this author

Rachel Q. Zhang

Corresponding Author

Rachel Q. Zhang

[email protected]

https://orcid.org/0000-0002-0789-8488

Tsingshan Institute for Advanced Business Studies and School of Management, Zhejiang University, Hangzhou 310058, China

Search for more papers by this author

Published Online:2 Jun 2026https://doi.org/10.1287/mnsc.2023.02827

Abstract

Linear contextual bandits and their variants represent a fundamental class of models with wide real-world applications, usually solved using algorithms guided by parameter estimation. The Cauchy-Schwarz inequality established analytically that estimation errors dominate algorithm regrets. Therefore, accurate parameter estimation suffices to guarantee algorithms with low regrets. In this paper, we establish the necessity of accurate estimations in effective algorithms for linear contextual bandit problems by first constructing an estimator for any given algorithm. We then show that algorithm regrets dominate the estimation errors of their induced estimators under mild conditions. In other words, low-regret algorithms must imply accurate estimators, and developing low-regret algorithms is equivalent to finding efficient estimators, either implicitly or explicitly. Thus, our analysis reduces regret lower bounds to estimation errors, bridging lower bound analysis in bandit problems and regression analysis. This provides a framework for finding practical and informative regret lower bounds by leveraging the extensive estimation literature in Statistics. It leads to insightful lower bounds for a variety of contextual bandit problems in the literature, which are either new or tighter than existing ones.

This paper was accepted by J. George Shanthikumar, data science.

Funding: Financial support from the Hong Kong Research Grants Council [Grants 16200821, 16500023, 16500225, and T32-615/24-R] is gratefully acknowledged.

Supplemental Material: The online appendix is available at https://doi.org/10.1287/mnsc.2023.02827.

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:September 05, 2023
Accepted:March 08, 2026
Published Online:June 02, 2026

Cite as

Jiahao He, Jiheng Zhang, Rachel Q. Zhang (2026) Estimation Errors as Regret Lower Bounds for Linear Contextual Bandits. Management Science 0(0).

https://doi.org/10.1287/mnsc.2023.02827

Keywords

Acknowledgments

A preliminary version of this paper (He et al. 2022) appeared in the 39th International Conference on Machine Learning, and the current paper is a significantly enhanced version of it.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Estimation Errors as Regret Lower Bounds for Linear Contextual Bandits

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News