Satisficing in Time-Sensitive Bandit Learning

Daniel Russo
Daniel Russo
[email protected]
https://orcid.org/0000-0001-5926-8624
Columbia Business School, Columbia University, New York, New York 10027;
Search for more papers by this author
,
Benjamin Van Roy
Benjamin Van Roy
[email protected]
Department of Electrical Engineering and Department of Management Science and Engineering, Stanford University, Stanford, California 94305
Search for more papers by this author

Columbia Business School, Columbia University, New York, New York 10027;

Department of Electrical Engineering and Department of Management Science and Engineering, Stanford University, Stanford, California 94305

Search for more papers by this author

Published Online:14 Mar 2022https://doi.org/10.1287/moor.2021.1229

Supplemental Material

moor.2021.1229.sm1.pdf

cover image Mathematics of Operations Research

Volume 47, Issue 4

November 2022

Pages 2547-3399, C2

Article Information

Supplemental Material

Metrics

Information

Received:March 05, 2018
Accepted:December 02, 2020
Published Online:March 14, 2022

Cite as

Daniel Russo, Benjamin Van Roy (2022) Satisficing in Time-Sensitive Bandit Learning. Mathematics of Operations Research 47(4):2815-2839.

https://doi.org/10.1287/moor.2021.1229

Keywords

Acknowledgments

A special thanks is owed to David Tse, who played an important role in the early stages of this work. It was David who first emphasized that bounds based on entropy can be vacuous and pointed us to references on rate-distortion theory. The authors also thank Tor Lattimore for thoughtful comments on an early draft of this work.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Satisficing in Time-Sensitive Bandit Learning

Supplemental Material

Volume 47, Issue 4

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News