Strategies for Prediction Under Imperfect Monitoring

Gábor Lugosi
ICREA and Department of Economics, Pompeu Fabra University, Barcelona, Spain

Shie Mannor
Department of Electrical and Computer Engineering, McGill University, Montreal, Québec, Canada

Gilles Stoltz
Département de Mathématiques et Applications, Ecole Normale Supérieure, CNRS, Paris, France, and HEC Paris School of Management, CNRS, Jouy-en-Josas, France

Published Online: 1 Aug 2008
https://doi.org/10.1287/moor.1080.0312

Abstract

We propose simple randomized strategies for sequential decision (or prediction) under imperfect monitoring, that is, when the decision maker (forecaster) does not have access to the past outcomes but only to a feedback signal. The proposed strategies are consistent in the sense that they achieve, asymptotically, the best possible average reward among all fixed actions. Rustichini [Rustichini, A. 1999. Minimizing regret: The general case. Games Econom. Behav. 29 224–243] first proved the existence of such consistent predictors. The forecasters presented here offer the first constructive proof of consistency. Moreover, the proposed algorithms are computationally efficient. We also establish upper bounds for the rates of convergence. In the case of deterministic feedback signals, these rates are optimal up to logarithmic terms.
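To make the notion of consistency concrete, the following is a minimal sketch of a randomized forecaster in the simpler full-monitoring setting, where all rewards are observed after each round. It is not one of the strategies proposed in the paper: it uses the standard exponentially weighted forecaster as a stand-in, and it does not model the feedback signal that replaces observed outcomes under imperfect monitoring. The function name `exp_weights_regret`, the parameter `eta`, and the reward sequence are assumptions made for illustration only.

```python
import math
import random


def exp_weights_regret(rewards, eta, seed=0):
    """Average regret of an exponentially weighted randomized forecaster.

    rewards[t][i] is the reward in [0, 1] of action i at round t.
    Full monitoring is assumed here: every reward is revealed after each round.
    """
    rng = random.Random(seed)
    n_actions = len(rewards[0])
    cumulative = [0.0] * n_actions  # cumulative reward of each fixed action
    earned = 0.0                    # cumulative reward of the forecaster

    for round_rewards in rewards:
        # Weights proportional to exp(eta * cumulative reward);
        # subtracting the maximum keeps the exponentials numerically stable.
        top = max(cumulative)
        weights = [math.exp(eta * (c - top)) for c in cumulative]
        action = rng.choices(range(n_actions), weights=weights)[0]
        earned += round_rewards[action]
        cumulative = [c + r for c, r in zip(cumulative, round_rewards)]

    # Regret against the best fixed action in hindsight, averaged over rounds.
    return (max(cumulative) - earned) / len(rewards)


if __name__ == "__main__":
    n_rounds = 2000
    # Hypothetical outcome sequence: action 0 always pays 1, action 1 pays 0.
    rewards = [[1.0, 0.0] for _ in range(n_rounds)]
    eta = math.sqrt(8 * math.log(2) / n_rounds)
    print(exp_weights_regret(rewards, eta))
```

A strategy is consistent, in this simplified sense, when the returned average regret tends to zero as the number of rounds grows. Under imperfect monitoring the cumulative rewards of the fixed actions cannot be computed directly from the feedback, which is the difficulty the abstract refers to.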

Mathematics of Operations Research, Volume 33, Issue 3, August 2008, Pages 513-768


Received: May 29, 2007
Published Online: August 1, 2008

Cite as

Gábor Lugosi, Shie Mannor, Gilles Stoltz (2008) Strategies for Prediction Under Imperfect Monitoring. Mathematics of Operations Research 33(3):513-528.

https://doi.org/10.1287/moor.1080.0312
