Regret Minimization Under Partial Monitoring

Nicolò Cesa-Bianchi
Nicolò Cesa-Bianchi
[email protected]
Dipartimento de Scienze dell’ Informazione, Università di Milano, Milano, Italy
Search for more papers by this author
,
Gábor Lugosi
Gábor Lugosi
[email protected]
ICREA and Department of Economics, Pompeu Fabra University, Barcelona, Spain
Search for more papers by this author
,
Gilles Stoltz
Gilles Stoltz
[email protected]
CNRS and Département de Mathématiques et Applications, Ecole Normale Supérieure, Paris, France
Search for more papers by this author

Nicolò Cesa-Bianchi

[email protected]

Dipartimento de Scienze dell’ Informazione, Università di Milano, Milano, Italy

Search for more papers by this author

Gábor Lugosi

[email protected]

ICREA and Department of Economics, Pompeu Fabra University, Barcelona, Spain

Search for more papers by this author

Gilles Stoltz

[email protected]

CNRS and Département de Mathématiques et Applications, Ecole Normale Supérieure, Paris, France

Search for more papers by this author

Published Online:1 Aug 2006https://doi.org/10.1287/moor.1060.0206

Abstract

We consider repeated games in which the player, instead of observing the action chosen by the opponent in each game round, receives a feedback generated by the combined choice of the two players. We study Hannan-consistent players for these games, that is, randomized playing strategies whose per-round regret vanishes with probability one as the number n of game rounds goes to infinity. We prove a general lower bound of Ω(n^−1/3) for the convergence rate of the regret, and exhibit a specific strategy that attains this rate for any game for which a Hannan-consistent player exists.

cover image Mathematics of Operations Research

Volume 31, Issue 3

August 2006

Pages 433-648

Article Information

Metrics

Information

Received:April 04, 2005
Published Online:August 01, 2006

Cite as

Nicolò Cesa-Bianchi, Gábor Lugosi, Gilles Stoltz, (2006) Regret Minimization Under Partial Monitoring. Mathematics of Operations Research 31(3):562-580.

https://doi.org/10.1287/moor.1060.0206

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Regret Minimization Under Partial Monitoring

Abstract

Volume 31, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News