Better-Reply Dynamics with Bounded Recall

Published Online:https://doi.org/10.1287/moor.1080.0323

A decision maker is engaged in a repeated interaction with Nature. The objective of the decision maker is to guarantee to himself the average payoff as large as the best-reply payoff to Nature's empirical distribution of play, no matter what Nature does. The decision maker with perfect recall can achieve this objective by a simple better-reply strategy. In this paper we demonstrate that the relationship between perfect recall and bounded recall is not straightforward: The decision maker with bounded recall may fail to achieve this objective, no matter how long his recall and no matter what better-reply strategy he uses.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.