The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes
Published Online:1 May 2003https://doi.org/10.1287/moor.28.2.327.14483
References
- Gambling in a rigged casino: The adversarial multi-armed bandit problem. Proc. 36th Annual Sympos. on Foundations of Comput. Sci. (1995) (IEEE Computer Society Press, Los Alamitos, CA) 322–331Google Scholar
- Neuro-Dynamic Programming (1995) (Athena Scientific, Belmont, MA) Google Scholar
- An analog of the minimax theorem for vector payoffs. Pacific J. Math. (1956a) 6(1):1–8Crossref, Google Scholar
- Controlled random walks. Proc. Internat. Congress of Mathematicians, 1954 (1956b) III(North-Holland, Amsterdam, The Netherlands) 336–338Google Scholar
- Universal prediction. IEEE Trans. on Inform. Theory (1998) 44(6):2124–2147Crossref, Google Scholar
- Competitive Markov Decision Processes (1996) (Springer-Verlag, New York) Crossref, Google Scholar
- Simplifying optimal strategies in stochastic games. SIAM J. Control Optim. (1998) 36(4):1331–1347Crossref, Google Scholar
- Adaptive game playing using multiplicative weights. Games and Econom. Behavior (1999) 29:79–103Crossref, Google Scholar
- Universal consistency and cautious fictitious play. J. Econom. Dynamics and Control (1995) 19:1065–1990Crossref, Google Scholar
- The Theory of Learning in Games (1998) (MIT Press, Cambridge, MA) Google Scholar
- , Dresher M., Tucker A. W., Wolde P. Approximation to Bayes risk in repeated play. Contribution to the Theory of Games, III (1957) (Princeton University Press, Princeton, NJ) 97–139Google Scholar
- A simple adaptive procedure leading to correlated equilibrium. Econometrica (2000) 68:1127–1150Crossref, Google Scholar
- A general class of adaptive strategies. J. Econom. Theory (2001) 98:26–54Crossref, Google Scholar
- Reinforcement learning—A survey. J. Artificial Intelligence Res. (1996) 4:237–285Google Scholar
- Stochastic Systems: Estimation, Identification and Adaptive Control (1986) (Prentice Hall, Englewood Cliffs, NJ) Google Scholar
- Reliable communication under channel uncertainty. IEEE Trans. on Inform. Theory (1998) 44:2148–2177Crossref, Google Scholar
- Approachability in infinite dimensional spaces and an application: A universal algorithm for generating extended normal numbers. (1998) . Preprint, MayGoogle Scholar
- The empirical Bayes envelope approach to regret minimization in stochastic games. (2000a) . Technical report No. EE-1262, Faculty of Electrical Engineering, Technion, Haifa, IsraelGoogle Scholar
- Generalized approachability results for stochastic games with a single communicating state. (2000b) . Technical report No. EE-1263, Faculty of Electrical Engineering, Technion, Haifa, IsraelGoogle Scholar
- Stochastic games. Internat. J. Game Theory (1981) 10(2):53–66Crossref, Google Scholar
- Repeated games. (1994) . CORE Reprint Nos. Discussion Papers 9420, 9421, and 9422. Center for Operations Research and Economics, Universite Catholique de Louvain, Louvain, BelgiumGoogle Scholar
- Uniform properties of stochastic games and approachability. (2000) . Unpublished master's thesis, Tel Aviv University, Tel Aviv, IsraelGoogle Scholar
- Stochastic shortest path games. (1997) . Unpublished Ph.D., Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MAGoogle Scholar
- Markov Decision Processes (1994) (Wiley-Interscience, New York) Crossref, Google Scholar
- Convex Analysis (1970) (Princeton University Press, Princeton, NJ) Crossref, Google Scholar
- Minimizing regret: The general case. Games and Econom. Behavior (1999) 29:224–243Crossref, Google Scholar
- Guaranteed performance regions in Markovian systems with competing decision makers. IEEE Trans. on Automatic Control (1993) 38(1):84–95Crossref, Google Scholar
- An approachability condition for general sets. (1999) . Technical Report No. 496, Ecole Polytechnique, Paris, FranceGoogle Scholar
- Convexity and Optimization in Finite Dimensions (1970) I(Springer-Verlag, New York) Crossref, Google Scholar
- Vohra R., Levine D. K., Foster D. Special issue on learning in games. Games and Econom. Behavior (1999) 29(1). see entire issueGoogle Scholar
- A game of prediction with expert advice. J. Comput. Systems Sci. (1998) 56(2):153–173Crossref, Google Scholar

