Contraction Conditions for Average and α-Discount Optimality in Countable State Markov Games with Unbounded Rewards

E. Altman
E. Altman
INRIA Sophia Antipolis, B.P.93, 2004 Rue des Lucioles, 06902 Sophia Antipolis Cedex, France
Search for more papers by this author
,
A. Hordijk
A. Hordijk
Department of Mathematics and Computer Science, Leiden University, P.O. Box 9512, 2300RA Leiden, The Netherlands
Search for more papers by this author
,
F. M. Spieksma
F. M. Spieksma
Department of Mathematics and Computer Science, Leiden University, P.O. Box 9512, 2300RA Leiden, The Netherlands
Search for more papers by this author

E. Altman

INRIA Sophia Antipolis, B.P.93, 2004 Rue des Lucioles, 06902 Sophia Antipolis Cedex, France

Search for more papers by this author

A. Hordijk

Department of Mathematics and Computer Science, Leiden University, P.O. Box 9512, 2300RA Leiden, The Netherlands

Search for more papers by this author

F. M. Spieksma

Department of Mathematics and Computer Science, Leiden University, P.O. Box 9512, 2300RA Leiden, The Netherlands

Search for more papers by this author

Published Online:1 Aug 1997https://doi.org/10.1287/moor.22.3.588

Abstract

The goal of this paper is to provide a theory of N-person Markov games with unbounded cost, for a countable state space and compact action spaces. We investigate both the finite and infinite horizon problems. For the latter, we consider the discounted cost as well as the expected average cost. We present conditions for the infinite horizon problems for which equilibrium policies exist for all players within the stationary policies, and show that the costs in equilibrium policies exist for all players within the stationary policies, and show that the costs in equilibrium satisfy the optimality equations. Similar results are obtained for the finite horizon costs, for which equilibrium policies are shown to exist for all players within the Markov policies. As special case of N-person games, we investigate the zero-sum (2 players) game, for which we establish the convergence of the value iteration algorithm. We conclude by studying an application of a zero-sum Markov game in a queueing model.

cover image Mathematics of Operations Research

Volume 22, Issue 3

August 1997

Pages 513-768

Article Information

Metrics

Information

Published Online:August 01, 1997

Cite as

E. Altman, A. Hordijk, F. M. Spieksma, (1997) Contraction Conditions for Average and α-Discount Optimality in Countable State Markov Games with Unbounded Rewards. Mathematics of Operations Research 22(3):588-618.

https://doi.org/10.1287/moor.22.3.588

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Contraction Conditions for Average and α-Discount Optimality in Countable State Markov Games with Unbounded Rewards

Abstract

Volume 22, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News