On the Relation Between Recurrence and Ergodicity Properties in Denumerable Markov Decision Chains

R. Dekker
Econometric Institute, Erasmus University, P.O. Box 1738, 3000DR Rotterdam, The Netherlands

A. Hordijk
Department of Mathematics and Computer Science, University of Leiden, P.O. Box 9512, 2300RA Leiden, The Netherlands

F. M. Spieksma
Department of Mathematics and Computer Science, University of Leiden, P.O. Box 9512, 2300RA Leiden, The Netherlands

Published Online: 1 Aug 1994
https://doi.org/10.1287/moor.19.3.539

Abstract

This paper studies two properties of the set of Markov chains induced by the deterministic policies in a Markov decision chain. These properties are called μ-uniform geometric ergodicity and μ-uniform geometric recurrence. μ-uniform ergodicity generalises a quasi-compactness condition. It can be interpreted as a strong version of stability, as it implies that the Markov chains generated by the deterministic stationary policies are uniformly stable. μ-uniform geometric recurrence can be shown to be equivalent to the simultaneous Doeblin condition if μ is bounded. Both properties imply the existence of deterministic average and sensitive optimal policies.

The second Key theorem in this paper shows the equivalence of μ-uniform geometric ergodicity and weak μ-uniform geometric recurrence under appropriate continuity conditions.

In the literature, numerous recurrence conditions have been used. The first Key theorem derives the relation between several of these conditions, which, interestingly, turn out to be equivalent in most cases.

Mathematics of Operations Research

Volume 19, Issue 3

August 1994

Pages 513-768

Article Information

Published Online: August 01, 1994

Cite as

R. Dekker, A. Hordijk, F. M. Spieksma (1994) On the Relation Between Recurrence and Ergodicity Properties in Denumerable Markov Decision Chains. Mathematics of Operations Research 19(3):539-559.

https://doi.org/10.1287/moor.19.3.539
