Rate of Convergence of Empirical Measures and Costs in Controlled Markov Chains and Transient Optimality

Eitan Altman
Eitan Altman
INRIA–Sophia Antipolis, 2004 Route des Lucioles, 06565 Valbonne Cedex, France
Search for more papers by this author
,
Ofer Zeitouni
Ofer Zeitouni
Department of Electrical Engineering, Technion-Israel Institute of Technology, Haifa 32000, Israel
Search for more papers by this author

Eitan Altman

INRIA–Sophia Antipolis, 2004 Route des Lucioles, 06565 Valbonne Cedex, France

Search for more papers by this author

Ofer Zeitouni

Department of Electrical Engineering, Technion-Israel Institute of Technology, Haifa 32000, Israel

Search for more papers by this author

Published Online:1 Nov 1994https://doi.org/10.1287/moor.19.4.955

Abstract

The purpose of this paper is two fold. First, bounds on the rate of convergence of empirical measures in controlled Markov chains are obtained under some recurrence conditions. These include bounds obtained through large deviations and central limit theorem arguments. These results are then applied to optimal control problems. Bounds on the rate of convergence of the empirical measures that are uniform over different sets of policies are derived, resulting in bounds on the rate of convergence of the costs. Finally, new optimal control problems that involve not only average cost criteria but also measures on the transient behavior of the cost, namely the rate of convergence, are introduced and applied to a problem in telecommunications. The solution to these problems rely on the bounds introduced in previous sections.

cover image Mathematics of Operations Research

Volume 19, Issue 4

November 1994

Pages 769-1022

Article Information

Metrics

Information

Published Online:November 01, 1994

Cite as

Eitan Altman, Ofer Zeitouni, (1994) Rate of Convergence of Empirical Measures and Costs in Controlled Markov Chains and Transient Optimality. Mathematics of Operations Research 19(4):955-974.

https://doi.org/10.1287/moor.19.4.955

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Rate of Convergence of Empirical Measures and Costs in Controlled Markov Chains and Transient Optimality

Abstract

Volume 19, Issue 4

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News