A Central Limit Theorem for Temporally Nonhomogenous Markov Chains with Applications to Dynamic Programming

Alessandro Arlotto
Corresponding Author
Alessandro Arlotto
[email protected]
The Fuqua School of Business, Duke University, Durham, North Carolina, 27708
Search for more papers by this author
,
J. Michael Steele
J. Michael Steele
[email protected]
Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania, 19104
Search for more papers by this author

Alessandro Arlotto

Corresponding Author

Alessandro Arlotto

[email protected]

The Fuqua School of Business, Duke University, Durham, North Carolina, 27708

Search for more papers by this author

J. Michael Steele

[email protected]

Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania, 19104

Search for more papers by this author

Published Online:23 Sep 2016https://doi.org/10.1287/moor.2016.0784

Abstract

We prove a central limit theorem for a class of additive processes that arise naturally in the theory of finite horizon Markov decision problems. The main theorem generalizes a classic result of Dobrushin for temporally nonhomogeneous Markov chains, and the principal innovation is that here the summands are permitted to depend on both the current state and a bounded number of future states of the chain. We show through several examples that this added flexibility gives one a direct path to asymptotic normality of the optimal total reward of finite horizon Markov decision problems. The same examples also explain why such results are not easily obtained by alternative Markovian techniques such as enlargement of the state space.

cover image Mathematics of Operations Research

Volume 41, Issue 4

November 2016

Pages 1161-1207

Article Information

Metrics

Information

Received:May 04, 2015
Published Online:September 23, 2016

Cite as

Alessandro Arlotto, J. Michael Steele (2016) A Central Limit Theorem for Temporally Nonhomogenous Markov Chains with Applications to Dynamic Programming. Mathematics of Operations Research 41(4):1448-1468.

https://doi.org/10.1287/moor.2016.0784

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

A Central Limit Theorem for Temporally Nonhomogenous Markov Chains with Applications to Dynamic Programming

Abstract

Volume 41, Issue 4

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News