The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems

P. J. Schweitzer
P. J. Schweitzer
I.B.M. Thomas J. Watson Research Center, Yorktown Heights, New York 10598, and Graduate School of Management, University of Rochester, Rochester, New York 14627
Search for more papers by this author
,
A. Federgruen
A. Federgruen
Foundation Mathematisch Centrum, 2e Boerhaavestraat 49, Amsterdam 1005, The Netherlands
Search for more papers by this author

I.B.M. Thomas J. Watson Research Center, Yorktown Heights, New York 10598, and Graduate School of Management, University of Rochester, Rochester, New York 14627

Search for more papers by this author

A. Federgruen

Foundation Mathematisch Centrum, 2e Boerhaavestraat 49, Amsterdam 1005, The Netherlands

Search for more papers by this author

Published Online:1 Nov 1977https://doi.org/10.1287/moor.2.4.360

Abstract

This paper considers undiscounted Markov Decision Problems. For the general multichain case, we obtain necessary and sufficient conditions which guarantee that the maximal total expected reward for a planning horizon of n epochs minus n times the long run average expected reward has a finite limit as n → ∞ for each initial state and each final reward vector. In addition, we obtain a characterization of the chain and periodicity structure of the set of one-step and J-step maximal gain policies. Finally, we discuss the asymptotic properties of the undiscounted value-iteration method.

cover image Mathematics of Operations Research

Volume 2, Issue 4

November 1977

Pages 297-382

Article Information

Metrics

Information

Published Online:November 01, 1977

Cite as

P. J. Schweitzer, A. Federgruen, (1977) The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems. Mathematics of Operations Research 2(4):360-381.

https://doi.org/10.1287/moor.2.4.360

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems

Abstract

Volume 2, Issue 4

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News